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METHODS OF DIAGNOSIS OF ANGIOGENESIS, COMPOSITIONS 
AND METHODS OF SCREENING FOR ANGIOGENESIS 

MODULATORS 



CROSS-REFERENCES TO RELATED APPLICATIONS 
This application claims priority to USSN 09/784,356, filed February 14 2001; 
10 USSN 09/791,390, filed February 22, 2001 ; USSN 60/285,475, filed April 19, 2001, USSN 
60/310,025, filed August 3, 2001, and USSN 60/334,244, filed November 29, 2001, each of 
which is herein incorporated by reference in its entirety. 

FIELD OF THE INVENTION 
1 5 The invention relates to the identification of nucleic acid and protein 

expression profiles and nucleic acids, products, and antibodies thereto that are involved in 
angiogenesis; and to the use of such expression profiles and compositions in diagnosis and 
therapy of angiogenesis. The invention further relates to methods for identifying and using 
agents and/or targets that modulate angiogenesis. 

20 

BACKGROUND OF THE INVENTION 
Both vasculogenesis, the development of an interactive vascular system 
comprising arteries and veins, and angiogenesis, the generation of new blood vessels, play a 
role in embryonic development. In contrast, angiogenesis is limited in a normal adult to the 

25 placenta, ovary, endometrium and sites of wound healing. However, angiogenesis, or its 

absence, plays an important role in the maintenance of a variety of pathological states. Some 
of these states are characterized by neovascularization, cancer, diabetic retinopathy, 
glaucoma, and age related macular degeneration. Others, e.g., stroke, infertility, heart 
disease, ulcers, and scleroderma, are diseases of angiogenic insufBciency. 

30 Angiogenesis has a nmnber of stages (see, e.g.^ Folkman, J.Natl Cancer Inst 

82:4-6, 1990; Firestein, J Clin Invest 103:3-4, 1999; Koch, Arthritis RheumAl:95U62, 1998; 
Carter, Oncologist 5(Suppl l):51-4, 2000; Browder et at. Cancer Res. 60:1878-86, 2000; and 
Zbu and Witte, Invest New Drugs 17:195-212, 1999). The early stages of angiogenesis 
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include endothelial cell protease production, migration of cells, and proliferation. The early 
stages also appear to require some growth factors, with VEGF, TGF-a, angiostatin, and 
selected chemokines all putatively playing a role. Later stages of angiogenesis include 
population of the vessels with mural cells (pericytes or smooth muscle cells), basement 
5 membrane production, and the induction of vessel bed specializations. The final stages of 
vessel formation include what is laiown as "remodeling", wherein a forming vasculature 
becomes a stable, mature vessel bed. Thus, the process is liighly dynamic, often requiring 
coordinated spatial and temporal waves of gene expression. 

Conversely, the complex process may be subject to disruption by interfering 

10 with one or more critical steps. Thus, the lack of understanding of the dynamics of 

angiogenesis prevents therapeutic intervention in serious diseases such as those indicated. It 
is an object of the invention to provide methods that can be used to screen compounds for the 
ability to modulate angiogenesis. Additionally, it is an object to provide molecular targets for 
therapeutic intervention in disease states which either have an undesirable excess or a deficit 

15 in angiogenesis. The present invention provides solutions to both. 

SUMMARY OF THE INVENTION 
The present invention provides compositions and methods for detecting or 
modidating angiogenesis associated sequences. 
20 In one aspect, the invention provides a method of detecting an angiogenesis- 

associated transcript in a cell in a patient, the method comprising contacting a biological 
sample from the patient with a polynucleotide that selectively hybridized to a sequence at 
least 80% identical to a sequence as shown in Tables 1-8. In one embodiment, the biological 
sample is a tissue sample. In another embodiment, the biological sample comprises isolated 
25 nucleic acids, which are often mRNA. 

In another embodiment, the method further comprises the step of amplifying 
nucleic acids before the step of contacting the biological sample with the polynucleotide. 
Often, the polynucleotide comprises a sequence as shown in Tables 1-8. The polynucleotide 
can be labeled, for example, with a fluorescent label and can be immobilized on a solid 
30 surface. 

In other embodiments the patient is undergoing a therapeutic regimen to treat a 
disease associated with angiogenesis or the patient is suspected of having an angiogenesis- 
associated disorder. 

2 
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In another aspect, the invention comprises an isolated nucleic acid molecule 
consisting of a polynucleotide sequence as shown in Tables 1-8. The nucleic acid molecule 
can be labeled, for example, with a fluorescent label. 

In other aspects, the invention provides an expression vector comprising an 
5 isolated nucleic acid molecule consisting of a polynucleotide sequence as shown in Tables 1- 
8 or a host cell comprising the expression vector. 

In another embodiment, the isolated nucleic acid molecule encodes a 
polypeptide having an amino acid sequence as shown in Table 8. 

In another aspect, the invention provides an isolated polypeptide which is 
10 encoded by a nucleic acid molecule having polynucleotide sequence as shown in Tables 1-8. 
In one embodiment, the isolated polypeptide has an amino acid sequence as shown in Table 
8. 

In another embodiment, the invention provides an antibody that specifically 

binds a polypeptide that has an amino acid sequence as shown in Table 8 or which is encoded 
15 by a nucleotide sequence of Tables 1-8 . The antibody can be conjugated or fused to an 

effector component such as a fluorescent label, a toxin, or a radioisotope. In some 

embodiments, the antibody is an antibody fragment or a humanized antibody. 

In another aspect, the invention provides a method of detecting a cell 

imdergoing angiogenesis in a biological sample from a patient, the method comprising 
20 contacting the biological sample with an antibody that specifically binds to a polypeptide that 

has an amino acid sequence as shown in Table 8 or which is encoded by a nucleotide 

sequence of Tables 1-8 . In some embodiments, the antibody is further conjugated or fused to 

an effector component, for example, a fluorescent label. 

In another embodiment, the invention provides a method of detecting 
25 antibodies specific to angiogenesis in a patient, the method comprising contacting a 

biological sample from the patient with a polypeptide which is encoded by a nucleotide 

sequence of Tables 1-8. 

The invention also provides a method of identifying a compound that 

modulates the activity of an angiogenesis-associated polypeptide, the method comprising the 
30 steps of: (i) contacting the compound witii a polypeptide that comprises at least 80% identity 

to an amino acid sequence as shown in Table 8 or which is encoded by a nucleotide sequence 

of Tables 1-8; and (ii) detecting an increase or a decrease in the activity of the polypeptide. 

In one embodiment, the polypeptide has an amino acid sequence as shown in Table 8 or is a 
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polypeptide encoded by a nucleotide sequence of Tables 1-8. In another embodiment, the 
polypeptide is expressed in a cell. 

The invention also provides a method of identifying a compound that 
modulates angiogenesis, the method comprising steps of: (i) contacting the compound with a 
5 cell undergoing angiogenesis; and (ii) detecting an increase or a decrease in the expression of 
a polypeptide sequence as shown in Table 8 or a polypeptide which is encoded by a 
nucleotide sequence of Tables 1-8. In one embodiment, the detecting step comprises 
hybridizing a nucleic acid sample from the cell with a polynucleotide that selectively 
hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 1-8. In 

1 0 another embodiment, the method further comprises detecting an increase or decrease in the 
expression of a second sequence as shown in Table 8 or a polypeptide which is encoded by a 
nucleotide sequence of Tables 1-8 . 

In another embodiment, the invention provides a method of inhibiting 
angiogenesis in a cell that expresses a polypeptide at least 80% identical to a sequence as 

15 shown in Table 8 or which is 80% identical to a polypeptide encoded by a nucleotide 
sequence of Tables 1-8 , the method comprising the step of contacting the cell with a 
therapeutically effective amoimt of an inhibitor of the polypeptide. In one embodiment, the 
polypeptide has an amino acid sequence shown in Table 8 or is a polypeptide which is 
encoded by a nucleotide sequence of Tables 1-8 . In another embodiment, the inhibitor is an 

20 antibody. 

In other embodiments, the invention provides a method of activating 
angiogenesis in a cell that expresses a polypeptide at least 80% identical to a sequence as 
shown in Table 8 or at least 80% identical to a polypeptide which is encoded by a nucleotide 
sequence of Tables 1-8 , the method comprising the step of contacting the cell with a 
25 therapeutically effective amount of an activator of the polypeptide. In one embodiment, the 
polypeptide has an amino acid sequence shown in Table 8 or is a polypeptide which is 
encoded by a nucleotide sequence of Tables 1-8. 

Other aspects of the invention will become apparent to the skilled artisan by 
the following description of the invention. 



30 



Tables 1-8 provide nucleotide sequence of genes that exhibit changes in 
expression levels as a function of time in tissue undergoing angiogenesis compared to tissue 
that is not. 
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DESCRIPTION OF THE SPECIFIC EMBODIMENTS 
In accordance with the objects outlined above, the present invention provides 
novel methods for diagnosis and treatment of disorders associated with angiogenesis 
(sometimes referred to herein as angiogenesis disorders or AD), as well as methods for 
5 screening for compositions which modulate angiogenesis. By "disorder associated with 

angiogenesis" or "disease associated with angiogenesis'' herein is meant a disease state which 
is marked by either an excess or a deficit of blood vessel development. Angiogenesis 
disorders asociated with increased angiogenesis include, but are not limited to, cancer and 
proliferative diabetic retinopathy. Pathological states for which it may be desirable to 

10 increase angiogenesis include stroke, heart disease, infertility, ulcers, woxmd healing, 

ischemia, and scleradoma. Solid tumors typically require angiogenesis to support or sustain 
growth, e.g., breast, colon, lung, brain, bladder, and prostate tumors. Other AD include, e.g., 
arthritis, inflanamatoiy bowel disease, diabetis retinopathy, macular degeneration, 
atherosclerosis, and psoriasis. Also provided are methods for treating AD. 

IS Definitions 

The temi "angiogenesis protein" or "angiogenesis polynucleotide" refers to 
nucleic acid and polypeptide polymorphic variants, alleles, mutants, and interspecies 
homologs that: (1) have an amino acid sequence that has greater than about 60% amino acid 
sequence identity, 65%, 70%, 75%, 80%, 85%, 90%, preferably 91%, 92%, 93%, 94%, 95%, 

20 96%, 97%, 98% or 99% or greater amino acid sequence identity, preferably over a region of 
over a region of at least about 25, 50, 100, 200, 500, 1000, or more amino acids, to an 
angiogenesis protein sequence of Table 8; (2) bind to antibodies, e.g., polyclonal antibodies, 
raised against an immunogen comprising an anfiino acid sequence of Table 8, and 
conservatively modified variants thereof; (3) specifically hybridize under stringent 

25 hybridization conditions to an anti-sense strand corresponding to a nucleic acid sequence of 
Tables 1-8 and conservatively modified variants thereof; (4) have a nucleic acid sequence 
that has greater than about 95%, preferably greater than about 96%, 97%, 98%, 99%, or 
higher nucleotide sequence identity, preferably over a region of at least about 25, 50, 100, 
200, 500, 1000, or more nucleotides, to a sense sequence corresponding to one set out in 

30 Tables 1-8 . A polynucleotide or polypeptide sequence is typically fi'om a mammal 

including, but not limited to, primate, e.g., human; rodent, e,gy rat, mouse, hamster; cow, pig, 
horse, sheep, or any mammal. An "angiogenesis polypeptide" and an "angiogenesis 
polynucleotide," include both naturally occurring or recombinant. 

5 
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A "full length" angiogenesis protein or nucleic acid refers to an agiogenesis 
polypeptide or polynucleotide sequence, or a variant thereof, that contains all of the elements 
normally contained in one or more naturally occurring, wild type angiogenesis polynucleotide 
or polypeptide sequences. The "full length" may be prior to, or after, various stages of post- 
5 translation processing. 

'TBiological sample" as used herein is a sample of biological tissue or fluid that 
contains nucleic acids or polypeptides, e.g^., of an angiogenic protein. Such samples include, 
but are not limited to, tissue isolated j&om primates, e,g., humans, or rodents, e.g., mice, and 
rats. Biological samples may also include sections of tissues such as biopsy and autopsy 
10 samples, and frozen sections taken for histologic purposes. A biological sample is typically 
obtained from a eukaryotic organism, most preferably a mammal such as a primate e.g., 
chimpanzee or hiraian; cow; dog; cat; a rodent, e.g.y guinea pig, rat, mouse; rabbit; or a bird; 
reptile; or fish. 

"Providing a biological sample" means to obtain a biological sample for use in 
15 methods described in this invention. Most often, this will be done by removing a sample of 
cells from an animal, but can also be accomphshed by using previously isolated cells (e.g., 
isolated by another person, at another time, and/or for another purpose), or by performing the 
methods of the invention in vivo. Archival tissues, having treatment or outcome histroy, will 
be particularly useful. 

20 The terms "identical" or percent "identity," in the context of two or more 

nucleic acids or polj^jeptide sequences, refer to two or more sequences or subsequences that 
are the same or have a specified percentage of amino acid residues or nucleotides that are the 
same (i.e., about 70% identity, preferably 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 
96%, 97%, 98%, 99%, or higher identity over a specified region (e.g., SEQ ID NOS:l-229), 

25 when compared and aligned for maximum correspondence over a comparison window or 
designated region) as measured using a BLAST or BLAST 2.0 sequence comparison 
algorithms with default parameters described below, or by manual alignment and visual 
inspection (see, e,g., NCBI web site http://www.ncbi.nlm.nih.gov/BLAST/ or the like). Such 
sequences are then said to be "substantially identical." This definition also refers to, or may 

30 be applied to, the compliment of a test sequence. The definition also includes sequences that 
have deletions and/or additions, as well as those that have substitutions. As described below, 
the preferred algorithms can account for gaps and the like. Preferably, identity exists over a 
region that is at least about 25 amino acids or nucleotides in length, or more preferably over a 
region that is 50-100 andino acids or nucleotides in length. 

6 
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For sequence comparison, typically one sequence acts as a reference sequence, 
to which test sequences are compared. When using a sequence comparison algorithm, test 
and reference sequences are entered into a computer, subsequence coordinates are designated, 
if necessary, and sequence algorithm program parameters are designated. Preferably, default 
5 program parameters can be used, or alternative parameters can be designated. The sequence 
comparison algorithm then calculates the percent sequence identities for the test sequences 
relative to the reference sequence, based on the program parameters. 

A "comparison window", as used herein, includes reference to a segment of 
any one of the number of contiguous positions selected from the group consisting of from 20 

10 to 600, usually about 50 to about 200, more usually about 100 to about 150 in which a 
sequence may be compared to a reference sequence of the same nimiber of contiguous 
positions after the two sequences are optimally aUgned. Methods of alignment of sequences 
for comparison are well-known in the art. Optimal alignment of sequences for comparison 
can be conducted, e.g., by the local homology algorithm of Smith & Waterman, Adv. AppL 

15 Math, 2:482 (1981), by the homology alignment algorithm of Needleman & Wunsch, J. MoL 
Biol 48:443 (1970), by the search for similarity method of Pearson & Lipman, Proc. Nat'L 
Acad. Set USA 85:2444 (1988), by computerized implementations of these algorithms 
(GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, 
Genetics Computer Group, 575 Science Dr., Madison, WI), or by manual alignment and 

20 visual inspection {see, e.g,. Current Protocols in Molecular Biology (Ausubel et al.y eds. 
1995 supplement)). 

A preferred example of algorithm that is suitable for determining percent 
sequence identity and sequence similarity are the BLAST and BLAST 2.0 algorithms, which 
are described in Altschul et aL, Nuc. Acids Res, 25:3389-3402 (1977) and Altschul et al, J. 

25 MoL Biol. 215:403-410 (1990), respectively. BLAST and BLAST 2.0 are used, with the 

parameters described herein, to determine percent sequence identity for the nucleic acids and 
proteins of the invention. Software for performing BLAST analyses is publicly available 
through the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). 
This algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying 

30 short words of length W in the query sequmce, which either match or satisfy some positive- 
valued threshold score T when aligned with a word of the same length in a database 
sequence. T is referred to as the neighborhood word score threshold (Altschul et al.^ supra). 
These initial neighborhood word hits act as seeds for initiating searches to find longer HSPs 
containing them. The word hits are extended in both directions along each sequence for as 
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far as the cumulative alignment score can be increased. Cumulative scores are calculated 
using, for nucleotide sequences, the parameters M (reward score for a pair of matching 
residues; always > 0) and N (penalty score for mismatching residues; always < 0). For amino 
acid sequences, a scoring matrix is used to calculate the cumulative score. Extension of the 
5 word hits in each direction are halted when: the cumulative alignment score falls off by the 
quantity X from its maximum achieved value; the cumulative score goes to zero or below, 
due to the accumulation of one or more negative-scoring residue alignments; or the end of 
either sequence is reached. The BLAST algorithm parameters W, T, and X determine the 
sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences) 

10 uses as defaults a wordlength (W) of 1 1, an expectation (E) of 10, M=5, N~4 and a 
comparison of both strands. For amino acid sequences, the BLAST? program uses as 
defaults a wordlength of 3, and expectation (E) of 10, and the BLOSUM62 scoring matrix 
(see Henikoff & HenikofF, Proc, Natl Acad. Set USA 89:10915 (1989)) alignments (B) of 
50, expectation (E) of 10, M=5, N=-4, and a comparison of both strands. 

15 The BLAST algorithm also performs a statistical analysis of the similarity 

between two sequences (see, e,g., Karlin & Altschul, Proc. Nat 'I Acad. Scl USA 90:5873- 
5787 (1993)). One measure of similarity provided by the BLAST algorithm is ttie smallest 
sum probability (P(N)), which provides an indication of the probabiUty by which a match 
between two nucleotide or amino acid sequences would occur by chance. For example, a 

20 nucleic acid is considered similar to a reference sequence if the smallest smn probability in a 
comparison of the test nucleic acid to the reference nucleic acid is less than about 0.2, more 
preferably less than about 0.01, and most preferably less than about 0.001. 

An indication that two nucleic acid sequences or polypeptides are substantially 
identical is that the polypeptide encoded by the first nucleic acid is immunologically cross 

25 reactive with the antibodies raised against the polypeptide encoded by the second nucleic 
acid, as described below. Thus, a polypeptide is typically substantially identical to a second 
polypeptide, for example, where the two peptides differ only by conservative substitutions. 
Another indication that two nucleic acid sequences are substantially identical is that the two 
molecules or their complements hybridize to each other under stringent conditions, as 

30 described below. Yet another indication that two nucleic acid sequences are substantially 
identical is that the same primers can be used to amplify the sequences, 

A "host ceir' is a naturally occurring cell or a transformed cell that contains an 
expression vector and supports the replication or expression of the expression vector. Host 
ceUs may be cultured cells, explants, cells in vivo, and the like. Host cells may be 
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prokaryotic cells such as E. coliy or eukaryotic cells such as yeast, insect, amphibian, or 
mammalian cells such as CHO, HeLa, and the like (see, e.g^., the American Type Culture 
Collection catalog or web site, www.atcc.org). 

The terms "polypeptide," "peptide" and "protein" are used interchangeably 
5 herein to refer to a polymer of amino acid residues. The terms apply to amino acid polymers 
in which one or more amino acid residue is an artificial chemical mimetic of a corresponding 
naturally occurring amino acid, as well as to naturally occurring amino acid polymers and 
non-naturally occurring amino acid polymer. 

The term "amino acid" refers to naturally occurring and synthetic amino acids, 

10 as well as amino acid analogs and amino acid mimetics that function in a manner similar to 
the naturally occurring amino acids. Naturally occurring amino acids are those encoded by 
the genetic code, as well as those amino acids that are later modified, e.g., hydroxyproline, y- 
carboxyglutamate, and O-phosphoserine. Amino acid analogs refers to compounds that have 
the same basic chemical structure as a naturally occurring amino acid, i.e., an a carbon that is 

15 boimd to a hydrogen, a carboxyl group, an amino group, and an R group, e.g., homoserine, 
norleucine, methionine sulfoxide, methionine methyl sulfonium. Such analogs have modified 
R groups (e.g., norleucine) or modified peptide backbones, but retain the same basic chemical 
structure as a naturally occurring amino acid. Amino acid numetics refers to chemical 
compounds that have a stmcture that is different fix>m the general chemical structure of an 

20 amino acid, but that functions in a manner similar to a naturally occurring amino acid. 

Amino acids may be referred to herein by either their commonly known three 
letter symbols or by the one-letter symbols recommended by the lUPAC-IUB Biochemical 
Nomenclature Commission. Nucleotides, likewise, may be referred to by their commonly 
accepted single-letter codes. 

25 "Conservatively modified variants" applies to both amino acid and nucleic 

acid sequences. With respect to particular nucleic acid sequences, conservatively modified 
variants refers to those nucleic acids which encode identical or essentially identical airdno 
acid sequences, or where the nucleic acid does not encode an anmio acid sequence, to 
essentially identical sequences. Because of the degeneracy of the genetic code, a large 

30 number of functionally identical nucleic acids encode any given protein. For instance, the 
codons GC A, GCC, GCG and GCU all encode the amino acid alanine. Thus, at every 
position where an alanine is specified by a codon, the codon can be altered to any of the 
corresponding codons described without altering the encoded polypeptide. Such nucleic acid 
variations are "silent variations," which are one species of conservatively modified 
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variations. Every nucleic acid sequence herein which encodes a polypeptide also describes 
every possible silent variation of the nucleic acid. One of skill will recognize that each codon 
in a nucleic acid (except AUG, which is ordinarily the only codon for methionine, and TGG, 
which is ordinarily the only codon for tryptophan) can be modified to yield a functionally 
5 identical molecule. Accordingly, each silent variation of a nucleic acid which encodes a 

polypeptide is implicit in each described sequence with respect to the expression product, but 
not with respect to actual probe sequences. 

As to amino acid sequences, one of skill will recognize that individual 
substitutions, deletions or additions to a nucleic acid, peptide, polypeptide, or protein 

10 sequence which alters, adds or deletes a single amino acid or a small percentage of amino 
acids in the encoded sequence is a "conservatively modified variant" where the alteration 
results in the substitution of an amino acid with a chemically similar amino acid. 
Conservative substitution tables providing fimctionally similar amino acids are well known in 
the art. Such conservatively modified variants are in addition to and do not exclude 

15 polymorphic variants, interspecies homologs, and alleles of the invention. 

The following eight groups each contain amino acids that are conservative 
substitutions for one another:!) Alanine (A), Glycine (G); 2) Aspartic acid (D), Glutamic acid 
(E); 3) Asparagme (N), Glutamine (Q); 4) Arginine (R), Lysine (K); 5) Isoleucine (I), 
Leucine (L), Methionine (M), VaUne (V); 6) Phenylalanine (F), Tyrosine (Y), Tryptophan 

20 (W); 7) Serine (S), Threonine (T); and 8) Cysteine (C), Methionine (M) (see, e.g., Creighton, 
Proteins (1984)). 

Macromolecular structures such as polypeptide stmctures can be described in 
terms of various levels of organization. For a general discussion of this organization, see, 
e.g., Alberts et aL, Molecular Biology of the Cell (3"^ ed., 1994) and Cantor and Schimmel, 

25 Biophysical Chemistry Part I: Ute Conformation of Biological Macromolecules (1980). 
"Primary structure" refers to the amino acid sequence of a particular peptide. "Secondary 
structure" refers to locally ordered, three dimensional stractures within a polypeptide. These 
structures are conamonly known as domains. Domains are portions of a polypeptide that 
form a compact unit of the polypeptide and are typically 25 to approximately 500 amino 

30 acids long. Typical domains are made up of sections of lesser organization such as stretches 
of p-sheet and a-helices. 'Tertiary structure" refers to the complete three dimensional 
structure of a polypeptide monomer. "Quaternary structure" refers to the three dimensional 
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Structure formed, usually by the noncovalent association of independent tertiary units. 
Anisotropic terms are also known as energy tenns. 

A "label" or a "detectable moiety" is a composition detectable by 
spectroscopic, photochemical, biochemical, inmiunochemical, chemical, or other physical 
5 means. For example, useful labels include ^^P, fluorescent dyes, elec6ron-dense reagents, 
enzymes (e-g., as commonly used in an ELISA), biotin, digoxigenin, or haptens and proteins 
which can be made detectable, e.g., by incorporating a radiolabel into the peptide or used to 
detect antibodies specifically reactive with the peptide. 

An "effector" or "effector moiety" or "effector component" is a molecule that 

10 is bound (or linked, or conjugated), either covalently, through a linker or a chemical bond, or 
noncovalently, through ionic, van der Waals, electrostatic, or hydrogen bonds, to an antibody. 
The "effector" can be a variety of molecules including, for example, detection moieties 
including radioactive compounds, fluroescent compounds, an enzyme or substrate, tags such 
as epitope tags, a toxin; a chemotherapeutic agent; a lipase; an antibiotic; or a radioisotope 

15 emitting "hard" e.g., beta radiation. 

A "labeled nucleic acid probe or oligonucleotide" is one that is bound, either 
covalently, flirough a linker or a chemical bond, or noncovalently, through ionic, van der 
Waals, electrostatic, or hydrogen bonds to a label such that the presence of the probe may be 
detected by detecting the presence of the label bound to the probe. Alternatively, method 

20 using high affinity interactions may achieve the same results where one of a pair of binding 
partners binds to the other, e.g., biotin, streptavidin. 

As used herein a "nucleic acid probe or oligonucleotide" is defined as a 
nucleic acid capable of binding to a target nucleic acid of complementary sequence through 
one or more types of chemical bonds, usually through complementary base pairing, usually 

25 through hydrogen bond formation. As used herein, a probe may include natural (i.e.. A, G, C, 
or T) or modified bases (7-deazaguanosine, inosine, etc.). In addition, the bases in a probe 
may be joined by a linkage other than a phosphodiester bond, so long as it does not interfere 
with hybridization. Thus, for example, probes may be peptide nucleic acids in which the 
constituent bases are joined by peptide bonds rather than phosphodiester linJcages. It will be 

30 understood by one of skill in the art that probes may bind target sequences lacking complete 
complementarity with the probe sequence depending iq)on the stringency of the hybridization 
conditions. The probes are preferably directly labeled as with isotopes, chromophores, 
lumiphores, chromogens, or indirectly labeled such as with biotin to which a streptavidin 
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complex may later bind. By assaying for the presence or absence of the probe, one can detect 
the presence or absence of the select sequence or subsequence. 

The term "recombinant" when used with reference, e,g.^ to a cell, or nucleic 
acid, protein, or vector, indicates that the cell, nucleic acid, protein or vector, has been 
5 modified by the introduction of a heterologous nucleic acid or protein or the alteration of a 
native nucleic acid or protein, or that ttie cell is derived from a cell so modified. Thus, for 
example, recombinant cells express genes that are not foimd within the native (non- 
recombinant) form of the cell or express native genes that are otherwise abnormally 
expressed, under expressed or not expressed at all. 

10 The term "heterologous" when used with reference to portions of a nucleic 

acid indicates that the nucleic acid comprises two or more subsequences that are not found in 
the same relationship to each other in nature. For instance, the nucleic acid is typically 
recombinantly produced, having two or more sequences firom unrelated genes arranged to 
make a new functional nucleic acid, a promoter firom one source and a coding region 

15 firom another soiirce. Similarly, a heterologous protein indicates that the protein comprises 
two or more subsequences that are not found in the same relationship to each other in nature 
{e.g., a fiision protein). 

A "promoter" is defined as an array of nucleic acid control sequences that 
direct transcription of a nucleic acid. As used herein, a promoter includes necessary nucleic 

20 acid sequences near the start site of transcription, such as, in the case of a polymerase n type 
promoter, a TATA element. A promoter also optionally includes distal enhancer or repressor 
elements, which can be located as much as several thousand base pairs firom the start site of 
transcription. A "constitutive" promoter is a promoter that is active under most 
environmental and developmental conditions. An "inducible" promoter is a promoter that is 

25 active under environmental or developmental regulation. The term "operably linked" refers 
to a fimctional linkage between a nucleic acid expression control sequence (such as a 
promoter, or array of transcription factor binding sites) and a second nucleic acid sequence, 
wherein the expression control sequence directs transcription of the nucleic acid 
corresponding to the second sequence. 

30 An "expression vector" is a nucleic acid construct, generated recombinantly or 

s)mthetically, with a series of specified nucleic acid elements tiiat permit transcription of a 
particular nucleic acid in a host cell. The expression vector can be part of a plasmid, virus, or 
nucleic acid firagment. Typically, the expression vector includes a nucleic acid to be 
transcribed operably linked to a promoter, 
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The phrase "selectively (or specifically) hybridizes to*' refers to the binding, 
duplexing, or hybridizing of a molecule only to a particular nucleotide sequence under 
stringent hybridization conditions when that sequence is present in a complex mixture (e.g., 
total cellular or library DNA or RNA). 
5 The phrase "stringent hybridization conditions" refers to conditions under 

which a probe will hybridize to its target subsequence, typically in a complex mixture of 
nucleic acids, but to no other sequences. Stringent conditions are sequence-dependent and 
will be different in different circumstances. Longer sequences hybridize specifically at 
higher temperatures. An extensive guide to the hybridization of nucleic acids is found in 

10 Tijssen, Techniques in Biochemistry and Molecular Biology— Hybridization with Nucleic 
Probes, "Overview of principles of hybridization and the strategy of nucleic acid assays" 
(1993). Generally, stringent conditions are selected to be about 5-1 0°C lower than the 
thermal melting point (Tm) for the specific sequence at a defined ionic strength pH. The Tm is 
the temperature (under defined ionic strength, pH, and nucleic concentration) at which 50% 

15 of the probes complementary to the target hybridize to the target sequence at equilibrium (as 
the target sequences are present in excess, at Tm, 50% of the probes are occupied at 
equilibrium). Stringent conditions will be those in which the salt concentration is less than 
about 1.0 M sodium iori, typically about 0.01 to 1.0 M sodium ion concentration (or other 
salts) at pH 7.0 to 8.3 and the temperature is at least about 30^C for short probes (e.g., 10 to 

20 50 nucleotides) and at least about 60°C for long probes (e.g., greater than 50 nucleotides). 
Stringent conditions may also be achieved with the addition of destabilizing agents such as 
formamide. For selective or specific hybridization, a positive signal is at least two times 
background, preferably 10 times background hybridization. Exemplary stringent 
hybridization conditions can be as following: 50% formamide, 5x SSC, and 1% SDS, 

25 incubating at 42'*C, or, 5x SSC, 1% SDS, incubating at 65°C, witii wash in 0.2x SSC, and 
0.1% SDS at 65°C. For PGR, a temperature of about Se^'C is typical for low stringency 
amplification, although annealing temperatures may vary between about 32**C and 48®C 
depending on primer length. For high stringency PGR amplification, a temperature of about 
62**C is typical, although high stringency annealing temperatures can range from about 50°C 

30 to about 65**C, depending on the primer length and specificity. Typical cycle conditions for 
both high and low stringency amplifications include a denaturation phase of 90'^C - 95°C for 
30 sec - 2 min., an annealing phase lasting 30 sec. - 2 min., and an extension phase of about 
72^C for 1 - 2 min. Protocols and guidelines for low and high stringency amplification 
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reactions are provided, e.g., in Innis et al (1990) PCR Protocols, A Guide to Methods and 
Applications, Academic Press, Inc. N.Y.). 

Nucleic acids that do not hybridize to each other under stringent conditions are 
still substantially identical if the polypeptides which they encode are substantially identical. 
5 This occurs, for example, when a copy of a nucleic acid is created using the maximum codon 
degeneracy permitted by the genetic code. In such cases, the nucleic acids typically hybridize 
under moderately stringent hybridization conditions. Exemplary "moderately stringent 
hybridization conditions" include a hybridization in a buffer of 40% formamide, 1 M NaCl, 
1% SDS at 3TC, and a wash in IX SSC at 45''C. A positive hybridization is at least twice 

10 background. Those of ordinary skill will readily recognize that alternative hybridization and 
wash conditions can be utilized to provide conditions of similar stringency. Additional 
guidelines for determining hybridization parameters are provided in numerous reference, e.g., 
and Current Protocols in Molecular Biology, ed. Ausubel, et al 

The phrase "functional effects" in the context of assays for testing compounds 

1 5 that modulate activity of an angiogenesis protein includes the determination of a parameter 
that is indirectly or directly under the influence of the angiogenesis protein, e.g., a functional, 
physical, or chemical effect, such as the ability to increase or decrease angiogenesis. It 
includes binding activity, the ability of cells to proliferate, expression in cells undergoing 
angiogenesis, and other characteristics of angiogenic cells. "Functional effects" include ui 

20 vitro, in vivo, and ex vivo activities. 

By "determiaing the fimctional effect" is meant assaying for a compound that 
increases or decreases a parameter that is indirectly or directly under the influence of an 
angiogenesis protein sequence, e.g^., functional, physical and chemical effects. Such 
functional effects can be measured by any means known to those skilled in the art, e.g., 

25 changes in spectroscopic characteristics (e.g., fluorescence, absorbance, refractive index), 
hydrodynamic {e.g., shape), chromatographic^ or solubility properties for the protein, 
measuring inducible markers or transcriptional activation of the angiogenesis protein; 
measuring binding activity or binding assays, e.g. binding to antibodies, and measuring 
cellular proliferation, particularly endothehal cell proliferation, cell viability, cell division 

30 especially of endothelial cells, lumen formation and capillary or vessel growth or formation. 
Determination of the functional effect of a compound on angiogenesis can also be performed 
using angiogenesis assays known to those of skill in the art such as an in vitro assays, e.g., in 
vitro endothelial cell tube formation assays, and other assays such as the chick CAM assay, 
the mouse corneal assay, and assays that assess vascularization of an implanted tumor. The 
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functional effects can be evaluated by many means known to those skilled in the art, e.g., 
microscopy for quantitative or qualitative measures of alterations in morphological features, 
e.g.y tube or blood vessel fomiation, measurement of changes in RNA or protein levels for 
angiogenesis-associated sequences, measurement of RNA stability, identification of 
5 downstream or reporter gene expression (CAT, luciferase, P-gal, GFP and the like), e.g.^ via 
chemiluminescence, fluorescence, colorimetric reactions, antibody binding, inducible 
markers, and ligand binding assays. 

"Inhibitors", "activators", and "modulators" of angiogenic polynucleotide and 
polypeptide sequences are used to refer to activating, inhibitory, or modulating molecules 

10 identified using m vitro and in vivo assays of angiogenic polynucleotide and polypeptide 
sequences. Inhibitors are compounds that, e.g., bind to, partially or totally block activity, 
decrease, prevent, delay activation, inactivate, desensitize, or down regulate the activity or 
expression of angiogenesis proteins, e.g., antagonists. "Activators" are compounds that 
increase, open, activate, facilitate, enhance activation, sensitize, agonize, or up regulate 

15 angiogenesis protein activity. Inhibitors, activators, or modulators also include genetically 
modified versions of angiogenesis proteins, e.g,, versions Avith altered activity, as well as 
naturally occurring and synthetic ligands, antagonists, agonists, antibodies, small chemical 
molecules and the like. Such assays for inhibitors and activators include, e.g., expressing the 
angiogenic protein in vitro, in cells, or cell membranes, applying putative modulator 

20 compounds, and then determining the functional effects on activity, as described above. 
Activators and inhibitors of angiogenesis can also be identified by incubating angiogenic 
cells with the test compound and deteraiining increases or decreases in the expression of 1 or 
more angiogenesis proteins, e.g., 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 40, 50 or more angiogenesis 
proteins, such as angiogenesis proteins comprising the sequences set out in Table 8. 

25 Samples or assays comprising angiogenesis proteins that are treated with a 

potential activator, inhibitor, or modulator are compared to control samples without the 
inhibitor, activator, or modulator to examine fhe extent of inhibition. Control samples 
(untreated Avith inhibitors) are assigned a relative protein activity value of 100%. Inhibition 
of a polypeptide is achieved when the activity value relative to the control is about 80%, 

30 preferably 50%, more preferably 25-0%. Activation of an angiogenesis polypeptide is 

achieved when the activity value relative to the control (untreated with activators) is 1 10%, 
more preferably 150%, more preferably 200-500% (i.e., two to five fold higher relative to the 
control), more preferably 1000-3000% higher. 
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"Antibody" refers to a polypeptide comprising a jframework region from an 
immunoglobulin gene or fragments thereof that specijQcally binds and recognizes an antigen. 
The recognized immunoglobulin genes include the kappa, lambda, alpha, gamma, delta, 
epsilon, and mu constant region genes^ as well as the myriad immunoglobulin variable region 
5 genes. Light chains are classified as either kappa or lambda. Heavy chains are classified as 
garoma, mu, alpha, delta, or epsilon, which in turn define the immunoglobulin classes, IgG, 
IgM, IgA, IgD and IgE, respectively. Typically, the antigen-binding region of an antibody 
will be most critical in specificity and affinity of binding. 

An exemplary inmiunoglobulin (antibody) structural unit comprises a 

10 tetramer. Each tetramer is composed of two identical pairs of polypeptide chains, each pair 
having one "lighf (about 25 kD) and one "heavy" chain (about 50-70 kD). The N-terminus 
of each chain defines a variable region of about 100 to 1 10 or more amino acids primarily 
responsible for antigen recognition. The temis variable light chain (Vl) and variable heavy 
chain (Vh) refer to these Ught and heavy chains respectively. 

15 Antibodies exist, as intact immunoglobulins or as a mmoiber of well- 

characterized fragments produced by digestion with various peptidases. Thus, for example, 
pepsin digests an antibody below the disulfide linkages in the hinge region to produce F(ab)'2. 
a dimer of Fab which itself is a light chain joined to Vh-Ch1 by a disulfide bond. The F(ab)'2 
may be reduced under mild conditions to break the disulfide linkage in the hinge region, 

20 thereby converting the F(ab)'2 dimer into an Fab' monomer. The Fab' monomer is 

essentially Fab with part of the hinge region (see Fundamental Immunology (Paul ed., 3d ed. 
1993). While various antibody fragments are defined in terms of the digestion of an intact 
antibody, one of skill will appreciate that such fragments may be synthesized de novo either 
chemically or by using recombinant DNA methodology. Thus, the term antibody, as used 

25 herein, also includes antibody fragments either produced by the modification of whole 

antibodies, or those synthesized de novo using recombinant DNA methodologies (e.g^., single 
chain Fv) or those identified using phage display libraries (see^ e.g., McCafferty et aL, Nature 
348:552-554 (1990)) 

For pireparation of antibodies, e.g:, recombinant, monoclonal, or polyclonal 

30 antibodies, many technique known in the art can be used (see, e,g,, Kohler & Milstein, 

Nature 256:495-497 (1975); Kozbor et aL, Immunology Today 4: 72 (1983); Cole et al, pp. 
11 '96 m Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc. (1985); Coligan, 
Current Protocols in Immunology (1991); Harlow & Lane, Antibodies, A Laboratory Manual 
(1988); and Go6m%, Monoclonal Antibodies: Principles and Practice (2d ed. 1986)). 
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Techniques for the production of single chain antibodies (U.S. Patent 4,946,778) can be 
adapted to produce antibodies to polypeptides of this invention. Also, transgenic mice, or 
other organisms such as other mammals, may be used to express humanized antibodies. 
Alternatively, phage display technology can be used to identify antibodies and heteromeric 
5 Fab fragments that specifically bind to selected antigens {see^ e.g^., McCafFerty et a?.. Nature 
348:552-554 (1990); Marks et al. Biotechnology 10:779-783 (1992)). 

A "chimeric antibody" is an antibody molecule in which (a) the constant 
region, or a portion thereof, is altered, replaced or exchanged so that the antigen binding site 
(variable region) is linked to a constant region of a different or altered class, effector function 
10 and/or species, or an entirely different molecule which confCTs new properties to the chimeric 
antibody, e.g., an enzyme, toxin, hormone, growth factor, drug, etc.; or (b) the variable 
region, or a portion thereof, is altered, replaced or exchanged with a variable region having a 
different or altered antigen specificity. 

The detailed description of the invention includes discussion of the following 
1 5 aspects of the invention: Expression of angiogenesis-associated sequences 

Informatics 

Angiogenesis-associated sequences 

Detection of angiogenesis sequence for diagnostic and 
therapeutic applications 

20 Modulators of angiogenesis 

Methods of identifying variant angiogenesis-associated 
sequences 

Administration of pharmaceutical and vaccine compositions 
Kits for use in diagnostic and/or prognostic applications. 

25 Expression of angiogenesis-associated sequences 

In one aspect, the expression levels of genes are determined in different 
patient samples for which diagnosis information is desired, to provide expression profiles. 
An expression profile of a particular sample is essentially a "fingerprint" of the state of the 
sample; while two states may have any particular gene similarly expressed, the evaluation of 

30 a number of genes simultaneously allows the generation of a gene expression profile that is 
unique to the state of the cell. That is, normal tissue may be distinguished from AD tissue. 
By comparing expression profiles of tissue in known different angiogenesis states, 
information regarding which genes are important (including both up- and down-regulation of 
genes) in each of these states is obtained. The identification of sequences that are 
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differentially expressed in angiogenic versus non-angiogenic tissue allows the use of this 
infonnation in a number of ways. For example, a particular treatment regime may be 
evaluated: does a chemotherapeutic drug act to down-regulate angiogenesis, and thus tumor 
growth or recurrence, in a particular patient. Similarly, diagnosis and treatment outcomes 
5 may be done or confirmed by comparing patient samples with the known expression profiles. 
Angiogenic tissue can also be analyzed to determine the stage of angiogenesis in the tissue. 
Furthermore, these gene expression profiles (or individual genes) allow screening of drug 
candidates with an eye to mimicking or altering a particular expression profile; for example, 
screening can be done for drugs that suppress the angiogenic expression profile. This may be 

10 done by making biochips comprising sets of the important angiogenesis genes, which can 
then be used in these screens. These methods can also be done on the protein basis; that is, 
protein expression levels of tlie angiogenic proteins can be evaluated for diagnostic purposes 
or to screen candidate agents. In addition, the angiogenic nucleic acid sequences can be 
administered for gene therapy purposes, including the administration of antisense nucleic 

1 5 acids, or the angiogenic proteins (including antibodies and other modulators thereof) 
administered as therapeutic drugs. 

Thus the present invention provides nucleic acid and protein sequences that 
are differentially expressed in angiogenesis, herein termed "angiogenesis sequences". As 
outlined below, angiogenesis sequences include those that are up-regulated (i.e. expressed at 

20 a higher level) in disorders associated with angiogenesis, as well as those that are down- 
regulated (i.e. expressed at a lower level). In a preferred embodiment, the angiogenesis 
sequences are from humans; however, as will be appreciated by those in the art, angiogenesis 
sequences from other organisms may be useful in animal models of disease and drug 
evaluation; thus, other angiogenesis sequences are provided, from vertebrates, including 

25 mammals, including rodents (rats, mice, hamsters, guinea pigs, etc.), primates, farm animals 
(including sheep, goats, pigs, cows, horses, etc). Angiogenesis sequences from other 
organisms may be obtained using the techniques outlined below. 

Angiogenesis sequences can include both nucleic acid and amino acid 
sequences. In a preferred embodiment, the angiogenesis sequences are recombinant nucleic 

30 acids. By the term "recombinant nucleic acid" herein is meant nucleic acid, origmally formed 
in vitroy in general, by the manipulation of nucleic acid e.g., using polymerases and 
endonucleases, in a form not normally foimd in nature. Thus an isolated nucleic acid, in a 
linear form, or an expression vector formed in vitro by ligating DNA molecules that are not 
normally joined, are both considered recombinant for the purposes of this invention. It is 
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understood that once a recombinant nucleic acid is made and reintroduced into a host cell or 
organism, it will replicate non-recombinantly, i.e. using the in vivo cellular machinery of the 
host cell rather than in vitro manipulations; however, such nucleic acids, once produced 
recombinantly, although subsequently replicated non-recombinantly, are still considered 
5 recombinant for the piuposes of the invention. 

Similarly, a "recombinant protein'* is a protein made using recombinant 
techniques, i.e. through the expression of a recombinant nucleic acid as depicted above. A 
recombinant protein is distinguished from naturally occurring protein by at least one or more 
characteristics. For example, the protein may be isolated or purified away from some or all 

10 of the proteins and compoimds with which it is normally associated in its wild type host, and 
thus may be substantially pure. For example, an isolated protein is unaccompanied by at least 
some of the material with which it is normally associated in its natural state, preferably 
constituting at least about 0.5%, more preferably at least about 5% by weight of the total 
protein in a given sample. A substantially pure protein comprises at least about 75% by 

15 weight of the total protein, with at least about 80% being preferred, and at least about 90% 
being particularly preferred. The definition includes the production of an angiogenesis protein 
from one organism in a different organism or host cell. Altematively, the protein may be 
made at a significantly higher concentration than is normally seen, through the use of an 
inducible promoter or high expression promoter, such that the protein is made at increased 

20 concentration levels. Altematively, the protein may be in a form not normally found in 
nature, as in the addition of an epitope tag or amino acid substitutions, insertions and 
deletions, as discussed below. 

In a preferred embodiment, the angiogenesis sequences are nucleic acids. As 
will be appreciated by those in the art and is more fully outlined below, angiogenesis 

25 sequences are useful in a variety of applications, including diagnostic applications, which will 
detect naturally occurring nucleic acids, as well as screening applications; for example, 
biochips comprising nucleic acid probes to the angiogenesis sequences can be generated. In 
the broadest sense, then, by '"nucleic acid'' or "oligonucleotide" or grammatical equivalents 
herein means at least two nucleotides covalently linked together. A nucleic acid of the 

30 present invention will generally contain phosphodiester bonds, altiiougji in some cases, 
nucleic acid analogs are included that may have altemate backbones, comprising, for 
example, phosphoramidate, phosphorothioate, phosphorodithioate, or O- 
methylphophoroamidite linkages (see Eckstein, OHgonucleotides and Analogues: A Practical 
Approach, Oxford University Press); and peptide nucleic acid backbones and linkages. Other 
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analog nucleic acids include those with positive backbones; non-ionic backbones, and non- 
ribose backbones, including those described in U.S. Patent Nos. 5,235,033 and 5,034,506, 
and Chapters 6 and 7, ASC Symposium Series 580, "Carbohydrate Modifications in 
Antisense Research", Ed. Y.S. Sanghui and P. Dan Cook. Nucleic acids containing one or 
5 more carbocycUc sugars are also included within one definition of nucleic acids. 

Modifications of the ribdse-phosphate backbone may be done for a variety of reasons, for 
example to increase the stability and half-life of such molecules in physiological 
environments or as probes on a biochip. 

As will be appreciated by those in the art, nucleic acid analogs may find use in 
10 the present invention. In addition, mixtures of naturally occurring nucleic acids and analogs 
can be made; altematively, mixtures of different nucleic acid analogs, and mixtures of 
naturally occurring nucleic acids and analogs may be made. 

Particularly preferred are peptide nucleic acids (PNA) which includes peptide 
nucleic acid analogs. These backbones are substantially non-ionic under neutral conditions, in 
1 5 contrast to the highly charged phosphodiester backbone of naturally occurring nucleic acids. 
This results in two advantages. First, the PNA backbone exhibits improved hybridization 
kinetics. PNAs have larger changes in the melting temperatuire (Tm) for mismatched versus 
perfectly matched basepairs. DNA and RNA typically exhibit a 2-4''C drop in T^ for an 
intemal mismatch. With the non-ionic PNA backbone, the drop is closer to 7-9*'C. Similarly, 
20 due to their non-ionic nature, hybridization of the bases attached to these backbones is 

relatively insensitive to salt concentration. In addition, PNAs are not degraded by cellular 
enzymes, and thus can be more stable. 

The nucleic acids may be single stranded or double stranded, as specified, or 
contain portions of both double stranded or single stranded sequence. As will be appreciated 
25 by those in the art, the depiction of a single strand also defines the sequence of the 

complementary strand; thus the sequences described herein also provide the complement of 
the sequence. The nucleic acid may be DNA, both genomic and cDNA, RNA or a hybrid, 
where the nucleic acid may contain combinations of deoxyribo- and ribo-nucleotides, and 
combinations of bases, including uracil, adenine, thymine, cytosine, guanine, inosine, 
30 xanthine hypoxanthine, isocytosine, isoguanine, etc. As used herein, the term "nucleoside'* 
includes nucleotides and nucleoside and nucleotide analogs, and modified nucleosides such 
as amino modified nucleosides. In addition, "nucleoside** includes non-naturally occurring 
analog structures. Thus for example the individual xmits of a peptide nucleic acid, each 
containing a base, are referred to herein as a nucleoside. 
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An angiogenesis sequence can be initially identified by substantial nucleic 
acid and/or amino acid sequence homology to the angiogenesis sequences outlined herein. 
Such homology can be based upon the overall nucleic acid or amino acid sequence, and is 
generally determined as outlined below, using either homology programs or hybridization 
5 conditions. 

For identifying angiogenesis-associated sequences, the angiogenesis screen 
typically includes comparing genes identified in a modification of an in vitro model of 
angiogenesis as described in Hiraoka, Cell 95:365 (1998) with genes identified in controls. 
Samples of normal tissue and tissue undergoing angiogenesis are applied to biochips 

10 comprising nucleic acid probes. The samples are first microdissected, if applicable, and 
treated as is known in the art for the preparation of mRNA. Suitable biochips are 
commercially available, for example firom Affymetrix. Gene expression profiles as described 
herein are generated and the data analyzed. 

In a preferred embodiment, the genes showing changes in expression as 

15 between normal and disease states are compared to genes expressed in other normal tissues, 
including, but not limited to lung, heart, brain, liver, breast, kidney, muscle, prostate, small 
intestine, large intestine, spleen, bone and placenta. In a preferred embodiment, those genes 
identified during the angiogenesis screen that are expressed in any significant amount ia other 
tissues are removed firom the profile, although in some embodiments, this is not necessary. 

20 That is, when screening for dmgs, it is usually preferable that the target be disease specific, to 
nodnimize possible side effects. 

In a preferred embodiment, angiogenesis sequences are those that are up- 
regulated in angiogenesis disorders; that is, the expression of these genes is higher in the 
disease tissue as compared to normal tissue. *Trp-regulation" as used herein means at least 

25 about a two-fold change, preferably at least about a three fold change, with at least about 
five-fold or higher being preferred. All accession numbers herein are for the GenBank 
sequence database and the sequences of the accession numbers are hereby expressly 
incorporated by reference. GenBank is known in the art, see, e.g., Benson, DA, et al.. 
Nucleic Acids Research 26:1-7 (1998) and http://www.ncbi.nlm.nih.gov/. Sequences are also 

30 avialable in other databases, European Molecular Biology Laboratory (EMBL) and 
DNA Database of Japan (DDBJ). In addition, most preferred genes were found to be 
expressed in a limited amoxmt or not at all in heart, brain, lung, liver, breast, kidney, prostate, 
small intestine and spleen. 
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In anotlier preferred embodiment, angiogenesis sequences are those that are 
down-regulated in the angiogenesis disorder; that is, the expression of these genes is lower in 
angiogenic tissue as compared to normal tissue. 'TDown-regulation" as used herein means at 
least about a two-fold change, preferably at least about a three fold change, with at least about 
5 five-fold or higiher being preferred. 

Angiogenesis sequences according to the invention may be classified into 
discrete clusters of sequences based on conmion e:^ression profiles of the sequences. 
Expression levels of angiogenesis sequences may increase or decrease as a function of time in 
a manner that correlates with the induction of angiogenesis. Alternatively, expression levels 

10 of angiogenesis sequences may both increase and decrease as a function of time. For 
example, expression levels of some angiogenesis sequences are temporarily induced or 
diminished during the switch to the angiogenesis phenotype, followed by a return to baseline 
expression levels. Tables 1-8 provides genes, the mRNA expression of which varies as a 
function of time in angiogenesis tissue when compared to normal tissue. 

15 In a particularly preferred embodiment, angiogenesis sequences are those that 

are induced for a period of time, typically by positive angiogenic factors, followed by a return 
to the baseline levels. Sequences that are temporarily induced provide a means to target 
angiogenesis tissue, for example neovascularized tumors, at a particular stage of 
angiogenesis, while avoiding rapidly growing tissue that require perpetual vascularization. 

20 Such positive angiogenic factors include aFGF, PFGF, VEGF, angiogenin and the like. 

Induced angiogenesis sequences also are further categorized with respect to 
the timing of induction. For example, some angiogenesis genes may be induced at an early 
time period, such as within 10 minutes of the induction of angiogenesis. Others may be 
induced later, such as between 5 and 60 minutes, while yet others may be induced for a time 

25 period of about two hours or more followed by a return to baseline expression levels. 

In another preferred embodiment are angiogenesis sequences that are inhibited 
or reduced as a function of time followed by a return to "normal" expression levels. 
Inhibitors of angiogenesis are examples of molecules that have this expression profile. These 
sequences also can be further divided into groups depending on the timing of diminished 

30 expression. For example, some molecules may display reduced expression within 10 minutes 
of the induction of angiogenesis. Others may be diminished later, such as between 5 and 60 
minutes, while others may be diminished for a time period of about two hoxurs or more 
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followed by a return to baseline. Examples of such negative angiogenic factors include 
thrombospondin and endostatin to name a few. 

In yet another preferred embodiment are angiogenesis sequences that are 
induced for prolonged periods. These sequences are typically associated with induction of 
5 angiogenesis and may participate in induction and/or maintenance of the angiogenesis 
phenotype. 

In another preferred embodiment are angiogenesis sequences, the expression 
of which is reduced or diminished for prolonged periods in angiogenic tissue. These 
sequences are typically angiogenesis inhibitors and their diminution is correlated with an 
1 0 increase in angiogenesis. 

Informatics 

The abihty to identify genes that undergo changes in expression with time 
during angiogenesis can additionally provide high-resolution, high-sensitivity datasets which 

1 5 can be used in the areas of diagnostics, therapeutics, drug development, biosensor 

development, and other related areas. For example, the expression profiles can be used in 
diagnostic or prognostic evaluation of patients with angiogenesis-associated disease. Or as 
another example, subcellular toxicological information can be generated to better direct drag 
structure and activity correlation {see^ Anderson, L., ''Pharmaceutical Proteomics: Targets, 

20 Mechanism, and Function," paper presented at the IBC Proteomics conference, Coronado, 
CA (Jxme 11-12, 1998)). Subcellular toxicological information can also be utilized in a 
biological sensor device to predict the likely toxicological effect of chemical exposures and 
likely tolerable exposure thresholds {see, U.S. Patent No. 5,81 1,231). Similar advantages 
accrue from datasets relevant to other biomolecules and bioactive agents (e.g., nucleic acids, 

25 saccharides, lipids, drugs, and the like). 

Thus, in another embodiment, the present invention provides a database that 
includes at least one set of data assay data. The data contained in the database is acquired , 
e.g. 9 using array analysis either singly or in a library format. The database can be in 
substantially any form in which data can be maintained and transmitted, but is preferably an 

30 electronic database. The electronic database of the invention can be maintained on any 
electronic device allowing for the storage of and access to the database, such as a personal 
computer, but is preferably distributed on a wide area.network, such as the World Wide Web. 
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The focus of the present section on databases that include peptide sequence 
data is for clarity of illustration only. It will be apparent to those of skill in the art that similar 
databases can be assembled for any assay data acquired using an assay of the invention. 

The compositions and methods for identifying and/or quantitating the relative 
5 and/or absolute abimdance of a variety of molecular and macromolecular species from a 

biological sample undergoing angiogenesis, Le., the identification of angiogenesis-associated 
sequences described herein, provide an abundance of information, which can be correlated 
with pathological conditions, predisposition to disease, drug testing, therapeutic monitoring, 
gene-disease causal linkages, identification of correlates of immunity and physiological 

10 status, among others. Although the data generated from the assays of the invention is suited 
for manual review and analysis, in a preferred embodiment, prior data processing using high- 
speed computers is utilized. 

An array of methods for indexing and retrieving biomolecular information is 
known in the art. For example, U.S. Patents 6,023,659 and 5,966,712 disclose a relational 

1 5 database system for storing biomolecular sequence information in a manner that allows 
sequences to be catalogued and searched according to one or more protein fimction 
hierarchies. U.S. Patent 5,953,727 discloses a relational database having sequence records 
containing information in a format that allows a collection of partial-length DNA sequences 
to be catalogued and searched according to association with one or more sequencing projects 

20 for obtaining fiiU-length sequences firom the collection of partial length sequences. U.S. 

Patent 5,706,498 discloses a gene database retrieval system for making a retrieval of a gene 
sequence similar to a sequence data item in a gene database based on the degree of similarity 
between a key sequence and a target sequence. U.S. Patent 5,538,897 discloses a method 
using mass spectroscopy fragmentation patterns of peptides to identify amino acid sequences 

25 in computer databases by comparison of predicted mass spectra with experimentally-derived 
mass spectra using a closeness*of-fit measure. U.S. Patent 5,926,818 discloses a multi- 
dimensional database comprising a fimctionality for multi-dimensional data analysis 
described as on-line analytical processing (OLAP), which entails the consolidation of 
projected and actual data according to more than one consolidation path or dimension. U.S. 

30 Patent 5,295,261 reports a hybrid database structure in which the fields of each database 
record are divided into two classes, navigational and informational data, with navigational 
fields stored in a hierarchical topological map which can be viewed as a tree structure or as 
the merger of two or more such tree structures. 
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The present invention provides a computer database comprising a computer 
and software for storing in computer-retrievable form assay data records cross-tabulated, e.g.^ 
with data specifying the source of the target-containing sample from which each sequence 
specificity record was obtained. 
5 In an exemplary embodiment, at least one of the soxirces of target-containing 

sample is from a control tissue sample known to be free of pathological disorders. In a 
variation, at least one of the sources is a known pathological tissue specimen, e.g., a 
neoplastic lesion or another tissue specimen to be analyzed for angiogenesis. In another 
variation, the assay records cross-tabulate one or more of the following parameters for each 

10 target species in a sample: (1) a imique identification code, which can include, e.g., a target 
molecular structure and/or characteristic separation coordinate (e.g., electrophoretic 
coordinates); (2) sample source; and (3) absolute and/or relative quantity of the target species 
present in the sample. 

The invention also provides for the storage and retrieval of a collection of 

1 S target data in a computer data storage apparatus, which can include magnetic disks, optical 
disks, magneto-optical disks, DRAM, SRAM, SGRAM, SDRAM, RDRAM, DDR RAM, 
magnetic bubble memory devices, and other data storage devices, including CPU registers 
and on-CPU data storage arrays. Typically, the target data records are stored as a bit pattern 
in an array of magnetic domains on a magnetizable mediimi or as an array of charge states or 

20 transistor gate states, such as an array of cells in a DRAM device each cell comprised of 
a transistor and a charge storage area, which may be on the transistor). In one embodiment, 
the invention provides such storage devices, and computer systems built therewith, 
comprising a bit pattern encoding a protein expression fingerprint record comprising unique 
identifiers for at least 10 target data records cross-tabulated with target source. 

25 When the target is a peptide or nucleic acid, the invention preferably provides 

a method for identifying related peptide or nucleic acid sequences, comprising performing a 
computerized comparison between a peptide or nucleic acid sequence assay record stored in 
or retrieved from a computer storage device or database and at least one other sequence. The 
comparison can include a sequence analysis or comparison algorithm or computer program 

30 embodiment thereof FASTA, TFASTA, GAP, BESTFIT) and/or the comparison may 
be of the relative amoimt of a peptide or nucleic acid sequence in a pool of sequences 
determined from a polypeptide or nucleic acid sample of a specimen. 

The invention also preferably provides a magnetic disk, such as an IBM- 
compatible (DOS, Windows, Windows95/98/2000, Windows NT, OS/2) or other format 



BNSDOCID: <WO 0207©492A2_L> 



wo 02/079492 PCT/US02/0491 5 

/ 

(e.g., Linux, SunOS, Solaris, AIX, SCO Unix, VMS, MV, Macintosh, eta} floppy diskette or 
hard (fixed, Winchester) disk drive, comprising a bit pattern encoding data fi-om an assay of 
the invention in a file format suitable for retrieval and processing in a computerized sequence 
analysis, comparison, or relative quantitation method. 
5 The invention also provides a network, comprising a plurality of computing 

devices linked via a data link, such as an Ethemet cable (coax or lOBaseT), telephone line, 
ISDN line, wireless network, optical fiber, or other suitable signal tranmission medium, 
whereby at least one network device (e.g., computer, disk array, etc.) comprises a pattem of 
magnetic domains {e.g., magnetic disk) and/or charge domains (e.g., an array of DRAM 

10 cells) composing a bit pattem encoding data acquired from an assay of the invention. 

The invention also provides a method for transmitting assay data that includes 
generating an electronic signal on an electronic communications device, such as a modem, 
ISDN terminal adapter, DSL, cable modem, ATM switch, or the like, wherein the signal 
includes (in native or encrypted format) a bit pattem encoding data fi-om an assay or a 

1 5 database comprising a plurality of assay results obtained by the method of the invention. 

In a preferred embodiment, the invention provides a computer system for 
comparing a query target to a database containing an array of data structures, such as an assay 
result obtained by the method of the invention, and ranking database targets based on the 
degree of identity and gap weight to the target data. A central processor is preferably 

20 initialized to load and execute the computer program for ahgnment and/or comparison of the 
assay results. Data for a query target is entered into the central processor via an I/O device. 
Execution of the computer program results in the central processor retrieving the assay data 
from the data file, which comprises a binary description of an assay result. 

The target data or record and the computer program can be transferred to 

25 secondary memory, which is typically random access memory (e.g., DRAM, SRAM, 
SGRAM, or SDRAM). Targets are ranked according to the degree of correspondence 
between a selected assay characteristic (e.g., binding to a selected affinity moiety) and the 
same characteristic of the query target and results are output via an I/O device. For example, 
a central processor can be a conventional computer (e.g., Intel Pentium, PowerPC, Alpha, 

30 PA-8000, SPARC, MIPS 4400, MIPS 10000, VAX, etc.); a program can be a commercial or 
public domain molecular biology software package (e.g., UWGCG Sequence Analysis 
Software, Darwin); a data file can be an optical or magnetic disk, a data server, a memory 
device (e.g., DRAM, SRAM, SGRAM, SDRAM, EPROM, bubble memory, flash memory, 
etc.); an I/O device can be a terminal comprising a video display and a keyboard, a modem, 
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an ISDN terminal adapter, an Ethernet port, a punched card reader, a magnetic strip reader, or 
other suitable I/O device. 

The invention also preferably provides the use of a computer system, such as 
that described above, which comprises: (1) a computer; (2) a stored bit pattern encoding a 
5 collection of peptide sequence specificity records obtained by the methods of the invention, 
which may be stored in the computer; (3) a comparison target, such as a query target; and (4) 
a program for alignment and comparison, typically with rank-ordering of comparison results 
on the basis of computed similarity values. 

10 Angiogenesis-associated sequences 

Angiogenesis proteins of the present invention may be classified as secreted 
proteins, transmembrane proteins or intracellular proteins. In one embodiment,the 
angiogenesis protein is an intracellular protein. Intracellular proteins may be foimd in the 
cj^oplasm and/or in the nucleus or associated with the intracellular side of the plasma 

1 5 membrane. Intracellular proteins are involved in all aspects of cellular function and 

replication (including, e.^., signaling pathways); aberrant expression of such proteins often 
results in unregulated or disregulated cellular processes (see, e.g,. Molecular Biology of the 
Cell, 3rd Edition, Alberts, Ed., Garland Pub., 1994). For example, many intracellular 
proteins have enzymatic activity such as protein kinase activity, protein phosphatase activity, 

20 protease activity, nucleotide cyclase activity, polymerase activity and the like. Intracellular 
proteins also serve as docking proteins that are involved in organizing complexes of proteins, 
or targeting proteins to various subcellular localizations, and are involved in maintaining the 
structural integrity of organelles. 

An increasingly appreciated concept in characterizing proteins is the presence 

25 in the proteins of one or more motifs for which defined functions have been attributed. In 

addition to the highly conserved sequences found in the enzymatic domain of proteins, highly 
conserved sequences have been identified in proteins that are involved in protein-protein 
interaction. For example, Src-homology-2 (SH2) domains bind tyrosine-phosphorylated 
targets in a sequence dependent manner. PTB domains, which are distinct firom SH2 

30 domains, also bind tyrosine phosphorylated targets. SH3 domains bind to proline-rich 

targets. In addition, PH domains, tetratricopeptide repeats and WD domains to name only a 
few, have been shown to mediate protein-protein interactions. Some of these may also be 
involved in binding to phospholipids or other second messengers. As will be appreciated by 
one of ordinary skill in the art, these motifs can be identified on the basis of primary 
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sequence; thus, an analysis of the sequence of proteins may provide insight into both the 
enzymatic potential of the molecule and/or molecules with which the protein may associate. 

In another embodiment, the angiogenesis sequences are transmembrane 
proteins. Transmembrane proteins are molecules that span a phosphohpid bilayer of a cell. 
5 They may have an intracellular domain, an extracellular domain, or both. The intracellular 
domains of such proteins may have a number of functions including those already described 
for intracellular proteins. For example, the intracellular domain may have enzymatic activity 
and/or may serve as a binding site for additional proteins. Frequently the intracellular 
domain of transmembrane proteins serves both roles. For example certain receptor tyrosine 

10 kinases have both protein kinase activity and SH2 domains. In addition, autophosphorylation 
of tyrosines on the receptor molecule itself, creates binding sites for additional SH2 domain 
containing proteins. 

Transmembrane proteins may contain from one to many transmembrane 
domains. For example, receptor tyrosine kinases, certain cytokine receptors, receptor 

1 5 guanylyl cyclases and receptor serine/threonine protein kinases contain a single 

transmembrane domain. However, various other proteins including channels and adenylyl 
cyclases contain numerous transmembrane domains. Many important cell surface receptors 
such as G protein coupled receptors (GPCRs) are classified as "seven transmembrane 
domain" proteins, as they contain 7 membrane spanning regions. Characteristics of 

20 transmembrane domains include approximately 20 consecutive hydrophobic amino acids that 
may be followed or flanked by charged amino acids. Therefore, upon analysis of the amino 
acid sequence of a particular protein, the localization and number of transmembrane domains 
within the protein may be predicted (see, e.g, PSORT web site http://psort.iiibb.ac.jp/). 

The extracellular domains of transmembrane proteias are diverse; however, 

25 conserved motifs are found repeatedly among various extracellular domains. Conserved 
structure and/or functions have been ascribed to different extracellular motifs. Many 
extracellular domains are involved in binding to other molecules. In one aspect, extracellular 
domains are found on receptors. Factors that bind the receptor domain include circulating 
ligands, which may be peptides, proteins, or small molecules such as adenosine and the like. 

30 For example, growth factors such as EOF, FGF and PDGF are circulating growth factors that 
bind to their cognate receptors to initiate a variety of cellular responses. Other factors include 
cytokines, mitogenic factors, neurotrophic factors and the like. Extracellular domains also 
bind to cell-associated molecules. In this respect, they mediate cell-cell interactions. Cell- 
associated ligands can be tethered to the cell for example via a glycosylphosphatidylinositol 
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(GPI) anchor, or may themselves be transmembrane proteins. Extracellular domains also 
associate with the extracellular matrix and contribute to the maintenance of the cell structure. 

Angiogenesis proteins that are transmembrane are particularly preferred in the 
present invention as they are readily accessible targets for inmiunotherapeutics, as are 
5 described herein. In addition, as outlined below, transmembrane proteins can be also useful 
in imaging modalities. Antibodies may be used to label such readily accessible proteins in 
situ. Alternatively, antibodies can also label intracellular proteins, in which case samples are 
typically peimeablized to provide acess to intracellular proteins. 

It will also be appreciated by those in the art that a transmembrane protein can 

10 be made soluble by removing transmembrane sequences, for example through recombinant 
methods. Furthermore, transmembrane proteins that have been made soluble can be made to 
be secreted through recombinant means by adding an appropriate signal sequence. 

In another embodiment, the angiogenesis proteins are secreted proteins; the 
secretion of which can be either constitutive or regulated. These proteins have a signal 

1 5 peptide or signal sequence that targets the molecule to the secretory pathway. Secreted 

proteins are involved in numerous physiological events; by virtue of their circulating nature, 
they serve to transmit signals to various other cell types. The secreted protein may function in 
an autocrine manner (acting on the cell ttiat secreted the factor), a paracrine manner (acting 
on cells in close proximity to the cell that secreted the factor) or an endocrine maimer (acting 

20 on cells at a distance). Thus secreted molecules find use in modulating or altering numerous 
aspects of physiology. Angiogenesis proteins that are secreted proteins are particularly 
preferred in the present invention as they serve as good targets for diagnostic markers, e.g.^ 
for blood or serum tests. 

An angiogenesis sequence is typically initially identified by substantial nucleic 

25 acid and/or amino acid sequence homology or linkage to the angiogenesis sequences outlined 
herein. Such homology can be based upon the overall nucleic acid or amino acid sequence, 
and is generally determined as outlined below, using either homology programs or 
hybridization conditions. Typically, linked sequences on a mRNA are found on the same 
molecule. 

30 As detailed in flie definitions, percent identity can be determined using an 

algorithm such as BLAST. A preferred method utilizes the BLASTN module of WU- 
BLAST-2 set to the default parameters, with overlap span and overlap firaction set to 1 and 
0.125, respectively. The aligmnent may include the introduction of gaps in the sequences to 
be aligned. In addition, for sequences which contain either more or fewer nucleotides than 
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those of the nucleic acids of the figures, it is understood that the percentage of homology will 
be determined based on the number of homologous nucleosides in relation to the total number 
of nucleosides. Thus, for example, homology of sequences shorter than those of the 
sequences identified herein and as discussed below, will be determined using the number of 
5 nucleosides in the shorter sequence. 

In one embodiment, the nucleic acid homology is determined through 
hybridization studies. Thus, e.g., nucleic acids which hybridize under high stringency to a 
nucleic acid of Tables 1-8 , or its complement, or is also found on naturally occurring 
mRNAs is considered an angiogenesis sequence. In another embodiment, less stringent 

10 hybridization conditions are used; for example, moderate or low stringency conditions may 
be used, as are known in the art; see Ausubel, supra, and Tijssen, supra. 

In addition, the angiogenesis nucleic acid sequences of the invention, e.g, the 
sequence in Tables 1-8 , are firagments of larger genes, i.e. they are nucleic acid segments. 
"Genes" in this context includes coding regions, non-coding regions, and mixtures of coding 

15 and non-coding regions. Accordingly, as will be appreciated by those in the art, using the 

sequences provided herein, extended sequences, in either direction, of the angiogenesis genes 
can be obtained, using techniques well known in the art for cloning either longer sequences or 
the fiill length sequences; see Ausubel, et aL, supra. Much can be done by informatics and 
many sequences can be clustered to include multiple sequences, e.g., systems such as 

20 UniGene (see, http://www.ncbi.nlm.nih.gov/UniGene/). 

Once the angiogenesis nucleic acid is identified, it can be cloned and, if 
necessary, its constituent parts recombined to form the entire angiogenesis nucleic acid 
coding regions or the entire mRNA sequence. Once isolated firom its natural source, e.g., 
contained within a plasmid or other vector or excised therefi-om as a linear nucleic acid 

25 . segment, the recombinant angiogenesis nucleic acid can be further-used as a probe to identify 
and isolate other angiogenesis nucleic acids, for example extended coding regions. It can 
also be used as a "precursor" nucleic acid to make modified or variant angiogenesis nucleic 
acids and proteins. 

The angiogenesis nucleic acids of the present invention are used in several 
30 ways. In a first embodiment, nucleic acid probes to the angiogenesis nucleic acids are made 
and attached to biochips to be used in screening and diagnostic methods, as outlined below, 
or for administration, for example for gene therapy, vaccine, and/or antisense applications. 
Altematively, the angiogenesis nucleic acids that include coding regions of angiogenesis 
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proteins can be put into expression vectors for the expression of angiogenesis proteins, again 
for screening purposes or for administration to a patient. 

In a preferred embodiment, nucleic acid probes to angiogenesis nucleic acids 
(both the nucleic acid sequences outlined in the figures and/or the complements thereof) are 
5 made. The nucleic acid probes attached to the biochip are designed to be substantially 
complementary to the angiogenesis nucleic acids, Le. the target sequence (either the target 
sequence of the sample or to other probe sequences, for example in sandwich assays), such 
that hybridization of the target sequence and the probes of the present invention occurs. As 
outlined below, this complementarity need not be perfect; there may be any nxunber of base 

10 pair nfiismatches which will interfere with hybridization between the target sequence and the 
single stranded nucleic acids of the present invention. However, if the number of mutations 
is so great that no hybridization can occur under even the least stringent of hybridization 
conditions, the sequence is not a complementary target sequence. Thus, by "substantially 
complementary" herein is meant that the probes are sufficiently complementary to the target 

15 sequences to hybridize under normal reaction conditions, particularly high stringency 
conditions, as outlined herein. 

A nucleic acid probe is generally single stranded but can be partially single 
and partially double stranded. The strandedness of the probe is dictated by the stracture, 
composition, and properties of the target sequence. In general, the nucleic acid probes range 

20 firom about 8 to about 100 bases long, with firom about 10 to about 80 bases being preferred, 
and from about 30 to about 50 bases being particularly preferred. That is, generally whole 
genes are not used. In some embodiments, much longer nucleic acids can be used, up to 
himdreds of bases. 

In a preferred embodiment, more than one probe per sequence is used, with 
25 either overlapping probes or probes to different sections of the target being used. That is, 

two, three, four or more probes, with three being preferred, are used to build in a redundancy 
for a particular target. The probes can be overlapping (/.e. have some sequence in conomon), 
or separate. In some cases, PGR primers may be used to amplify signal for higher sensitivity. 

As will be appreciated by those in the art, nucleic acids can be attached or 
30 immobilized to a solid support in a wide variety of ways. By "immobilized" and grammatical 
equivalents herein is meant the association or binding between the nucleic acid probe and flie 
solid support is sufficient to be stable under the conditions of binding, washing, analysis, and 
removal as outlined below. The binding can typically be covalent or non-covalent. By "non- 
covalent binding" and grammatical equivalents herein is meant one or more of electrostatic, 
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hydrophilic, and hydrophobic interactions. Included in non-covalent binding is the covalent 
attachment of a molecule, such as, streptavidin to the support and the non-covalent binding of 
the biotinylated probe to the streptayidin. By "povalent binding" and grairunatical 
equivalents herein is meant that the two moieties, the solid support and the probe, are 
5 attached by at least one bond, including sigma bonds, pi bonds and coordination bonds. 
Covalent bonds can be formed directly between the probe and the sohd support or can be 
formed by a cross hnker or by inclusion of a specific reactive group on either the solid 
support or the probe or both molecules. Immobilization may also involve a combination of 
covalent and non-covalent interactions. 
10 In general, the probes are attached to the biochip in a wide variety of ways, as 

will be appreciated by those in the art. As described herein, the nucleic acids can either be 
synthesized first, with subsequent attachment to the biochip, or can be directly synthesized on 
the biochip. 

The biochip comprises a suitable solid substrate. By "substrate" or "sohd 

15 support" or other grammatical equivalents herein is meant a material that can be modified to 
contain discrete individual sites appropriate for the attachment or association of the nucleic 
acid probes and is amenable to at least one detection method. As will be appreciated by those 
in the art, the number of possible substrates are very large, and include, but are not limited to, 
glass and modified or fimctionalized glass, plastics (including acrylics, polystyrene and 

20 copolymers of styrene and other materials, polypropylene, polyethylene, polybutylene, 

polyurethanes, TeflonJ, etc.), polysaccharides, nylon or nitrocellulose, resins, silica or sihca- 
based materials including silicon and modified silicon, carbon, metals, inorganic glasses, 
plastics, etc. In general, the substrates allow optical detection and do not appreciably 
fiuorescese. A preferred substrate is described in copending appUcation entitled Reusable 

25 Low Fluorescent Plastic Biochip, U.S. Application Serial No. 09/270,214, filed March 15, 
1999, herein incorporated by reference in its entirety. 

Generally the substrate is planar, although as will be appreciated by those in 
the art, other configurations of substrates may be used as well. For example, the probes may 
be placed on the inside surface of a tube, for flow-through sample analysis to minimize 

30 sample volume. Similarly, the substrate may be flexible, such as a flexible foam, including 
closed cell foams made of particular plastics. 

In a preferred embodiment, the surface of the biochip and the probe may be 
derivatized with chemical functional groups for subsequent attachment of the two. Thus, for 
example, the biochip is derivatized with a chemical fimctional group including, but not 
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limited to, amino groups, carboxy groups, oxo groups and thiol groups, with amino groups 
being particularly preferred. Using these functional groups, the probes can be attached using 
functional groups on the probes. For example,. nucleic acids containing amino groups can be 
attached to surfaces comprising amino groups, for example using linkers as are known in the 
5 art; for example, homo-or hetero-bifimctional linkers as are well known (see 1994 Pierce 
Chemical Company catalog, technical section on cross-linkers, pages 155-200, incorporated 
herein by reference). In addition, in some cases, additional linkers, such as alkyl groups 
(including substituted and heteroalkyl groups) may be used. 

In this embodiment, ohgonucleotides are synthesized as is known in the art, 

10 and then attached to the surface of the solid support. As will be appreciated by those skilled 
in the art, either the 5' or 3' terminus may be attached to the solid support, or attachment may 
be via an intemal nucleoside. 

In another embodiment, the immobilization to the solid support may be very 
strong, yet non-covalent. For example, biotinylated oligonucleotides can be made, which 

1 5 bind to surfaces covalently coated with streptavidin, resulting in attachment. 

Alternatively, the oligonucleotides may be synthesized on the surface, as is 
known in the art. For example, photoactivation techniques utilizing photopolymerization 
compounds and techniques are used. In a preferred embodiment, the nucleic acids can be 
synthesized in situ, using well known photolithographic techniques, such as those described 

20 in WO 95/251 16; WO 95/35505; U.S. Patent Nos. 5,700,637 and 5,445,934; and references 
cited within, all of which are expressly incorporated by reference; these methods of . 
attachment form the basis of the Affimetrix GeneChip™ technology. 

Often, amplification-based assays are performed to measure the expression 
level of angiogenesis-associated sequences. These assays are typically performed in 

25 conjunction with reverse transcription. In such assays, an angiogenesis-associated nucleic 
acid sequence acts as a template in an amplification reaction ie.g,, Polymerase Chain 
Reaction, or PCR), In a quantitative amplification, the amount of amplification product will 
be proportional to the amount of template in the original sample. Comparison to appropriate 
controls provides a measure of the amoimt of angiogenesis-associated RNA Methods of 

30 quantitative amplification are well known to those of skill in the art. Detailed protocols for 
quantitative PCR are provided, e.g,, in Innis et al (1990) PCR Protocols, A Guide to Methods 
and Applications y Academic Press, Inc. N.Y.). 

In some embodiments, a TaqMan based assay is used to measure expression. 
TaqMan based assays use a fluorogenic oligonucleotide probe that contains a 5' fluorescent 
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dye and a 3' quenching agent. The probe hybridizes to a PGR product, but cannot itself be 
extended due to a blocking agent at the 3* end. When the PGR product is amplified in 
subsequent cycles, the 5' nuclease activity of the polymerase, e.g., AmpliTaq, results in the 
cleavage of the TaqMan probe. This cleavage separates the 5' fluorescent dye and the 3* 
5 quraching agent, thereby resulting in an increase in fluorescence as a function of 

amplification (see, for example, literature provided by Perkin-Elmer, e.g., www2.perkin- 
elmer.com). 

Other suitable amplification methods include, but are not limited to, ligase 
chain reaction (LCR) (see, Wu and Wallace (1989) Genomics 4: 560, Landegren et al. (1988) 
10 Science 241 : 1077, and Barringer et al, (1990) Gene 89: 117), transcription amplification 

(Kwoh et al (1989) Proc. Natl. Acad, Set USA 86: 1 173), self-sustained sequence repUcation 
(GuateUi et aL (1990) Proc Nat, Acad. ScL USA 87: 1874), dot PGR, and linker adapter PGR, 
etc. 

In a preferred embodiment, angiogenesis nucleic acids, e.g., encoding 

15 angiogenesis proteins are used to make a variety of expression vectors to express 

angiogenesis proteins which can then be used in screening assays, as described below. 
Expression vectors and recombinant DNA technology are well known to fhose of skill in the 
art (see, e,g., Ausubel, supra^ and Gene Expression Systems, Fernandez & HoeflEler, Eds, 
Academic Press, 1999) and are used to express proteins. The expression vectors may be 

20 either self-replicating extrachromosomal vectors or vectors which integrate into a host 
genome. Generally, these expression vectors include transcriptional and translational 
regulatory nucleic acid operably linked to the nucleic acid encoding the angiogenesis protein. 
The term "control sequences" refers to DNA sequences used for the expression of an 
operably linked coding sequence in a particular host organism. Control sequences that are 

25 suitable for prokaiyotes, for example, include a promoter, optionally an operator sequence, 
and a ribosome binding site. Eukaiyotic cells are known to utilize promoters, 
polyadenylation signals, and enhancers. 

Nucleic acid is "operably linked" when it is placed into a fimctional 
relationship with another nucleic acid sequence. For example, DNA for a presequence or 

30 secretory leader is operably linked to DNA for a polypeptide if it is expressed as a preprotein 
that participates in the secretion of the polypeptide; a promoter or enhancer is operably linked 
to a coding sequence if it affects the transcription of the sequence; or a ribosome binding site 
is operably linked to a coding sequence if it is positioned so as to facilitate translation. 
Generally, "operably linked" means that the DNA sequences being linked are contiguous, 
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and, in the case of a secretory leader, contiguous and in reading phase. However, enhancers 
do not have to be contiguous. Linking is typically accomplished by ligation at convenient 
restriction sites. If such sites do not exist, synthetic oligonucleotide adaptors or linkers are 
used in accordance with conventional practice. Transcriptional and translational regulatory 
5 nucleic acid will generally be appropriate to the host cell used to express the angiogenesis 
protein; for example, transcriptional and translational regulatory nucleic acid sequences from 
Bacillus are preferably used to express the angiogenesis protein in Bacillus. Numerous types 
of appropriate expression vectors, and suitable regulatory sequences are known in the art for 
a variety of host cells. 

10 In general, transcriptional and translational regulatory sequences may include, 

but are not limited to, promoter sequences, ribosomal binding sites, transcriptional start and 
stop sequences, translational start and stop sequences, and enhancer or activator sequences. 
In a preferred embodiment, the regulatory sequences include a promoter and transcriptional 
start and stop sequences. 

1 5 Promoter sequences encode either constitutive or inducible promoters. The 

promoters may be either naturally occurring promoters or hybrid promoters. Hybrid 
promoters, which combine elements of more than one promoter, are also known in the art, 
and are useful in the present invention. 

In addition, an expression vector may comprise additional elements. For 

20 example, the expression vector may have two replication systems, thus allowing it to be 

maintained in two organisms, for example in maramalian or insect cells for expression and in 
a procaryotic host for cloning and amplification. Furthermore, for integrating expression 
vectors, the expression vector contains at least one sequence homologous to the host cell 
genome, and preferably two homologous sequences which flank the expression construct. 

25 The integrating vector may be directed to a specific locus in the host cell by selecting the 
appropriate homologous sequence for inclusion in the vector. Constructs for integratiug 
vectors are well known in the art (e.g., Fernandez & Hoeffler, supra). See also Kitamura, et 
al. (1995) PNAS 92:9146-9150. 

In addition, in a preferred embodiment, the expression vector contains a 

30 selectable marker gene to allow the selection of transformed host cells. Selection genes are 
well known in the art and will vary with the host cell used. 

The angiogenesis proteins of the present invention are produced by culturing a 
host cell transformed with an expression vector containing nucleic acid encoding an 
angiogenesis protein, under the appropriate conditions to induce or cause expression of the 
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angiogenesis protein. Conditions appropriate for angiogenesis protein expression will vary 
with the choice of the expression vector and the host cell, and will be easily ascertained by 
one skilled in the art through routine experimentation or optimization. For example, the use 
of constitutive promoters in the expression vector will require optimizing the growth and 
proliferation of the host cell, while the use of an inducible promoter requires the appropriate 
growth conditions for induction. In addition, in some embodiments, the timing of the harvest 
is important. For example, the baculoviral systems used in insect cell expression are lytic 
viruses, and thus harvest time selection can be crucial for product yield. 

Appropriate host cells include yeast, bacteria, archaebacteria, fungi, and insect 
and animal cells, including mammalian cells. Of particular interest are Saccharomyces 
cerevisiae and other yeasts, E, coli, Bacillus subtilis, Sf9 cells, CI 29 cells, 293 cells, 
Neurospora, BHK, CHO, COS, HeLa cells, HUVEC (human umbilical vein endothelial 
cells), THPl cells (a macrophage cell line) and various other human cells and cell lines. 

In a preferred embodiment, the angiogenesis proteins are expressed in 
mammalian cells. Mammalian expression systems are also known in the art, and include 
retroviral and adenoviral systems. Of particular use as mammalian promoters are the 
promoters from mammalian viral genes, since the viral genes are often highly expressed and 
have a broad host range. Examples include the S V40 early promoter, mouse mammary tumor 
virus LTR promoter, adenovirus major late promoter, herpes simplex virus promoter, and the 
CMV promoter (see, e,g,, Femandez & Hoeffler, supra). Typically, transcription termination 
and polyadenylation sequences recognized by mammalian cells are regulatory regions located 
3' to the translation stop codon and thus, together v^th the promoter elements, flank the 
coding sequence. Examples of transcription terminator and polyadenlytion signals include 
those derived form SV40. 

The methods of introducing exogenous nucleic acid into mammalian hosts, as 
well as other hosts, is well known in the art, and will vary with the host cell used. 
Techniques include dextran-mediated transfection, calcium phosphate precipitation, 
polybrene mediated transfection, protoplast fusion, electroporation, viral infection, 
encapsulation of the polynucleotide(s) in liposomes, and direct microiojection of the DNA 
into nuclei. 

In a preferred embodiment, angiogenesis proteins are expressed in bacterial 
systems. Bacterial expression systems are well known in the art. Promoters from 
bacteriophage may also be used and are known in the art. In addition, synthetic promoters 
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and hybrid promoters are also useful; for example, the tac promoter is a hybrid of the trp and 
lac promoter sequences. Furthermore, a bacterial promoter can include naturally occurring 
promoters of non-bacterial origin that have the ability to bind bacterial RNA polymerase and 
initiate transcription. In addition to a functioning promoter sequence, an efficient ribosome 

5 binding site is desirable. The expression vector may also include a signal peptide sequence 
that provides for secretion of the angiogenesis protein, in bacteria. The protein is either 
secreted into the growth media (gram-positive bacteria) or into the periplasmic space, located 
between the inner and outer membrane of the cell (gram-negative bacteria). The bacterial 
expression vector may also include a selectable marker gene to allow for the selection of 

10 bacterial strains that have been transformed. Suitable selection genes include genes which 
render the bacteria resistant to drugs such as ampicillin, chloramphenicol, erythromycin, 
kanamycin, neomycin and tetracycline. Selectable markers also include biosynthetic genes, 
such as those in the histidine, tryptophan and leucine biosynthetic pathways. These 
components are assembled into expression vectors. Expression vectors for bacteria are well 

15 known in the art, and include vectors for Bacillus subtilis, E, coli. Streptococcus cremoriSy 
and Streptococcus lividans^ among others {e.g.; Fernandez & HoefiQer, supra). The bacterial 
expression vectors are transformed into bacterial host cells iising techniques well known in 
the art, such as calcium chloride treatment, electroporation, and others. 

In one embodiment, angiogenesis proteins are produced in insect cells. 

20 Expression vectors for the transformation of insect cells, and in particular, baculovirus-based 
expression vectors, are well known in the art. 

In a preferred embodiment, angiogenesis protein is produced in yeast cells. 
Yeast expression systems are well known in the art, and include expression vectors for 
Saccharomyces cerevisiae, Candida albicans and C. maltosa, Hansenula polymorpha, 

25 Kluyveromyces fragilis and K. lactis, Pichia guillerimondii and P. pastoris, 
Schizosaccharomyces pombe^ and Yarrowia lipolytica. 

The angiogenesis protein may also be made as a fusion protein, using 
techniques well known in the art. Thus, for example, for the creation of monoclonal 
antibodies, if the desired epitope is small, the angiogenesis protein may be fused to a carrier 

30 protein to form an immunogen. Altematively, the angiogenesis protein may be made as a 
fusion protein to increase expression, or fpr other reasons. For example, when the 
angiogenesis protein is an angiogenesis peptide, the nucleic acid encoding the peptide may be 
linked to another nucleic acid for expression purposes. Fusion with detection epitope tags 
can be made, e.g., with FLAG, His 6, myc, HA, etc. 
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In one embodiment, the angiogenesis nucleic acids, proteins and antibodies of 
the invention are labeled. By "labeled" herein is meant that a compound has at least one 
element, isotope or chemical compound attached to enable the detection of the compound. In 
general, labels fall into three classes: a) isotopic labels, which may be radioactive or heavy 
5 isotopes; b) immune labels, which may be antibodies, antigens, or epitope tags and c) colored 
or fluorescent dyes. The labels may be incorporated into the angiogenesis nucleic acids, 
proteins and antibodies at any position. For example, the label should be capable of 
producing, either directly or indirectly, a detectable signal. The detectable moiety may be a 
radioisotope, such as ^H, ^'^C, ^^P, ^^S, or a fluorescent or chemiluminescent compoimd, 

10 such as fluorescein isothiocyanate, rhodamine, or luciferin, or an enzyme, such as alkaline 
phosphatase, beta-galactosidase or horseradish peroxidase. Any method known in the art for 
conjugating the antibody to the label may be employed, including those methods described by 
Hunter et al.. Nature, 144:945 (1962); David et al.. Biochemistry, 13:1014 (1974); Pain et 
al.,^J. Immunol. Meth., 40:219 (1981); andNygren, J. Histochem. and Cytochem., 30:407 

15 (1982). 

Accordingly, the present invention also provides angiogenesis protein 
sequences. An angiogenesis protein of the present invention may be identified in several 
ways. "Protein" in this sense includes proteins, polypeptides, and peptides. As will be 
appreciated by those in the art, the nucleic acid sequences of the invention can be used to 

20 generate protein sequences. There are a variety of ways to do this, including cloning the 
entire gene and verifying its firame and amino acid sequence, or by comparing it to known 
sequences to search for homology to provide a fi-ame, assimiing the angiogenesis protein has 
an identifiable motif or homology to some protein in the database being used. Generally, the 
nucleic acid sequences are input into a program that will search all three firames for 

25 homology. This is done in a preferred embodiment using the following NCBI Advanced 

BLAST parameters. The program is blastx or blastn. The database is nr. The input data is as 
"Sequence in FASTA format". The organism list is **none". The "expect" is 10; the filter is 
default. The "descriptions" is 500, the "alignments" is 500, and the "alignment view" is 
pairwise. The "Query Genetic Codes" is standard (1). The matrix is BLOSUM62; gap 

30 existence cost is 1 1 , per residue gap cost is 1; and the lambda ratio is .85 default. This 
results in the generation of a putative protein sequence. 

Also included within one embodiment of angiogenesis proteins are amino acid 
variants of the naturally occurring sequences, as determined herein. Preferably, the variants 
are preferably greater than about 75% homologous to the wild-type sequence, more 
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preferably greater than about 80%, even more preferably greater than about 85% and most 
preferably greater than 90%. In some embodiments the homology will be as high as about 93 
to 95 or 98%. As for nucleic acids, homology in this context means sequence similarity or 
identity, with identity being preferred. This homology will be determined using standard 
5 techniques well known in the art as are outlined above for the nucleic acid homologies. 

Angiogenesis proteins of the present invention may be shorter or longer than 
the wild type amino acid sequences. Thus, in a preferred embodiment, included within the 
definition of angiogenesis proteins are portions or firagments of the wild type sequences, 
herein. In addition, as outlined above, the angiogenesis nucleic acids of the invention may be 

10 used to obtain additional coding regions, and thus additional protein sequence, using 
techniques known in the art. 

In a preferred embodiment, the angiogenesis proteins are derivative or variant 
angiogenesis proteins as compared to the wild-type sequence. That is, as outlined more fully 
below, the derivative angiogenesis peptide will often contain at least one amino acid 

15 substitution, deletion or insertion, with amino acid substitutions being particularly preferred. 
The amino acid substitution, insertion or deletion may occur at any residue within the 
angiogenesis peptide. 

Also included within one embodiment of angiogenesis proteins of the present 
invention are amino acid sequence variants. These variants typically fall into one or more of 

20 three classes: substitutional, insertional or deletional variants. These variants ordinarily are 
pr^ared by site specific mutagenesis of nucleotides in the DNA encoding the angiogenesis 
protein, using cassette or PGR mutagenesis or other techniques well known in the art, to 
produce DNA encoding the variant, and thereafter expressing the DNA in recombinant cell 
culture as outlined above. However, variant angiogenesis protein fragments having up to 

25 about 1 00-1 50 residues may be prepared by in vitro synthesis using established techniques. 
Amino acid sequence variants are characterized by the predetermined nature of the variation, 
a feature that sets them apart from naturally occurring allelic or interspecies variation of the 
angiogenesis protein amino acid sequence. The variants typically exhibit the same qualitative 
biological activity as the naturally occurring analogue, although variants can also be selected 

30 which have modified characteristics as will be more fiiUy outlined below. 

While the site or region for introducing an amino acid sequence variation is 
predetermined, the mutation per se need not be predetermined. For example, in order to 
optimize the performance of a mutation at a given site, random mutagenesis may be 
conducted at the target codon or region and the expressed angiogenesis variants screened for 
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the optimal combination of desired activity. Techniques for making substitution mutations at 
predetemiined sites in DNA having a known sequence are well known, for example. Ml 3 
primer mutagenesis and PGR mutagenesis. Screening of the mutants is done using assays of 
angiogenesis protein activities. 
5 Amino acid substitutions are typically of single residues; insertions usually 

will be on the order of fix)m about 1 to 20 amino acids, although considerably larger 
insertions may be tolerated. Deletions range from about 1 to about 20 residues, although in 
some cases deletions may be much larger. 

Substitutions, deletions, insertions or any combination thereof may be used to 

10 arrive at a final derivative. Generally these changes are done on a few amino acids to 

minimize the alteration of the molecule. However, larger changes may be tolerated in certain 
circumstances. When small alterations in the characteristics of the angiogenesis protein are 
desired, substitutions are generally made in accordance with the amino acid substitution chart 
provided in the definition section. 

1 5 Substantial changes in function or inmiunological identity are made by 

selecting substitutions that are less conservative than those provided in the definition of 
"conservative substitution". For example, substitutions may be made which more 
significantly affect: the stmcture of the polypeptide backbone in the area of the alteration, for 
example the alpha-helical or beta-sheet structure; the charge or hydrophobicity of the 

20 molecule at the target site; or the bulk of the side chain. The substitutions which in general 
are expected to produce the greatest changes in the polypeptide's properties are those in 
which (a) a hydrophilic residue, e.g. seryl or threonyl, is substituted for (or by) a hydrophobic 
residue, e.g. leucyl, isoleucyl, phenylalanyl, valyl or alanyl; (b) a cysteine or proline is 
substituted for (or by) any other residue; (c) a residue having an electropositive side chain, 

25 e.g. lysyl, arginyl, or histidyl, is substituted for (or by) an electronegative residue, e.g. 
glutamyl or aspartyl; or (d) a residue having a bulky side chain, e.g. phenylalanine, is 
substituted for (or by) one not having a side chain, e.g. glycine. 

The variants typically exhibit the same qualitative biological activity and will 
elicit the same immune response as the naturally-occurring analog, although variants also are 

30 selected to modify the characteristics of the angiogenesis proteins as needed. Alternatively, 
the variant may be designed such that the biological activity of the angiogenesis protein is 
altered. For example, glycosylation sites may be altered or removed. 

Covalent modifications of angiogenesis polypeptides are included within the 
scope of this invention. One type of covalent modification includes reacting targeted amino 
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acid residues of an angiogenesis polypeptide with an organic derivatizing agent that is 
capable of reacting with selected side chains or the N-or C-terminal residues of an 
angiogenesis. polypeptide. Derivatization with.bifunctional agents is useful, for instance, for 
crosslinking angiogenesis polypeptides to a water-insoluble support matrix or surface for use 

5 in the method for purifying anti-angiogenesis polypeptide antibodies or screening assays, as 
is more fully described below. Commonly used crosslinking agents include, e.g., 1,1- 
bis(diazoacetyl)-2-phenylethane, glutaraldehyde, N-hydroxysuccinimide esters, for example, 
esters with 4-azidosalicyUc acid, homobifunctional imidoesters, including disuccinimidyl 
esters such as 3,3*-dithiobis(succinimidylpropionate), bifunctional maleimides such as bis-N- 

10 maleimido-l,8-octane and agents such as methyl-3-[(p-azidophenyl)dithio]propioimidate. 

Other modifications include deamidation of glutaminyl and asparaginyl 
residues to the corresponding glutamyl and aspartyl residues, respectively, hydroxylation of 
proline and lysine, phosphorylation of hydroxyl groups of seryl, threonyl or tyrosyl residues, 
methylation of the y-amino groups of lysine, arginine, and histidine side chains [T.E. 

15 Creighton, Proteins: Structure and Molecular Properties, W.H. Freeman & Co., San 

Francisco, pp. 79-86 (1983)], acetylation of the N-tenninal amine, and amidation of any C- 
terminal carboxyl group. 

Another type of covalent modification of the angiogenesis polypeptide 
included within the scope of this invention comprises altering the native glycosylation pattern 

20 of the polypeptide. "Altering the native glycosylation pattem" is intended for purposes herein 
to mean deleting one or more carbohydrate moieties found in native sequence angiogenesis 
polypeptide, and/or adding one or more glycosylation sites that are not present in the native 
sequence angiogenesis polypeptide. Glycosylation patterns can be altered in many ways. For 
example the use of different cell types to express angiogenesis-associated sequences can 

25 result in different glycosylation patterns. 

Addition of glycosylation sites to angiogenesis polypeptides may also be 
accomplished by altering the amino acid sequence thereof. The alt^ation may be made, for 
example, by the addition of, or substitution by, one or more serine or threonine residues to the 
native sequence angiogenesis polypeptide (for O-linked glycosylation sites). The 

30 angiogenesis amino acid sequence may optionally be altered through changes at the DNA 

level, particularly by mutating the DNA encoding the angiogenesis polypeptide at preselected 
bases such that codons are generated that will translate into the desired anoino acids. 
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Another means of increasing the number of carbohydrate moieties on the 
aiigiogenesis polypeptide is by chemical or enzymatic coupHng of glycosides to the 
polypeptide. Such methods are described in the art, e.g., in WO 87/05330 published 1 1 
September 1987, and in Aplin and Wriston, CRC Crit. Rev. Biochem., pp. 259-306 (1981). 
5 Removal of carbohydrate moieties present on the angiogenesis polypeptide 

may be accomplished chemically or enzymatically or by mutational substitution of codons 
encoding for amino acid residues that serve as targets for glycosylation. Chemical 
deglycosylation techniques are known in the art and described, for instance, by Hakimuddin, 
et al.. Arch. Biochem. Biophys., 259:52 (1987) and by Edge et al. Anal. Biochem., 1 18:131 

10 (1981). Enzymatic cleavage of carbohydrate moieties on polj^eptides can be achieved by the 
use of a variety of endo-and exo-glycosidases as described by Thotakura et al., Meth, 
EnzymoL, 138:350(1987). 

Another type of covalent modification of angiogenesis comprises linking the 
angiogenesis polypeptide to one of a variety of nonproteinaceous polymers, e.g., 

1 5 polyethylene glycol, polypropylene glycol, or polyoxyalkylenes, in the maimer set forth in 
U.S. Patent Nos. 4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192 or 4,179,337. 

Angiogenesis polypeptides of the present invention may also be modified in a 
way to form chimeric molecules comprising an angiogenesis polypeptide fiised to another, 
heterologous polypeptide or amino acid sequence. In one embodiment, such a chimeric 

20 molecule comprises a fiision of an angiogenesis polypeptide with a tag polj^eptide which 
provides an epitope to which an anti-tag antibody can selectively bind. The epitope tag is 
generally placed at the amino-or carboxyl-terminus of the angiogenesis polypeptide. The 
presence of such epitope-tagged forms of an angiogenesis polypeptide can be detected using 
an antibody against the tag polypeptide. Also, provision of the epitope tag enables the 

25 angiogenesis polypeptide to be readily purified by affinity purification using an anti-tag 
antibody or another type of affinity matrix that binds to the epitope tag. In an alternative 
embodiment, the chimeric molecule may comprise a fiision of an angiogenesis polypeptide 
with an immunoglobulin or a particular region of an immunoglobulin. For a bivalent form of 
the chimeric molecule, such a fusion could be to the Fc region of an IgG molecule. 

30 Varioiis tag polypeptides and their respective antibodies are well known in the 

art. Examples include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; 
HIS6 and metal chelation tags, the flu HA tag polypeptide and its antibody 12CA5 [Field et 
aL Mol Cell Biol, 8:2159-2165 (1988)]; the c-myc tag and the 8F9, 3C7, 6E10, G4, B7 and 
9E10 antibodies thereto [Evan et al. Molecular and Cellular Biology, 5:3610-3616 (1985)]; 
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and the Herpes Simplex virus glycoprotein D (gD) tag and its antibody [Paborsky et al. 
Protein Engineerings 3(6):547-553 (1990)]. Other tag polypeptides include the Flag-peptide 
[Hopp et aL, BioTechnology, 6:1204-1210 (1988)]; the KT3 epitope peptide [Martm et al. 
Science, 255:192-194 (1992)]; tubulin epitope peptide [Skinner et al, J. Biol Chetn., 
5 266:15163-15166 (1991)]; and the T7 gene 10 protem peptide tag [Lutz-Freyennuth et al, 
Proc. Natl Acad. Set USA, 87:6393-6397 (1990)]. 

Also included with an embodiment of angiogenesis protein are other 
angiogenesis proteins of ttie angiogenesis family, and angiogenesis proteins from other 
organisms, which are cloned and expressed as outlined below. Thus, probe or degenerate 

10 polymerase chain reaction (PGR) primer sequences may be used to find other related 

angiogenesis proteins from himians or other organisms. As will be appreciated by those in 
the art, particularly useful probe and/or PGR primer sequences include the unique areas of the 
angiogenesis nucleic acid sequence. As is generally known in tlie art, preferred PGR primers 
are from about 15 to about 35 nucleotides in length, with from about 20 to about 30 being 

15 preferred, and may contain inosine as needed. The conditions for the PGR reaction are well 
known in the art (e.g^., Innis, PGR Protocols, supra). 

In addition, as is outlined herein, angiogenesis proteins can be made that are 
longer than those encoded by flie nucleic acids of the figures, e.g., by the elucidation of 
extended sequences, the addition of epitope or purification tags, the addition of other fusion 

20 sequences, etc. 

Angiogenesis proteins may also be identified as being encoded by 
angiogenesis nucleic acids. Thus, angiogenesis proteins are encoded by nucleic acids that 
will hybridize to the sequences of the sequence listings, or their complements, as outlined 
herein. 

25 In a preferred embodiment, when the angiogenesis protein is to be used to 

generate antibodies, e.g., for immunotherapy or immunodiagnosis, the angiogenesis protein 
should share at least one epitope or determinant with the full length protein. By "epitope" or 
"deteraiinant" herein is typically meant a portion of a protein which will generate and/or bind 
an antibody or T-cell receptor in the context of MHG. Thus, in most instances, antibodies 

30 made to a smaller angiogenesis protein will be able to bind to the full-length protein, 
particularly linear epitopes. In a preferred embodiment, the epitope is unique; that is, 
antibodies generated to a unique epitope show little or no cross-reactivity. In a preferred 
embodiment, the epitope is selected from a protein sequence set out in Table 8. 
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Methods of preparing polyclonal antibodies are known to the skilled artisan 
(e.g.^ Coligan, supra; and Harlow & Lane, supra). Polyclonal antibodies can be raised in a 
mammal, e.g.^ by one or more injections of an immunizing agent and, if desired, an adjuvant. 
Typically, the immunizing agent and/or adjuvant will be injected in the mammal by multiple 
5 subcutaneous or intraperitoneal injections. The immunizing agent may include a protein 
encoded by a nucleic acid of the figures or firagment thereof or a fusion protein thereof. It 
may be usefiil to conjugate the immunizing agent to a protein known to be immunogenic in 
the mammal being immunized. Examples of such immunogenic proteins include but are not 
limited to keyhole limpet hemocyanin, serum albxmiin, bovine thyroglobulin, and soybean 

10 typsin inhibitor. Examples of adjuvants which may be employed include Freund's complete 
adjuvant and MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic trehalose 
dicorynomycolate). The inmiunization protocol may be selected by one skilled in the art 
without undue experimentation. 

The antibodies may, alternatively, be monoclonal antibodies. Monoclonal 

1 5 antibodies may be prepared using hybridoma methods, such as those described by Kohler and 
Milstein, Nature, 256:495 (1975). In a hybridoma method, a mouse, hamster, or other 
appropriate host animal, is typically immunized with an immunizing agent to elicit 
lymphocytes that produce or are capable of producing antibodies that will specifically bind to 
the inraiunizing agent. Alternatively, the lymphocytes may be inraiunized in vitro. The 

20 immimizing agent will typically include a.polypeptide encoded by a nucleic acid of Tables 1- 
8 , or firagment thereof, or a fiision protein thereof. Generally, either peripheral blood 
lymphocytes ("PBLs") are used if cells of human origin are desired, or spleen cells or lymph 
node cells are used if non-human mammaUan sources are desired. The lymphocytes are then 
fiised with an inamortalized cell line using a suitable fiising agent, such as polyethylene 

25 glycol, to form a hybridoma cell [Goding, Monoclonal Antibodies: Principles and Practice, 
Academic Press, (1986) pp. 59-103]. Immortalized cell lines are usually transformed 
mammalian cells, particularly myeloma cells of rodent, bovine and human origin. Usually, 
rat or mouse myeloma cell lines are employed. The hybridoma cells may be cultured in a 
suitable culture medium that preferably contains one or more substances that inhibit the 

30 growth or survival of the imfiised, immortalized cells. For example, if the parental cells lack 
the enzyme hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the culture 
medium for the hybridomas typically will include hypoxanthine, aminopterin, and thymidine 
("HAT medium"), which substances prevent the growth of HGPRT-deficient cells. 
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In one embodiment, the antibodies are bispecific antibodies. Bispecific 
antibodies are monoclonal, preferably human or himianized, antibodies that have binding 
specificities for at least two different antigens or that have binding specificities for two 
epitopes on the same antigen. In one embodiment, one of the binding specificities is for a 
5 protein encoded by a nucleic acid Tables 1-8 or a firagment thereof, the other one is for any 
other antigen, and preferably for a cell-surface protein or receptor or receptor subunit, 
preferably one that is tumor specific. Alternatively, tetramer-type technology may create 
multivalent reagents. 

In a preferred embodiment, the antibodies to angiogenesis protein are capable 

10 of reducing or eliminating a biological function of an angiogenesis protein, as is described 
below. That is, the addition of anti-angiogenesis protein antibodies (either polyclonal or 
preferably monoclonal) to angiogenic tissue (or cells containing angiogenesis) may reduce or 
eliminate the angiogenesis activity. Generally, at least a 25% decrease in activity is 
preferred, with at least about 50% being particularly preferred and about a 95-100% decrease 

1 5 being especially preferred. 

In a preferred embodiment the antibodies to the angiogenesis proteins are 
humanized antibodies (e.g., Xenerex Biosciences, Mederex, Lie, Abgenix, Inc., Protein 
Design Labs,Inc.) Humanized forms of non-human (e.g., murine) antibodies are chimeric 
molecules of immunoglobulins, inojnunoglobulin chains or fragments thereof (such as Fv, 

20 Fab, Fab*, F(ab')2 or other antigen-binding subsequences of antibodies) which contain 

minimal sequence derived fi*om non-human immimoglobulin. Humanized antibodies include 
hiunan immunoglobuHns (recipient antibody) in which residues form a complementary 
determining region (CDR) of the recipient are replaced by residues fi-om a CDR of a non- 
human species (donor antibody) such as mouse, rat or rabbit having the desired specificity, 

25 affinity and capacity. In some instances, Fv framework residues of the human 

immunoglobulin are replaced by corresponding non-human residues. Humanized antibodies 
may also comprise residues which are found neither in the recipient antibody nor in the 
imported CDR or fimiework sequences. In general, a humanized antibody will comprise 
substantially all of at least one, and typically two, variable domains, in which all or 

3 0 substantially all of the CDR regions correspond to those of a non-human immunoglobulin 
and all or substantially all of the firamework (FR) regions are those of a human 
immunoglobulin consensus sequence. The humanized antibody optimally also will comprise 
at least a portion of an immunoglobulin constant region (Fc), typically that of a himian 
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immunoglobulin [Jones et al.. Nature, 321:522-525 (1986); Riechmann et al.. Nature, 
332:323-329 (1988); andPresta, Curr. Op. Struct. Biol., 2:593-596 (1992)]. 

Methods for humanizing non-human antibodies are well known in the art. 
Generally, a humanized antibody has one or more amino acid residues introduced into it from 
5 a soiirce which is non-human. These non-human amino acid residues are often referred to as 
import residues, which are typically taken from an import variable domain. Humanization 
can be essentially performed following the method of Winter and co-workers [Jones et al.. 
Nature, 321:522-525 (1986); Riechmann et al.. Nature, 332:323-327 (1988); Verhoeyen et 
al.. Science, 239:1534-1536 (1988)], by substituting rodent CDRs or CDR sequences for the 

10 corresponding sequences of a human antibody. Accordingly, such humanized antibodies are 
chimeric antibodies (U.S. Patent No. 4,816,567), wherein substantially less than an intact 
human variable domain has been substituted by the corresponding sequence from a non- 
himian species. In practice, hiunanized antibodies are typically human antibodies in which 
some CDR residues and possibly some FR residues are substituted by residues from 

15 analogous sites in rodent antibodies. 

Human antibodies can also be produced using various techniques known in the 
art, including phage display libraries [Hoogenboom and Winter, J. Mol. Biol., 227:381 
(1991); Marks et al., J. Mol. Biol., 222:581 (1991)]. The techniques of Cole et al. and 
Boemer et al. are also available for the preparation of hmnan monoclonal antibodies (Cole et 

20 al.. Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, p. 77 (1985) and Boemer et 
al., J. Immunol., 147(l):86-95 (1991)]. Similarly, human antibodies can be made by 
introducing of human immunoglobulin loci into transgenic animals, e.g., mice in which the 
endogenous immimoglobulin genes have been partially or completely inactivated. Upon 
challenge, human antibody production is observed, which closely resembles that seen in 

25 .humans in all respects, iacluding gene rearrangement, assembly, and antibody repertoire. 
This approach is described, for example, in U.S. Patent Nos. 5,545,807; 5,545,806; 
5,569,825; 5,625,126; 5,633,425; 5,661,016, and in the following scientific publications: 
Marks et al., Bio/Technology 10, 779-783 (1992); Lonberg et al.. Nature 368 856-859 (1994); 
Morrison, Nature 368, 812-13 (1994); Fishwild et al.. Nature Biotechnology 14, 845-51 

30 (1996); Neuberger, Nature Biotechnology 14, 826 (1996); Lonberg and Huszar, Intem. Rev. 
Immunol. 13 65-93 (1995). 

By immunotherapy is meant treatment of angiogenesis with an antibody raised 
against angiogenesis proteins. As used herein, irmnunotherapy can be passive or active. 
Passive immunotherapy as defined herein is the passive transfer of antibody to a recipient 
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(patient). Active immunization is the induction of antibody and/or T-cell responses in a 
recipient (patient). Induction of an immune response is the result of providing the recipient 
with an antigen to which antibodies are raised. As appreciated by one of ordinary skill in the 
art, the antigen may be provided by injecting a polypeptide against which antibodies are 
5 desired to be raised into a recipient, or contacting the recipient with a nucleic acid capable of 
expressing the antigen and imder conditions for expression of the antigen, leading to an 
inmiune response. 

In a preferred embodiment the angiogenesis proteins against which antibodies 
are raised are secreted proteins as described above. Without being boimd by theory, 

10 antibodies used for treatment, bind and prevent the secreted protein from binding to its 
receptor, thereby inactivating the secreted angiogenesis protein. 

In another preferred embodiment, the angiogenesis protein to which antibodies 
are raised is a transmembrane protein. Without being boxmd by theory, antibodies used for 
treatment, bind the extracellular domain of the angiogenesis protein and prevent it from 

1 5 binding to othw proteins, such as circulating ligands or cell-associated molecules. The 

antibody may cause down-regulation of the transmembrane angiogenesis protein. As will be 
appreciated by one of ordinary skill in the art, the antibody may be a competitive, non- 
competitive or vmcompetitive inhibitor of protein binding to the extracellular domain of the 
angiogenesis protein. The antibody is also an antagonist of the angiogenesis protein. 

20 Further, the antibody prevents activation of the transmembrane angiogenesis protein. In one 
aspect, when the antibody prevents the binding of other molecules to the angiogenesis 
protein, the antibody prevents growth of the ceil. The antibody may also be used to target or 
sensitize the cell to cytotoxic agents, including, but not limited to TNF-a, TNF-p, IL-1, INF-y 
and IL-2, or chemotherapeutic agents including 5FU, vinblastine, actinomycin D, cisplatin, 

25 methotrexate, and the like. In some instances the antibody belongs to a sub-type that 
activates serum complement when complexed with the transmembrane protein thereby 
mediating cytotoxicity or antigen-depaident cytotoxicity (ADCC). Thus, angiogenesis is 
treated by administering to a patient antibodies directed against the transmembrane 
angiogenesis protein. Antibody-labeling may activate a co-toxin, localize a toxin payload, or 

30 otherwise provide means to locally ablate cells. 

in another preferred embodiment, the antibody is conjugated or fused to an 
effector moiety. The effector moiety can be any number of molecules, including labelling 
moieties such as radioactive labels or fluorescent labels, or can be a therapeutic moiety. In 
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one aspect the therapeutic moiety is a small molecule that modulates the activity of the 
angiogenesis protein. In another aspect the therapeutic moiety modulates the activity of 
molecules associated with or in close proximity to the angiogenesis protein. The therapeutic 
moiety may inhibit enzymatic activity such as protease or coUagenase activity associated with 
5 angiogenesis, or be an attractant of other cells, such as NK cells. 

In a preferred embodiment, the therapeutic moiety can also be a cytotoxic 
agent. In this method, targeting the cytotoxic agent to angiogenesis tissue or cells, results in a 
reduction in the number of afflicted cells, thereby reducing symptoms associated with 
angiogenesis. Cytotoxic agents are numerous and varied and include, but are not limited to, 

10 cytotoxic drugs or toxins or active fragments of such toxins. Suitable toxins and their 

corresponding fragments include diphtheria A chain, exotoxin A chain, ricin A chain, abrin A 
chain, curcin, crotin, phenomycin, enomycin and the like. Cj^otoxic agents also include 
radiochemicals made by conjugating radioisotopes to antibodies raised against angiogenesis 
proteins, or binding of a radionuclide to a chelating agent that has been covalently attached to 

15 the antibody. Targeting the therapeutic moiety to transmembrane angiogenesis proteins not 
only serves to increase the local concentration of therapeutic moiety in the angiogenesis 
afflicted area, but also serves to reduce deleterious side effects that may be associated with 
the therapeutic moiety. 

In another preferred embodiment, the angiogenesis protein against which the 

20 antibodies are raised is an intracellular protein. In this case, the antibody may be conjugated 
or fused to a protein which faciUtates entry into the cell. In one case, the antibody enters the 
cell by endocytosis. In another embodiment, a nucleic acid encoding the antibody is 
administered to the individual or cell. Moreover, wherein the angiogenesis protein can be 
targeted within a cell, i.e., the nucleus, an antibody thereto contains a signal for that target 

25 locaUzation, i.e., a nuclear localization signal. 

The angiogenesis antibodies of the invention specifically bind to angiogenesis 
proteins. By "specifically bind" herein is meant that the antibodies bind to the protein with a 
Kd of at least about 0. 1 mM, more iisually at least about 1 jiM, preferably at least about 0. 1 
pM or better, and most preferably, 0.01 \jM or better. Selectivity of binding is also 

30 important. 

La a preferred embodiment, the angiogenesis protein is purified or isolated 
after expression. Angiogenesis proteins may be isolated or purified in a variety of ways 
known to those skilled in the art depending on what other components are present in the 
sample. Standard pmification methods include electrophoretic, molecular, immimological 
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and chromatographic techniques, including ion exchange, hydrophobic, affinity, and reverse- 
phase HPLC chromatography, and chromatofocusing. For example, the angiogenesis protein 
may be purified using a standard anti-angiogenesis protein antibody column. Ultrafiltration 
and diafiltration techniques, in conjunction with protein concentration, are also usefiiL For 
5 general guidance in suitable pvirification techniques, see Scopes, R., Protein Purification, 
Springer- Verlag, NY (1982). The degree of purification necessary will vary depending on 
the use of the angiogenesis protein. In some instances no purification will be necessary. 

Once expressed and purified if necessary, the angiogenesis proteins and 
nucleic acids are useful in a nmnber of applications. They may be used as immunoselection 
10 reagents, as vaccine reagents, as screening agents, etc. 

Detection of angiogenesis sequence for diagnostic and therapeutic applications 

In one aspect, the RNAexpression levels of genes are determined for different 
cellular states in the angiogenesis phenotype. Expression levels of genes in normal tissue 

15 (i.e., not undergoing angiogenesis) and in angiogenesis tissue (and in some cases, for varying 
severities of angiogenesis that relate to prognosis, as outlined below) are evaluated to provide 
expression profiles. An expression profile of a particular cell state or point of development is 
essentially a "fingerprint" of the state. While two states may have any particular gene 
similarly expressed, the evaluation of a number of genes simultaneously allows the 

20 generation of a gene expression profile that is reflective of the state of the cell. By comparing 
expression profiles of cells in different states, information regarding which genes are 
important (including both up- and down-regulation of genes) in each of these states is 
obtained. Then, diagnosis may be performed or confirmed to determine whether a tissue 
sample has the gene expression profile of normal or angiogenesic tissue. This will provide 

25 for molecular diagnosis of related conditions. 

"Differential expression," or grammatical equivalents as used herein, refers to 
qualitative or quantitative differences in the temporal and/or cellular gene expression 
patterns within and among cells and tissue. Thus, a differentially expressed gene can 
qualitatively have its expression altered, including an activation or inactivation, in, e.g.y 

30 normal versus angiogenic tissue. Genes may be turned on or turned off in a particular state, 
relative to another state thus permitting comparison of two or more statese. A qualitatively 
regulated gene will exhibit an expression pattern within a state or cell type which is 
detectable by standard techniques. Some genes will be expressed in one state or cell type, but 
not in both. Alternatively, the difference in expression may be quantitative, e,gj in that 
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expression is increased or decreased; i.e., gene expression is either upregulated, resulting in 
an increased amount of transcript, or downregulated, resulting in a decreased amount of 
transcript. The degree to which expression differs need only be large enough to quantify via 
standard characterization techniques as outlined below, such as by use of Affymetrix 
5 GeneChip™ expression arrays, Lockhart, Nature Biotechnology, 1 4: 1 675- 1 680 (1 996), 

hereby expressly incorporated by reference. Other techniques include, but are not limited to, 
quantitative reverse transcriptase PCR, Northern analysis and RNase protection. As outlined 
above, preferably the change in expression (i.e., upregulation or downregulation) is at least 
about 50%, more preferably at least about 100%, more preferably at least about 150%, more 

10 preferably at least about 200%, with from 300 to at least 1000% being especially preferred. 

Evaluation may be at the gene transcript, or the protein level. The amoimt of 
gene expression may be monitored using nucleic acid probes to the DNA or RNA equivalent 
of the gene transcript, and the quantification of gene expression levels, or, altematively, the 
final gene product itself (protein) can be monitored, e.^., with antibodies to the angiogenesis 

15 protein and standard immunoassays (ELISAs, etc.) or other techniques, including mass 
spectroscopy assays, 2D gel electrophoresis assays, etc. Proteins corresponding to 
angiogenesis genes, i.e., those identified as being important in an angiogenesis phenotype, 
can be evaluated in an angiogenesis diagnostic test. 

In a preferred embodiment, gene expression monitoring is performed 

20 simultaneously on a number of genes. Multiple protein expression monitoring can be 

performed as well. Similarly, these assays may be performed on an individual basis as well. 

In this embodiment, the angiogenesis nucleic acid probes are attached to 
biochips as outlined herein for the detection and quantification of angiogenesis sequences in a 
particular cell. The assays are further described below in the example. PGR techniques can 

25 be used to provide greater sensitivity. 

In a preferred embodiment nucleic acids encoding the angiogenesis protein are 
detected. Although DNA or RNA encoding the angiogenesis protein may be detected, of 
particular interest are methods wherein an mRNA encoding an angiogenesis protein is 
detected. Probes to detect mRNA can be a nucleotide/deoxynucleotide probe that is 

30 complementary to and hybridizes wifli the mRNA and includes, but is not limited to, 

oligonucleotides, cDNA or RNA. Probes also should contain a detectable label, as defined 
herein. In one method the mRNA is detected after immobilizing the nucleic acid to be 
examined on a solid support such as nylon membranes and hybridizing the probe with the 
sample. Following washing to remove the non-specifically bound probe, the label is 
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detected. In another method detection ofthemRNA is performed in situ. In this method 
permeabili2:ed cells or tissue samples are contacted with a detectably labeled nucleic acid 
probe for sufficient time to allow the probe to hybridize with the target mKNA. Following 
washing to remove the non-specifically bound probe, the label is detected. For example a 
5 digoxygenin labeled riboprobe (RNA probe) that is complementary to the mRNA encoding 
an angiogenesis protein is detected by binding the digoxygenin with an anti-digoxygenin 
secondary antibody and developed with nitro blue tetrazolium and 5-bromo-4-chloro-3- 
indoyl phosphate. 

In a preferred embodiment, various proteins from the three classes of proteins 
10 as described herein (secreted, transmembrane or intracellular proteins) are used in diagnostic 
assays. The angiogenesis proteins, antibodies, nucleic acids, modified proteins and cells 
containing angiogenesis sequences are used in diagnostic assays. This can be performed on 
an individual gene or corresponding polypeptide level. In a preferred embodiment, the 
expression profiles are used, preferably in conjunction with high throughput screening 
15 techniques to allow monitoring for expression profile genes and/or corresponding 
polypeptides. 

As described and defined herein, angiogenesis proteins, including 
intracellular, transmembrane or secreted proteins, find use as markers of angiogenesis. 
Detection of these proteins in putative angiogenesis tissue allows for detection or diagnosis of 

20 angiogenesis. In one embodiment, antibodies are used to detect angiogenesis proteins. A 
preferred method separates proteins fi-om a sample by electrophoresis on a gel (typically a 
denaturing and reducing protein gel, but may be another type of gel, including isoelectric 
focusing gels and the like). Following separation of proteins, the angiogenesis protein is 
detected, e,g,y by immunoblotting with antibodies raised against the angiogenesis protein. 

25 Methods of immimoblotting are well known to those of ordinary skill in the art. 

In another preferred method, antibodies to the angiogenesis protein find use in 
in situ imaging techniques, e.g.y in histology Methods in Cell Biology: Antibodies in 
Cell Biology^ volume 37 (Asai, ed. 1993)). In this method cells are contacted with firom one 
to many antibodies to the angiogenesis protein(s). Following washing to remove non-specific 

30 antibody binding, the presence of the antibody or antibodies is detected. In one embodiment 
the antibody is detected by incubating with a secondary antibody that contains a detectable 
label. In another method the primary antibody to the angiogenesis protein(s) contains a 
detectable label, for example an enzyme marker that can act on a substrate. In another 
preferred embodiment each one of multiple primary antibodies contains a distinct and 
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detectable label. This method finds particular use in simultaneous screening for a plurality of 
angiogenesis proteins. As will be appreciated by one of ordinary skill in the art, many other 
histological imaging techniques are alsoprovided by the invention. 

In a preferred embodiment the label is detected in a fluorometer which has the 
5 ability to detect and distinguish emissions of different wavelengths. In addition, a 
fluorescence activated cell sorter (FACS) can be used in the method. 

In another preferred embodiment, antibodies find use in diagnosing 
angiogenesis firom biological samples, such as blood, urine, sputum, or other bodily fluids. 
As previously described, certain angiogenesis proteins are secreted/circulating molecules. 

10 Blood samples, therefore, are useful as samples to be probed or tested for the presence of 
secreted angiogenesis proteins. Antibodies can be used to detect an angiogenesis protein by 
previously described immunoassay techniques including ELISA, immunoblotting (Western 
blotting), immunoprecipitation, BIACORE technology and the like. Conversely, the presence 
of antibodies may indicate an immune response against an endogenous angiogenesis protein. 

15 In a preferred embodiment, in situ hybridization of labeled angiogenesis 

nucleic acid probes to tissue arrays is done. For example, arrays of tissue samples, including 
angiogenesis tissue and/or nomial tissue, are made. In situ hybridization (see, e.g., Ausubel, 
supra) is then performed. When comparing the fingerprints between an individual and a 
standard, the skilled artisan can make a diagnosis, a prognosis, or a prediction based on the 

20 findings. It is fiirther understood that the genes which indicate the diagnosis may differ ft-om 
those which indicate the prognosis and molecular profiUng of the condition of the cells may 
lead to distinctions between responsive or refractory conditions or may be predictive of 
outcomes. 

In a preferred embodiment, the angiogenesis proteins, antibodies, nucleic 
25 acids, modified proteins and cells containing angiogenesis sequences are used in prognosis 
assays. As above, gene expression profiles can be generated that correlate to angiogenesis 
severity, in terms of long term prognosis. Again, this may be done on either a protein or gene 
level, with the use of genes being preferred. As above, angiogmesis probes may be attached 
to biochips for the detection and quantification of angiogenesis sequences in a tissue or 
30 patient. The assays proceed as outlined above for diagnosis. PGR method may provide more 
sensitive and accurate quantification. 

In a preferred embodiment members of the three classes of proteins as 
described herein are used in drug screening assays. The angiogenesis proteins, antibodies, 
nucleic acids, modified proteins and cells containing angiogenesis sequences are used in drug 
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screening assays or by evaluating the effect of drug candidates on a "gene expression profile" 
or expression profile of polypeptides. In a preferred embodiment, the expression profiles are 
used, preferably in conjunction with high throughput screening techniques to allow 
monitoring for expression profile genes after treatment with a candidate agent (e.g., 
5 Zlokamik, et al., Sciisnce 279, 84-8 (1 998); Heid, Genome Res 6:986-94, 1 996). 

In a preferred embodiment, the angiogenesis proteins, antibodies, nucleic 
acids, modified proteins and cells containing the native or modified angiogenesis proteins are 
used in screening assays. That is, the presOTt invention provides novel methods for screening 
for compositions which modulate the angiogenesis phenotype or an identified physiological 

10 function of an angiogenesis protein. As above, this can be done on an individual gene level 
or by evaluating the effect of drug candidates on a "gene expression profile". In a preferred 
embodiment, the expression profiles are used, preferably in conjunction with high throughput 
screening techniques to allow monitoring for expression profile genes after treatment with a 
candidate agent, see Zlokamik, supra. 

1 5 Having identified the diflferentially expressed genes herein, a variety of assays 

may be executed. In a preferred embodiment, assays may be run on an individual gene or 
protein level. That is, having identified a particular gene as up regulated in angiogenesis, test 
compounds can be screened for the ability to modulate gene expression or for binding to the 
angiogenic protein. "Modulation" thus includes both an increase and a decrease in gene 

20 expression. The preferred amount of modulation will depend on the original change of the 
gene expression in normal versus tissue xmdergoing angiogenesis, with changes of at least 
10%, preferably 50%, more preferably 100-300%, and in some embodiments 300-1000% or 
greater. Thus, if a gene exhibits a 4- fold increase in angiogenic tissue compared to normal 
tissue, a decrease of about four-fold is often desired; similarly, a 10-fold decrease in 

25 angiogenic tissue compared to normal tissue often provides a target value of a 10-fold 
increase in expression to be induced by the test compoxmd. 

The amount of gene expression may be monitored using nucleic acid probes 
and the quantification of gene expression levels, or, alternatively, the gene product itself can 
be monitored, e.g.^ through the use of antibodies to the angiogenesis protein and standalrd 

30 immunoassays. Proteomics and separation techniques may also allow quantification of 
expression. 

In a preferred embodiment, gene expression or protein monitoring of a number 
of entitites, ue,, an expression profile, is monitored simultaneously. Such profiles will 
typically invove a plurality of those entitites described herein.. 
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In this embodiment, the angiogenesis nucleic acid probes are attached to 
biochips as outlined herein for the detection and quantification of angiogenesis sequences in a 
particular cell. Altematively, PGR may be used. Thus, a series, e.g., of microtiter plate, may 
be used with dispensed primers in desired wells. A PGR reaction can then be perforaied and 
5 analyzed for each well. 

Modulators of angiogenesis 

Expression monitoring can be performed to identify compounds that modify 
the expression of one or more angiogenesis-associated sequences, e.g,^ a polynucleotide 
10 sequence set out in Tables 1-8 . Generally, in a preferred embodiment, a test modulator is 
added to the cells prior to analysis. Moreover, screens are also provided to identify agents 
ttiat modulate angiogenesis, modulate angiogenesis proteins, bind to an angiogenesis protein, 
or interfere with the binding of an angiogenesis protein and an antibody or other binding 
partner. 

15 The term "test compoimd" or "drug candidate" or "modulator" or grammatical 

equivalents as used herein describes any molecule, e.g., protein, oligopeptide, small organic 
molecule, polysaccharide, polynucleotide, etc., to be tested for the capacity to directly or 
indirectly alter the angiogenesis phenotype or the expression of an angiogenesis sequence, 
e.g., a nucleic acid or protein sequence. In preferred embodiments, modulators alter 

20 expression profiles, or expression profile nucleic acids or proteins provided herein. In one 
embodiment, the modulator suppresses an angiogenesis phenotype, for example to a normal 
tissue fingerprint. In another embodiment, a modulator induced an angiogenesis phenotype. 
Generally, a plurality of assay mixtures are run in parallel with different agent concentrations 
to obtain a differential response to the various concentrations. Typically, one of these 

25 concentrations serves as a negative control, at zero concentration or below the level of 
detection. 

In one aspect, a modulator will neutralize the effect of an angiogenesis protein. 
By "neutralize" is meant that activity of a protein is inhibited or blocked and thereby has 
substantially no effect on a cell. 
30 In certain embodiments, combinatorial libraries of potential modulators will be 

screened for an ability to bind to an angiogenesis polypeptide or to modulate activity. 
Conventionally, new chemical entities with useful properties are generated by identifying a 
chemical compound (called a "lead compound") with some desirable property or activity, 
e.g., inhibiting activity, creating variants of the lead compound, and evaluating the property 
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and activity of those variant compounds. Often, high throughput screening (HTS) methods 
are employed for such an analysis. 

In one preferred embodiment, high throughput screening methods involve 
providing a library containing a large number of potential therapeutic compounds (candidate 
5 compounds). Such "combinatorial chemical libraries" are then screened in one or more 
assays to identify those library members (particular chemical species or subclasses) that 
display a desired characteristic activity. The compounds thus identified can serve as 
conventional "lead compoimds" or can themselves be used as potential or actual therapeutics. 
A combinatorial chemical library is a collection of diverse chemical 

10 compounds generated by either chenucal synthesis or biological synthesis by combining a 
number of chemical "building blocks" such as reagents. For example, a linear combinatorial 
chemical library, such as a polypeptide {e.g., mutein) library, is formed by combining a set of 
chemical building blocks called amino acids in every possible way for a given compound 
length {i.e., the number of amino acids in a polypeptide compound). Millions of chemical 

1 5 compoimds can be synthesized through such combinatorial mixing of chemical building 
blocks (Gallop et al (1994) J. Med. Chem. 37(9): 1233-1251). 

Preparation and screening of combinatorial chemical libraries is well knoAvn to 
those of skill in the art. Such combinatorial chemical libraries include, but are not limited to, 
peptide libraries (see, e.g., U.S. Patent No. 5,010,175, Furka (1991) Int. J. Pept Prot Res., 

20 37: 487-493, Houghton et al (1991) Nature, 354: 84-88), peptoids (PCX Publication No WO 
91/19735, 26 Dec. 1991), encoded peptides (PCT Publication WO 93/20242, 14 Oct. 1993), 
random bio-oUgomers (PCT Publication WO 92/00091, 9 Jan. 1992), benzodiazepines (U.S. 
Pat. No. 5,288,514), diversomers such as hydantoins, benzodiazepines and dipeptides (Hobbs 
et al, (1993) Proc. Nat. Acad. Sci. USA 90: 6909-6913), vinylogous polypeptides (Hagihara 

25 et al. (1992) J. Amer. Chem. Soc. 114: 6568), nonpeptidal peptidomimetics with a Beta-D- 
Glucose scaffolding (Hirschmann et al., (1992) J. Amer. Chem. Soc. 1 14: 9217-9218), 
analogous organic syntheses of small compound libraries (Chen et al (1994) J. Amer. Chem. 
Soc. 116: 2661), oligocarbamates (Cho, et al., (1993) Science 261:1303), and/or peptidyl 
phosphonates (Campbell et al, (1994) J. Org. Chem. 59: 658). See, generally, Gordon et al, 

30 (1994) J. Med. Chem. 37:1385, nucleic acid libraries (see, e.g., Strategene, Corp.), peptide 
nucleic acid libraries {see, e.g., U.S. Patent 5,539,083), antibody libraries {see, e.g., Vaughn 
etal {1996) Nature Biotechnology, 14(3): 309-314), and PCT/US96/1 0287), carbohydrate 
libraries {see, e.g., Liang et al, (1996) Science, 274: 1520-1522, and U.S. Patent No. 
5,593,853), and small organic molecule libraries {see, e.g., benzodiazepines, Baum (1993) 
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C&EN, Jan 18, page 33; isoprenoids, U.S. Patent No. 5,569,588; thiazolidinones and 
metathiazanones, U.S. Patent No. 5,549,974; pyrrolidines, U.S. Patent Nos. 5,525,735 and 
5,519,134; morpholino compounds, U.S. Patent No. 5,506,337; benzodiazepines, U.S. Patent 
No. 5,288,514; and the like). 
5 Devices for the preparation of combinatorial libraries are commercially 

available (see, e.g., 357 MPS, 390 MPS, Advanced Chem Tech, Louisville KY, Symphony, 
Rainin, Wobum, MA, 433A Applied Biosystems, Foster City, CA, 9050 Plus, Millipore, 
Bedford, MA). 

A number of well known robotic systems have also been developed for 
10 solution phase chemistries. These systems include automated workstations like the 

automated synthesis apparatus developed by Takeda Chemical Industries, LTD. (Osaka, 
Japan) and many robotic systems utilizing robotic arms (Zymate n, Zymark Corporation, 
Hopkinton, Mass.; Orca, Hewlett-Packard, Palo Alto, Calif.), which mimic the manual 
synthetic operations performed by a chemist. Any of the above devices are suitable for use 
1 5 with the present invention. The nature and implementation of modifications to these devices 
(if any) so that they can operate as discussed herein will be apparent to persons skilled in the 
relevant art. In addition, numerous combinatorial libraries are themselves commercially 
available {see, e.g., ComGenex, Princeton, N.J., Asinex, Moscow, Ru, Tripos, Inc., St. Louis, 
MO, ChemStar, Ltd, Moscow, RU, 3D Pharmaceuticals, Exton, PA, Martek Biosciences, 
20 Columbia, MD, etc.). 

The assays to identify modulators are amenable to high throughput screening. 
Preferred assays thus detect enhancement or inhibition of angiogenesis gene transcription, 
inhibition or enhancement of polypeptide expression, and inhibition or enhancement of 
polypeptide activity. 

25 High throughput assays for the presence, absence, quantification, or other 

properties of particular nucleic acids or protein products are well known to those of skill in 
the art. Similarly, binding assays and reporter gene assays are similarly well known. Thus, 
for example, U.S. Patent No. 5,559,410 discloses high throughput screemng methods for 
proteins, U.S. Patent No. 5,585,639 discloses high throughput screening methods for nucleic 

30 acid bmding {Le., in arrays), while U.S. Patent Nos. 5,576,220 and 5,541,061 disclose high 
throughput methods of screening for ligand/antibody binding. 

In addition, high throughput screening systems are commercially available 
{see, e.g., Zymark Corp., Hopkinton, MA; Air Technical Industries, Mentor, OH; Beckman 
Instruments, Inc. FuUerton, CA; Precision Systems, Inc., Natick, MA, etc.). These systems 
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typically automate entire procedures, including all sample and reagent pipetting, liquid 
dispensing, timed incubations, and final readings of the microplate in detector(s) appropriate 
for the assay. These configurable systems provide high throughput and rapid start up as well 
as a high degree of flexibihty and customization. The manufacturers of such systems provide 

5 detailed protocols for various high throughput systems. Thus, for example, Zymark Corp. 
provides technical bulletins describing screening systems for detecting the modulation of 
gene transcription, ligand binding, and the like. 

In one embodiment, modulators are proteins, often naturally occurring 
proteins or fragments of naturally occurring proteins. Thus, e.g., cellular extracts containing 

10 proteins, or random or directed digests of proteinaceous cellular extracts, may be used. In 
this way libraries of proteins may be made for screening in the methods of the invention. 
Particularly preferred in this embodiment are libraries of bacterial, fimgal, viral, and 
mammalian proteins, with the latter being preferred, and human proteins being especially 
preferred. Paticularly usefiil test compoimd will be directed to the class of proteins to which 

15 the target belongs, e.g., substrates for enzymes or ligands and receptors. 

In a preferred embodiment, modulators are peptides of fi'om about 5 to about 
30 amino acids, with fi'om about 5 to about 20 amino acids being preferred, and firom about 7 
to about 15 being particularly preferred. The peptides may be digests of naturally occurring 
proteins as is outlined above, random peptides, or **biased'* random peptides. By 

20 "randomized" or grammatical equivalents herein is meant that each nucleic acid and peptide 
consists of essentially random nucleotides and amino acids, respectively. Since generally 
these random peptides (or nucleic acids, discussed below) are chemically synthesized, they 
may incorporate any nucleotide or amino acid at any position. The synthetic process can be 
designed to generate randomized proteins or nucleic acids, to allow the formation of all or 

25 most of the possible combinations over the length of the sequence, thus forming a library of 
randomized candidate bioactive proteinaceous agents. 

In one embodiment, the library is fiilly randonciized, with no sequence 
preferences or constants at any position. In a preferred embodiment, the library is biased. 
That is, some positions within the sequence are either held constant, or are selected firom a 

30 limited number of possibilities. For example, in a preferred embodiment, the nucleotides or 
amino acid residues are randomized within a defined class, for example, of hydrophobic 
amino acids, hydrophilic residues, sterically biased (either small or large) residues, towards 
the creation of nucleic acid binding domains, the creation of cysteines, for cross-linking, 
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prolines for SH-3 domains, serines, threonines, tyrosines or histidines for phosphorylation 
sites, etc*, or to purines, etc. 

Modulators of angiogenesis can also be nucleic acids, as defined above. 
As described above generally for proteins, nucleic acid modulating agents may 
5 be naturally occurring nucleic acids, random nucleic acids, or **biased*' random nucleic acids. 
For example, digests of procaryotic or eucaryotic genomes may be used as is outlined above 
for proteins. 

In a preferred embodiment, the candidate compounds are organic chemical 
moieties, a wide variety of which are available in the literature. 

10 After the candidate agent has been added and the cells allowed to incubate for 

some period of time, the sample containing a target sequence to be analyzed is added to the 
biochip. If required, the target sequence is prepared using known techniques. For example, 
the sample may be treated to lyse the cells, using known lysis buffers, electroporation, etc., 
with purification and/or amplification such as PGR performed as appropriate. For example, 

15 an //J vitro transcription with labels covalently attached to the nucleotides is performed. 
Generally, the nucleic acids are labeled with biotin-FITC or PE, or with cy3 or cy5. 

In a preferred embodiment, the target sequence is labeled with, for example, a 
fluorescent, a chemiluminescent, a chemical, or a radioactive signal, to provide a means of 
detecting the target sequence's specific binding to a probe. The label also can be an enzyme, 

20 such as, alkaline phosphatase or horseradish peroxidase, which when provided with an 

appropriate substrate produces a product that can be detected. Altematively, the label can be 
a labeled compound or small molecule, such as an enzyme inhibitor, that binds but is not 
catalyzed or altered by the enzjnne. The label also can be a moiety or compound, such as, an 
epitope tag or biotin which specifically binds to streptavidin. For the example of biotin, the 

25 streptavidin is labeled as described above, thereby, providing a detectable signal for the 

bound target sequence. Unbound labeled streptavidin is typically removed prior to analysis. 

As will be appreciated by those in the art, these assays can be direct 
hybridization assays or can comprise "sandwich assays", which include the use of multiple 
probes, as is generally outlined in U.S. Patent Nos. 5,681,702, 5,597,909, 5,545,730, 

30 5,594,117, 5,591,584, 5,571,670, 5,580,731, 5,571,670, 5,591,584, 5,624,802, 5,635,352, 
5,594,1 18, 5,359,100, 5,124,246 and 5,681,697, aU of which are hereby incorporated by 
reference. In this embodiment, in general, the target nucleic acid is prepared as outlined 
above, and then added to the biochip comprising a plurality of nucleic acid probes, under 
conditions that allow the formation of a hybridization complex. 
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A variety of hybridization conditions may be used in the present invention, 
including high, moderate and low stringency conditions as outlined above. The assays are 
generally run under stringency conditions which allows formation of the label probe 
hybridization complex only in the presence of target. Stringency can be controlled by 
5 altering a step parameter that is a thermodynamic variable, including, but not limited to, 
temperature, formamide concentration, salt concentration, chaotropic salt concentration pH, 
organic solvent concentration, etc. 

These parameters may also be used to control non-specific binding, as is 
generally outlined in U.S. Patent No. 5,68 1,697. Thus it may be desirable to perform certain 
10 steps at higher stringency conditions to reduce non-specific binding. 

The reactions outlined herein may be accomplished in a variety of ways. 
Components of the reaction may be added simultaneously, or sequentially, in different orders, 
with preferred embodiments outUned below. In addition, the reaction may include a variety 
of other reagents. These include salts, buffers, neutral proteins, e.g, albumin, detergents, etc, 
15 which may be used to facilitate optimal hybridization and detection, and/or reduce non- 
specific or background interactions. Reagents that otherwise improve the efficiency of the 
assay, such as protease inhibitors,r nuclease inhibitors, anti-microbial agents, etc.^ may also be 
used as appropriate, depending on tiie sample preparation methods and purity of the target. 

The assay data are analyzed to determine the expression levels, and changes in 
20 expression levels as between states, of individual genes, forming a gene expression profile. 

Screens are performed to identify modulators of the angiogenesis phenotype. 
In one embodiment, screening is performed to identify modulators that can induce or 
suppress a particular expression profile, thus preferably generating the associated phenotype. 
In another embodiment, for diagnostic appUcations, having identified differentially 
25 expressed genes important in a particular state, screens can be performed to identify 

modulators that alter expression of individual genes. In an another embodiment, screening is 
performed to identify modulators that alter a biological function of the expression product of 
a differentially expressed gene. Again, having identified the importance of a gme in a 
particular state, screens are performed to identify agents that bind and/or modulate the 
30 biological activity of the gene product. 

In addition screens can be done for genes that are induced in response to a 
candidate agent. After identifying a modulator based upon its ability to suppress an 
angiogenesis expression pattern leading to a normal expression pattern, or to modulate a 
single angiogenesis gene expression profile so as to mimic the expression of the gene firom 
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normal tissue, a screen as described above can be performed to identify genes that are 
specifically modulated in response to the agent. Comparing expression profiles between 
normal tissue and agent treated angiogenesis tissue reveals genes that are not expressed in 
normal tissue or angiogenesis tissue, but are expressed in agent treated tissue. These agent- 
5 specific sequences can be identified and used by methods described herein for angiogenesis 
genes or proteins. In particular these sequences and the proteins they encode find use in 
marking or identifying agent treated cells* In addition, antibodies can be raised against the 
agent induced proteins and used to target novel therapeutics to the treated angiogenesis tissue 
sample. 

10 Thus, in one embodiment, a test compound is administered to a population of 

angiogenic cells, that have an associated angiogenesis expression profile. By 
"administration" or "contacting** herein is meant that the candidiate agent is added to the cells 
in such a manner as to allow the agent to act upon the cell, whether by uptake and 
intracellular action, or by action at the cell surface. In some embodiments, nucleic acid 

1 5 encoding a proteinaceous candidate agent (z. e. , a peptide) may be put into a viral construct 
such as an adenoviral or retroviral construct, and added to the cell, such that expression of 
the peptide agent is accomplished, e.g., PCT US97/01019. Regulatable gene ther^y systems 
can also be used. 

Once the test compound has been administ^ed to the cells, the cells can be 
20 washed if desired and are allowed to incubate xmder preferably physiological conditions for 
some period of time. The cells are then harvested and a new gene expression profile is 
generated, as outlined herein. 

Thus, for example, angiogenesis tissue may be screened for agents that 
modulate, e,g., induce or suppress the angiogenesis phenotype. A change in at least one 
25 gene, preferably many, of the expression profile indicates that the agent has an effect on 

angiogenesis activity. By defining such a signature for the angiogenesis phenotjpe, screens 
for new drags fliat alter the phenotype can be devised. With this approach, the drug target 
need not be known and need not be represented in the original expression screening platform, 
nor does the level of transcript for the target protein need to change. 
30 Measure of angiogenesis polypeptide activity, or of angiogenesis or the 

angiogenic phenotype can be performed using a variety of assays. For example, the effects of 
the test compounds upon the function of the anagiogenesis polypeptides can be measured by 
examining parameters described above. A suitable physiological change that affects activity 
can be used to assess the influence of a test compound on the polypeptides of this invention. 
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When the functional consequences are determined using intact cells or animals, one can also 
measure a variety of effects such as, in the case of angiogenesis associated with tumors, 
tumor growth, neovascularization, hormone release, transcriptional changes to both known 
and imcharacterized genetic markers (e.g., northem blots), changes in cell metabolism such as 
5 cell growth or pH changes, and changes in intracellular second messengers such as cGMP. In 
the assays of the invention, manmialian angiogenesis polypeptide is typically used, e.g., 
mouse, preferably human. 

A variety of angiogenesis assays are known to those of skill in the art. Various 
models have been employed to evaluate angiogenesis (e.g., Croix et aL, Science 289: 11 97- 

10 1202, 2000 and Kahn et al^ Amer, J. Pathol 156:1887-1900). Assessement of angiogenesis 
in the presence of a potential modulator of angiogenesis can be performed using cell-cultre- 
based angiogenesis assays, e.g., endothelial cell tube formation assays, as well as other 
bioassays such as the chick CAM assay, the mouse comeal assay, and assays measuring the 
effect of administering potential modulators on implanted tumors. The chick CAM assay is 

15 described by O'Reilly, et al Cell 79: 315-328, 1994. Briefly, 3 day old chicken embryos with 
intact yolks are separated from the egg and placed in a petri dish. After 3 days of incubation, 
a methylcellulose disc containing the protein to be tested is applied to the CAM of individual 
embryos. After about 48 hours of incubation, the embryos and CAMs are observed to 
determine whether endothelial growth has been inhibited. The mouse comeal assay involves 

20 implanting a growth factor-containing pellet, along with another pellet containing the 

suspected endothelial growth inhibitor, in the cornea of a mouse and observing the pattem of 
capillaries that are elaborated in the cornea. Angiogenesis can also be measured by 
determining the extent of neovascularization of a tumor. For example, carcinoma cells can be 
subcutaneously inoculated into athymic nude mice and tumor growth then monitored. The 

25 cancer cells are treated with an angiogenesis inhibitor, such as an antibody, or other 

compound that is exogenbusly administered, or can be transfected prior to inoculation with a 
polynucleotide inhibitor of angiogenesis. Immunoassays using endothelial cell-specific 
antibodies are typically used to stain for vasculari2sation of tumor and the number of vessels 
in the tumor. 

30 Assays to identify compoimds with modulating activity can be performed in 

vitro. For example, an angiogenesis polypeptide is first contacted with a potential modulator 
and incubated for a suitable amoimt of time, e.g., from 0.5 to 48 hours. In one embodiment, 
the angiogenesis polypeptide levels are determined in vitro by measuring the level of protein 
or mKNA. The level of protein is measured using immunoassays such as western blotting, 
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ELIS A and the like with an antibody that selectively binds to the angiogenesis polypeptide or 
a fragment thereof. For measurement of mRNA, amplification, e.g., using PGR, LCR, or 
hybridization assays, e.g., nc^em hybridization, RNAse protection, dot blotting, are 
preferred. The level of protein or mKNA is detected using directly or indirectly labeled 
5 detection agents, e.g., fluorescently or radioactively labeled nucleic acids, radioactively or 
enzymatically labeled antibodies, and the like, as described herein. 

Alternatively, a reporter gene system can be devised using the angiogenesis 
protein promoter operably linked to a reporter gene such as luciferase, green fluorescent 
protein, CAT, or p-gal. The reporter construct is typically transfected into a cell. After 

10 treatment with a potential modulator, the amoimt of reporter gene transcription, translation, or 
activity is measured according to standard techniques known to those of skill in the art. 

In a preferred embodiment, as outlined above, screens may be done on 
individual genes and gene products (proteins). That is, having identified a particular 
differentially expressed gene as important in a particular state, screening of modulators of the 

15 expression of the gene or the gene product itself can be done. The gene products of 

differentially expressed genes are sometimes referred to herein as "angiogenesis proteins". In 
preferred embodiments the angiogenesis protein comprises a sequence shown in Table 8. 
The angiogenesis protein may be a fi-agment, or alternatively, be the full length protein to a 
fragment shown herein. 

20 Preferably, the angiogenesis protein is a fragment of approximately 14 to 24 

amino acids long. More preferably the fragment is a soluble fragment. In one embodiment 
an angiogenesis protein is conjugated or ftised to an immunogenic agent or BSA. 

In one embodiment, screening for modulators of expression of specific genes 
is performed. Typically, the expression of only one or a few genes are evaluated. In another 

25 embodiment, screens are designed to first find compoimds that bind to differentially 
expressed proteins. These compounds are then evaluated for the ability to modulate 
differentially expressed activity. Moreover, once initial candidate compounds are identified, 
variants can be fiirther screened to better evaluate stracutre activity relationships. 

In a preferred embodiment, binding assays are done. In general, purified or 

30 isolated gene product is used; that is, the gene products of one or more differentially 

expressed nucleic acids are made. For example, antibodies are generated to the protein gene 
products, and standard immunoassays are run to determine the amount of protein present. 
Alternatively, cells comprising the angiogenesis proteins can be used in the assays. 
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Thus, in a preferred embodiment, the methods comprise combining an 
angiogenesis protein and a candidate compound, and detemiining the binding of the 
compoimd to the angiogenesis protein. Preferred embodiments utihze the human 
angiogenesis protein, although other mammalian proteins may also be used, for example for 
5 the development of animal models of human disease. In some embodiments, as outlined 
herein, variant or derivative angiogenesis proteins may be used. 

Generally, in a preferred embodiment of the methods herein, the angiogenesis 
protein or the candidate agent is non-diffusably bound to an insoluble support having isolated 
sample receiving areas (e.g. a microtiter plate, an array, etc.)- The insoluble supports may be 

10 made of any composition to which the compositions can be bound, is readily separated from 
soluble material, and is otherwise compatible with the overall method of screening. The 
surface of such supports may be solid or porous and of any convenient shape. Examples of 
suitable insoluble supports include microtiter plates, arrays, membranes and beads. These are 
typically made of glass, plastic (e.g., polystyrene), polysaccharides, nylon or nitrocellulose, 

15 teflon™, etc. Microtiter plates and arrays are especially convenient because a large number 
of assays can be carried out simultaneously, using small amounts of reagents and samples. 
The particular manner of binding of the composition is not crucial so long as it is compatible 
with the reagents and overall methods of the invention, maintains the activity of the 
composition and is nondiffiisable. Preferred methods of binding include the use of antibodies 

20 (which do not sterically block either the ligand binding site or activation sequence when the 
protein is bound to the support), direct binding to "sticky" or ionic supports, chemical 
crosslinking, the synthesis of the protein or agent on the surface, etc. Following binding of 
the protein or agent, excess unbound material is removed by washing. The sample receiving 
areas may then be blocked through incubation with bovine serum albmnin (BSA), casein or 

25 other innocuous protein or other moiety. 

In a preferred embodiment, the angiogenesis protein is bound to the support, 
and a test compound is added to the assay. Alternatively, the candidate agent is bound to the 
support and the angiogenesis protein is added. Novel binding agents include specific 
antibodies, non-natural binding agents identified in screens of chemical libraries, peptide 

30 analogs, etc. Of particular interest are screening assays for agents that have a low toxicity for 
human cells. A wide variety of assays may be used for this purpose, including labeled in 
vitro protein-protein binding assays, electrophoretic mobility shift assays, immunoassays for 
protein binding, functional assays (phosphorylation assays, etc.) and the Uke. 
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The determination of the binding of the test modulating compound to the 
angiogenesis protein may be done in a number of ways. In a preferred embodiment, the 
compound is labelled, and binding detemiined directly, e.g., by attaching all or a portion of 
the angiogenesis protein to a solid support, adding a labelled candidate agent a 
5 fluorescent label), washing off excess reagent, and determining whether the label is present 
on the solid support: Various blocking and washing steps may be utilized as appropriate. 

By "labeled" herein is meant that the compound is either directly or indirectly 
labeled with a label which provides a detectable signal, e.g. radioisotope, fluorescers, 
enzyme, antibodies, particles such as magnetic particles, chemiluminescers, or specific 

10 binding molecules, etc. Specific binding molecules include pairs, such as biotin and 
streptavidin, digoxin and antidigoxin, etc. For the specific binding members, the 
complementary member would normally be labeled with a molecule which provides for 
detection, in accordance with known procedures, as outlined above. The label can directly or 
indirectly provide a detectable signal. 

15 In some embodiments, only one of the components is labeled, e.g., the 

proteins (or proteinaceous candidate compoimds) can be labeled. Alternatively, more than 
one component can be labeled with different labels, e.g., ^^I for the proteinsand a fluorophor 
for the compound. Proximity reagents, e.g., quenching or energy transfer reagents are also 
usefiil. 

20 In one embodiment, the binding of the test compound is determined by 

competitive binding assay. The competitor is a binding moiety known to bind to the target 
molecule (i.e. an angiogenesis protein), such as an antibody, peptide, binding partner, ligand, 
etc. Under certain circumstances, there may be competitive binding between the compound 
and the binding moiety, with the binding moiety displacing the compound. In one 

25 embodiment, the test compound is labeled. Either the compound, or the competitor, or both, 
is added first to the protein for a time sufficient to allow binding, if present. Incubations may 
be performed at a temperature which facilitates optimal activity, typically between 4 and 
40*^C. Incubation periods are typically optimized, e.g., to facilitate rapid high throughput 
screening. Typically between 0.1 and 1 hour will be sufficient. Excess reagent is generally 

30 removed or washed away. The second component is then added, and the presence or absence 
of the labeled component is followed, to indicate binding. 

In a preferred embodiment, the competitor is added first, followed by the test 
compound. Displacement of the competitor is an indication that the test compoxmd is binding 
to the angiogenesis protein and thus is capable of binding to, and potentially modulating, the 
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activity of the angiogenesis protein. In this emhodiment, either component can be labeled. 
Thus, for example, if the competitor is labeled, the presence of label in the wash solution 
indicates displacement by the agent. Altematively, if the test compound is labeled, the 
presence of the label on the support indicates displacement. 
5 In an altemative embodiment, the test compound is added first, with 

incubation and washing, followed by the competitor. The absence of binding by the 
competitor may indicate that the test compound is boimd to the angiogenesis protein with a 
higher affinity. Thus, if the test compound is labeled, the presence of the label on the 
support, coupled with a lack of competitor binding, may indicate that the test compound is 

10 capable of binding to the angiogenesis protein. 

In a preferred embodiment, the methods comprise differential screening to 
identity agents that are capable of modulating the activitity of the angiogenesis proteins. In 
this embodiment, the methods comprise combining an angiogenesis protein and a competitor 
in a first sample. A second sample comprises a test compound, an angiogenesis protein, and 

15 a competitor. The binding of the competitor is determined for both samples, and a change, or . 
difference in binding between the two samples indicates the presence of an agent capable of 
binding to the angiogenesis protein and potentially modulating its activity. That is, if the 
binding of the competitor is differoit in the second sample relative to the first sample, the 
agent is capable of binding to the angiogenesis protein. 

20 Altematively, differential screening is used to identify drug candidates that 

bind to the native angiogenesis protein, but caimot bind to modified angiogenesis proteins. 
The structure of the angiogenesis protein may be modeled, and used in rational dmg design to 
synthesize agents that interact with that site. Drug candidates that affect the activity of an 
angiogenesis protein are also identified by screening dmgs for the ability to either enhance or 

25 reduce the activity of the protein. 

Positive controls and negative controls may be used in the assays. Preferably 
control and test samples are performed in at least triplicate to obtain statistically significant 
results. Incubation of all samples is for a time sufficient for the binding of the agent to the 
protein. Following incubation, samples are washed fi-ee of non-specifically bound material 

30 and the amount of bound, generally labeled ag^t determined. For example, where a 

radiolabel is employed, the samples may be coimted in a scintillation coimter to determine the 
amoimt of boimd compound. 

A variety of other reagents may be included in the screening assays. These 
include reagents like salts, neutral proteins, e.g. albumin, detergents, etc. which may be used 
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to facilitate optimal protein-protein binding and/or reduce non-specific or background 
interactions. Also reagents that otherwise improve the efficiency of the assay, such as 
protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may be used. The mixture 
of components may be added in an order that provides for the requisite binding. 
5 In a preferred embodiment, the invention provides methods for screening for a 

compound capable of modulating the activity of an angiogenesis protein. The methods 
comprise adding a test compound, as defined above, to a cell comprising angiogenesis 
proteins. Preferred cell types include almost any cell. The cells contain a recombinant 
nucleic acid that encodes an angiogenesis protein. In a preferred embodiment, a library of 

10 candidate agents are tested on a plurality of cells. 

In one aspect, the assays are evaluated in the presence or absence or previous 
or subsequent exposure of physiological signals, for example hormones, antibodies, peptides, 
antigens, cytokines, growth factors, action potentialis, pharmacological agents including 
chemotherapeutics, radiation, carcinogenics, or other cells (i.e. cell-cell contacts). In another 

1 5 example, the determinations are determined at different stages of the cell cycle process. 

In this way, compounds that modulate angiogenesis agents are identified. 
Compounds with pharmacological activity are able to enhance or interfere wifli the activity of 
the angiogenesis protein. Once identified, similar structures are evaluated to identify critical 
structural feature of the compound. 

20 In one embodiment, a method of inhibiting angiogenic cell division is 

provided. The method comprises administration of an angiogenesis inhibitor. In another 
embodiment, a method of inhibiting angiogenesis is provided. The method comprises 
administration of an angiogenesis inhibitor. In a further embodiment, methods of treating 
cells or individuals with angiogenesis are provided. The method comprises administration of 

25 an angiogenesis inhibitor. 

In one embodiment, an angiogenesis inhibitor is an antibody as discussed 
above. In another embodiment, the angiogenesis inhibitor is an antisense molecule. 

Polynucleotide modulators of angiogenesis 
30 , Antisense Polynucleotides 

In certain embodiments, the activity of an angiogenesis-associated protein is 
downregulated, or entirely inhibited, by the use of antisense poljmucleotide, i. e., a nucleic 
acid complementary to, and which can preferably hybridize specifically to, a coding mRNA 
nucleic acid sequence, e.g., an angiogenesis protein mKNA, or a subsequence thereof. 
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Binding of the antisense polynucleotide to the mRNA reduces the translation and/or stability 
ofthemRNA. 

In the context of this invention, antisense polynucleotides can comprise 
naturally-occurring nucleotides, or synthetic species formed from naturally-occurring 
5 subunits or their close homologs. Antisense polynucleotides may also have altered sugar 
moieties or inter-sugar linkages. Exemplary among these are the phosphorothioate and other 
sulfur containing species which are known for use in the art. Analogs are comprehended by 
this invention so long as they function effectively to hybridize with the angiogenesis protein 
mRNA. See, e.g., Isis Pharmaceuticals, Carlsbad, CA; Sequitor, Inc., Natick, MA. 

1 0 Such antisense polynucleotides can readily be synthesized using recombinant 

means, or can be synthesized in vitro. Equipment for such synthesis is sold by several 
vendors, including Applied Biosystems. The preparation of other oligonucleotides such as 
phosphorothioates and alkylated derivatives is also well known to those of skill in the art. 
Antisense molecules as used h^ein include antisense or sense 

1 5 oligonucleotides. Sense oligonucleotides can, e.g., be employed to block trancription by 

binding to the anti-sense strand. The antisense and sense oligonucleotide comprise a single- 
stranded nucleic acid sequence (either RNA or DNA) capable of binding to target mRNA 
(sense) or DNA (antisense) sequences for angiogenesis molecules. A preferred antisense 
molecule is for an angiogenesis sequences in Tables 1-8 , or for a ligand or activator thereof. 

20 Antisense or sense oligonucleotides, according to the present invention, comprise a fragment 
generally at least about 14 nucleotides, preferably from about 14 to 30 nucleotides. The 
ability to derive an antisense or a sense oligonucleotide, based upon a cDNA sequence 
encoding a given protein is described in, for example, Stein and Cohen (Cancer Res. 48:2659, 
1988) and van der Krol et al. (BioTechniques 6:958, 1988). 

25 

Ribozynies 

In addition to antisense polynucleotides, ribozymes can be used to target and 
inhibit transcription of angiogenesis-associated nucleotide sequences. A ribozyme is an RNA 
molecule that catalytically cleaves other KNA molecules. Different kinds of ribozymes have 
30 been described, including group I ribozymes, hammerhead ribozymes, hairpin ribozymes, 

RNase P, and axhead ribozymes (see, e.g,, Castanotto et al. (1994) Adv. in Pharmacology 25: 
289-3 17 for a general review of the properties of different ribozymes). 

The general features of hairpin ribozymes are described, e,g., in Hampel et al. 
(1990) Nucl. Acids Res. 18: 299-304; Hampel et al. (1990) European Patent Publication No. 0 
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360 257; U.S. Patent No. 5,254,678. Methods of preparing are well known to those of skill in 
the art {see, e.g., Wong-Staal et aL, WO 94/26877; Ojwang et al (1993) Proc. Natl Acad. 
Scu USA 90: 6340^6344; Yamada et al. (1994) Human Gene Tfierapy 1: 39-45; Leavitt et al 
(1995) Proa. Natl. Acad. ScL USA 92: 699-703; Leavitt et al (1994) Human Gene Therapy 5: 
5 1 151-120; and Yamada et al (1994) Virology 205: 121-126). 

Polynucleotide modulators of angiogenesis may be introduced into a cell 
containing the target nucleotide sequence by formation of a conjugate with a ligand binding 
molecule, as described in WO 91/04753. Suitable ligand binding molecules include, but are 
not limited to, cell surface receptors, growth factors, other cytokines, or other ligands that 

10 bind to cell surface receptors. Preferably, conjugation of the ligand binding molecule does 
not substantially interfere with the ability of the hgand binding molecule to bind to its 
corresponding molecule or receptor, or block entry of the sense or antisense oligonucleotide 
or its conjugated version into the cell. Alternatively, a polynucleotide modulator of 
angiogenesis may be introduced into a cell containing the target nucleic acid sequence, e.g., 

15 by formation of an polynucleotide-Upid complex, as described in WO 90/10448. It is 

understood that the use of antisense molecules or knock out and knock in models may also be 
used in screening assays as discussed above, in addition to methods of treatment. 

Thus, in one embodiment, methods of modulating angiogenesis in cells or 
organisms are provided. In one embodiment, the methods comprise administering to a cell an 

20 anti-angiogenesis antibody that reduces or eliminates the biological activity of an 

endogeneous angiogenesis protein. Alternatively, the methods comprise, administering to a 
cell or organism a recombinant nucleic acid encoding an angiogenesis protein. This may be 
accomplished in any number of ways. In a preferred embodiment, for example when the 
angiogenesis sequence is down-regulated in angiogenesis, such state may be reversed by 

25 increasing the amount of angiogenesis gene product in the cell. This can be accomplished, 

by overexpressing the endogeneous angiogenesis gene or administering a gene encoding 
the angiogenesis sequence, using known gene-therapy techniques, for example. In a 
preferred embodiment, the gene therapy techniques include the incorporation of the 
exogenous gene using enhanced homologous recombination (EHR), for example as described 

30 in PCT/US93/03868, hereby incorporated by reference in its entireity. Alternatively, for 

example when the angiogenesis sequence is up-regulated in angiogenesis, the activity of the 
endogeneous angiogenesis gene is decreased, for example by the administration of a 
angiogenesis antisense nucleic acid or other inhibitor, such as RNAi. 
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In one embodiment, the angiogenesis eproteins of the present invention may 
be used to generate polyclonal and monoclonal antibodies to angiogenesis proteins. 
Similarly, the angiogenesis proteins can be coupled, using standard technology, to affinity 
chromatography columns. These columns may then be used to purify angiogenesis 
5 antibodies useful for production, diagnostic, or therapeutic purposes. In a preferred 

embodiment, the antibodies are generated to epitopes unique to a angiogenesis protein; that 
is, the antibodies show little or no cross-reactivity to other proteins. The angiogenesis 
antibodies may be coupled to standard affinity chromatography columns and used to purify 
angiogenesis proteins. The antibodies may also be used as blocking polypeptides, as outlined 
10 above, since they will specifically bind to the angiogenesis protein. 

Methods of identifying variant angiogenesis-associated sequences 

Without being bound by theory, expression of various angiogenesis sequences 
is correlated with angiogenesis. Accordingly, disorders based on mutant or variant 

1 5 angiogenesis genes may be determined. In one embodiment, the invention provides methods 
for identifying cells containing variant angiogenesis genes, e.g,y determining all or part of the 
sequence of at least one endogeneous angiogenesis genes in a cell. This may be 
accomplished using any number of sequencing techniques. In a preferred embodiment, the 
invention provides methods of identifying the angiogenesis genotype of an individual, 

20 detemiining all or part of the sequence of at least one angiogenesis gene of the individual. 

This is generally done in at least one tissue of the individual, and may include the evaluation 
of a number of tissues or different samples of the same tissue. The method may include 
comparing the sequence of the sequenced angiogenesis gene to a known angiogenesis gene. 
I.e., a wild-type gene. 

25 The sequence of all or part of the angiogenesis gene can then be compared to 

the sequence of a known angiogenesis gene to determine if any differences exist. This can be 
done using any niraiber of known homology programs, such as Bestfit, etc. In a preferred 
embodiment, the presence of a a difference in the sequence between the angiogenesis gene of 
the patient and the known angiogenesis gene correlates with a disease state or a propensity 

30 for a disease state, as outlined herein. 

In a preferred embodiment, the angiogenesis genes are used as probes to 
determine the number of copies of the angiogenesis gene in the genome. 

In another preferred embodiment, the angiogenesis genes are used as probes to 
determine the chromosomal locaUzation of the angiogenesis genes. Information such as 
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chromosomal localization finds use in providing a diagnosis or prognosis in particular when 
chromosomal abnormalities such as translocations, and the like are identified in the 
angiogenesis gene locus. 

5 Administration of pharmaceutical and vaccine compositions 

In one embodiment, a therapeutically effective dose of an angiogenesis protein 
or modulator thereof, is administered to a patient. By "therapeutically effective dose" herein 
is meant a dose that produces effects for which it is administered. The exact dose will depend 
on the purpose of the treatment, and will be ascertainable by one skilled in the art using 

10 known techniques Ansel ai, Pharmaceuitcal Dosage Forms and Dmg Delivery, 
Lippincott, Williams & WiUdns Publishers, ISBN:0683305727; Lieberman (1992) 
Pharmaceutical Dosage Forms (vols. 1-3), Dekker, ISBN 0824770846, 082476918X, 
0824712692, 0824716981; Lloyd (1999) The Art, Science and Technology of Pharmaceutical 
Compounding, Amer. Pharmacutical Assn, ISBN 0917330889; and Pickar (1999) Dosage 

1 5 Calculations, Delmar Pub, ISBN 0766805042), As is known in the art, adjustments for 
angiogenesis degradation, systemic versus localized delivery, and rate of new protease 
synthesis, as well as the age, body weight, general health, sex, diet, time of administration, 
drug interaction and the severity of the condition may be necessary, and will be ascertainable 
with routine experimentation by those skilled in the art. 

20 A "patient" for the purposes of the present invention includes both humans and 

other animals, particularly mammals. Thus the methods are appUcable to both hmnan 
therapy and veterinary appUcations. In the preferred embodiment the patient is a mammal, 
preferably a primate,.and in the most preferred embodiment the patient is human. 

The administration of the angiogenesis proteins and modulators thereof of the 

25 present invention can be done in a variety of ways as discussed above, including, but not 
limited to, orally, subcutaneously, intravenously, intranasally, transdermally, 
intraperitoneally, intramuscularly, intrapulmonary, vaginally, rectally, or intraocularly. In 
some instances, for example, in the treatment of wounds and inflammation, the angiogenesis 
proteins and modulators may be directly applied as a solution or spray. 

30 The phamiaceutical compositions of the present invention comprise an 

angiogenesis protein in a form suitable for administration to a patient. In the preferred 
embodiment, the pharmaceutical compositions are in a water soluble form, such as being 
present as pharmaceutically acceptable salts, which is meant to include both acid and base 
addition salts. "Phamiaceutically acceptable acid addition salt" refers to those salts that retain 
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the biological effectiveness of the free bases and that are not biologically or otherwise 
undesirable, formed with inorganic acids such as hydrochloric acid, hydrobromic acid, 
sulfuric acid, nitric acid, phosphoric acid and the like, and organic acids such as acetic acid, 
propionic acid, glycolic acid, pyruvic acid, oxalic acid, maleic acid, malonic acid, succinic 
5 acid, fumaric acid, tartaric acid, citric acid, benzoic acid, cinnamic acid, mandelic acid, 

methanesulfonic acid, ethanesulfonic acid, p-toluenesulfonic acid, salicylic acid and the like. 
"Pharmaceutically acceptable base addition salts" include those derived from inorganic bases 
such as sodium, potassium, Uthium, anmionium, calcium, magnesium, iron, zinc, copper, 
manganese, aluminiun salts and the like. Particularly preferred are the anunonium, 

10 potassium, sodium, calcium, and magnesixmi salts. Salts derived from pharmaceutically 

acceptable organic non- toxic bases include salts of primary, secondary, and tertiary amines, 
substituted amines including naturally occurring substituted amines, cyclic amines and basic 
ion exchange resins, such as isopropylamine, trimethylamine, diethylamine, triethylauMne, 
tripropylamine, and ethanolamine. 

1 5 The pharmaceutical compositions may also include one or more of the 

following: carrier proteins such as serum albxmiin; buffers; fillers such as microcrystalline 
cellulose, lactose, com and other starches; binding agents; sweeteners and other flavoring 
agents; coloring agents; and polyethylene glycol. 

The pharmaceutical compositions can be administered in a variety of unit 

20 dosage forms depending upon the method of administration. For example, unit dosage forms 
suitable for oral administration include, but are not limited to, powder, tablets, pills, capsules 
and lozenges. It is recognized that angiogenesis protein modulators (e.g., antibodies, 
antisense constructs, ribozymes, small organic molecules, etc.) when administered orally, 
should be protected from digestion. This is typically accomplished eittier by complexing the 

25 molecule(s) with a composition to render it resistant to acidic and enzymatic hydrolysis, or by 
packaging the molecule(s) in an appropriately resistant carrier, such as a liposome or a 
protection barrier. Means of protecting agents from digestion are well known in the art. 

The compositions for administration will commonly comprise an angiogenesis 
protein modulator dissolved in a pharmaceutically acceptable carrier, preferably an aqueous 

30 carrier. A variety of aqueous carriers can be used, e.g., buffered saline and the like. These 
solutions are sterile and generally free of undesirable matter. These compositions may be 
steriUzed by conventional, well known sterilization techniques. The compositions may 
contain pharmaceutically acceptable auxiliary substances as required to approximate 
physiological conditions such as pH adjusting and buffering agents, toxicity adjusting agents 
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and the like, for example, sodium acetate, sodium chloride, potassium chloride, calciiun 
chloride, sodium lactate and the like. The concentration of active agent in these formulations 
can vary widely, and will be selected primarily based on fluid volumes, viscosities, body 
weight and the like in accordance with the particular mode of achninistration selected and the 
5 patient*s needs (e.g:. Remington 's Pharmaceutical Science, 15th ed.. Mack Publishing 
Company, Easton, Peimsylvania (1980) and Goodman and Gillman, The Phamiacologial 
Basis ofTIterapeutics,(H2irdrmn, J.G, Limbird, L.E, Molinoff, P.B., Ruddon, R.W, and 
Oilman, A,G.,eds) TheMcGraw-Hill Companies, Inc., 1 996). 

Thus, a typical pharmaceutical composition for intravenous administration 

10 would be about 0.1 to 10 mg per patient per day. Dosages from 0.1 up to about 100 mg per 
patient per day may be used, particularly when the dmg is adntiinistered to a secluded site and 
not into the blood stream, such as into a body cavity or into a lumen of an organ. 
Substantially higher dosages are possible in topical administration. Actual methods for 
preparing parenterally administrable compositions will be known or apparent to those skilled 

15 in the art, e.g.. Remington 's Pharmaceutical Science and Goodman and Gilhnan, ITie 
Pharmacologial Basis of Therapeutics^ supra. 

The compositions containing modulators of angiogenesis proteins can be 
administered for therapeutic or prophylactic treatments. In therapeutic applications, 
compositions are administered to a patient suffering from a disease {e.g,, a cancer) in an 

20 amount sufficient to cure or at least partially arrest the disease and its complications. An 

amoimt adequate to accomplish this is defined as a "therapeutically effective dose." Amounts 
effective for this use will depend upon the severity of the disease and the general state of the 
patient's health. Single or multiple administratioiis of the compositions may be administered 
depending on the dosage and frequency as reqiured and tolerated by the patient. In any event, 

25 the composition should provide a sufficient quantity of the agents of this invention to 

efifectively treat the patient. An amount of modulator fliat is capable of preventing or slowing 
the development of cancer in a mammal is referred to as a *^rophylactically effective dose." 
The particular dose required for a prophylactic treatment will dep^d upon the medical 
condition and history of the mammal, the particular cancer being prevented, as well as other 

30 factors such as age, weight, gender, administration route, efficiency, etc. Such prophylactic 
treatments may be used, e.g., in a mammal who has previously had cancer to prevent a 
recurrence of the cancer, or in a mammal who is suspected of having a significant likelihood 
of developing cancer. 
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It will be appreciated that the present angiogenesis protein-modulating 
compounds can be administered alone or in combination with additional angiogenesis 
modulating compounds or with other therapeutic agent, e.g., other anti-cancer agents or 
treatments. 

5 In numerous embodiments, one or more nucleic acids, e.g., polynucleotides 

comprising nucleic acid sequences set forth in Tables 1-8 , such as antisense polynucleotides 
or ribozymes, will be introduced into cells, in vitro or in vivo. The present invention provides 
methods, reagents, vectors, and cells useful for expression of angiogenesis-associated 
polypeptides and nucleic acids using in vitro (cell-free), ex vivo or in vivo (cell or 

10 organism-based) recombinant expression systems. 

The particular procedure used to introduce the nucleic acids into a host cell for 
expression of a protein or nucleic acid is application specific. Many procedures for 
introducing foreign nucleotide sequences into host cells may be used. These include the use 
of calcium phosphate transfection, spheroplasts, electroporation, liposomes, microinjection, 

15 plasma vectors, viral vectors and any of the other well known methods for introducing cloned 
genomic DNA, cDNA, synthetic DNA or other foreign genetic material into a host cell {see, 
e.g., Berger and Kimmel, Guide to Molecular Cloning Techniques, Methods in Enzymology 
volume 152 Academic Press, Inc., San Diego, CA (Berger), F.M. Ausubel et aL, eds.. Current 
Protocols, a joint venture between Greene Publishing Associates, Inc. and John Wiley & 

20 Sons, Inc., (supplemented through 1 999), and Sambrook et aL, Molecular Cloning - A 
Laboratory Manual (2nd Ed.), Vol. 1-3, Cold Spring Harbor Laboratory, Cold Spring 
Harbor, New York, 1989. 

In a preferred embodiment, angiogenesis proteins and modulators are 
adnoinistered as therapeutic agents, and can be formulated as outlined above. Similarly, 

25 angiogenesis genes (including both the full-length sequence, partial sequences, or regulatory 
sequences of the angiogenesis coding regions) can be administered in a gene therapy 
application. These angiogenesis genes can include antisense applications, either as gene 
therapy (i.e. for incorporation into the genome) or as antisense compositions, as will be 
appreciated by those in the art. 

30 Angiogenesis polypeptides and polynucleotides can also be administered as 

vaccine compositions to stimulate HTL, CTL and antibody responses.. Such vaccine 
compositions can include, for example, lipidated peptides (eg^.jVitiello, A. et al., J, Clin. 
Invest. 95:341, 1995), peptide compositions encapsulated in poly(DL-lactide-co-glycolide) 
CTLG") microspheres {see, e.g., Eldridge, et al, Molec. Immunol 28:287-294, 1991 : Alonso 
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etaL, Vaccine 12:299-306, 1994; Jones et al. Vaccine 13:675-681, 1995), peptide 
compositions contained in immune stimulating complexes (ISCOMS) (jee, e.^., Takahashi et 
al. Nature 344:873-875, 1990; Hu et al, Clin Exp Immunol 1 13:235-243, 1998), multiple 
antigen peptide systems (MAPs) {see e.g.. Tarn, J. P., Proc. Natl Acad. Set U.S.A. 85:5409- 
5 5413, 1988; Tarn, J.P., J. Immunol Methods 196:17-32, 1996), peptides foraiulated as 
multivalent peptides; peptides for use in ballistic delivery systems, typically crystallized 
peptides, viral delivery vectors (Perkus, M. E. et al. In: Concepts in vaccine development, 
Kaufinann, S. H. E., ed., p. 379, 1996; Chakrabarti, S. et al. Nature 320:535, 1986; Hu, S. L. 
et al. Nature 320:537, 1986; Kieny, M.-R et al, AIDS Bio/Technology 4:790, 1986; Top, F. 

10 H. et al, J. Infect. Dis. 124:148, 1971; Chauda, P. K. et a/., Virology 175:535, 1990), 

particles of viral or synthetic origin (e,g,, Kofler, N. et al,J. Immunol Methods. 192:25, 
1996; Eldridge, J. n. et al, Sem. Hematol 30:16, 1993; Falo, L. D., Jr, et al. Nature Med. 
7:649, 1995), adjuvants (Warren, H. S., Vogel, F. R., and Chedid, L. A. Annu. Rev. Immunol 
4:369, 1986; Gupta, R. K. et al. Vaccine 11:293, 1993), liposomes (Reddy, R. et al, J. 

15 Immunol 148:1585, 1992; Rock, K. L., Immunol Today 17:131, 1996), or, naked or particle 
absorbed cDNA (Ulmer, J. B. et al, Sci&tce 259:1745, 1993; Robinson, H. L., Hunt, L. A., 
and Webster, R. G., Vaccine 1 1:957, 1993; Shiver, J. W. et al. In: Concepts in vaccine 
development, Kaufinann, S. H. E., ed., p. 423, 1996; Cease, K. B., and Berzofeky, J. A., 
Annu. Rev. Immunol 12:923, 1994 and Eldridge, J. H. et al, Sem. Hematol 30:16, 1993). 

20 Toxin-targeted delivery technologies, also known as receptor mediated targeting, such as 
those of Avant Immunotherapeutics, Inc. (Needham, Massachusetts) may also be used. 

Vaccine compositions often include adjuvants. Many adjuvants contain a 
substance designed to protect the antigen from rapid catabolism, such as aluminum hydroxide 
or mineral oil, and a stimulator of immune responses, such as lipid A, Bortadella pertussis or 

25 Mycobacterium tuberculosis derived proteins. Certain adjuvants are commercially available 
as, for example, Freund's Incomplete Adjuvant and Complete Adjuvant (Difco Laboratories, 
Detroit, MI); Merck Adjuvant 65 (Merck and Company, Inc., Rahway, NJ); AS-2 
(SmithKline Beecham, Philadelphia, PA); alumimrai salts such as aluminum hydroxide gel 
(alimi) or aluminum phosphate; salts of calcium, iron or zinc; an insoluble suspension of 

30 acylated tyrosine; acylated sugars; cationically or anionically derivatized polysaccharides; 
polyphosphazenes; biodegradable microspheres; monophosphoryl lipid A and quil A. 
Cytokines, such as GM-CSF, interleukin-2, -7, -12, and other like growth factors, may also be 
used as adjuvants. 
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Vaccines can be administered as nucleic acid compositions wherein DNA or 
RNA encoding one or more of the polypeptides, or a fragment thereof, is administered to a 
patient. This approach is described, for instance, in Wolff et al.y Science 247:1465 (1990) as 
well as U.S. Patent Nos. 5,580,859; 5,589,466; 5,804,566; 5,739,118; 5,736,524; 5,679,647; 
5 WO 98/04720; and in more detail below. Examples of DNA-based delivery technologies 
include "naked DNA", facilitated (bupivicaine, polymers, peptide-mediated) delivery, 
cationic lipid complexes, and particle-mediated ("gene gmi") or pressure-mediated delivery 
(see, e.g., U.S. Patent No. 5,922,687). 

For therapeutic or prophylactic immunization purposes, the peptides of the 

10 irivention can be expressed by viral or bacterial vectors, Examples of expression vectors 

include attenuated viral hosts, such as vaccinia or fowlpox. This approach involves the use of 
vaccinia virus, for example, as a vector to express nucleotide sequences that encode 
angiogenic polypeptides or polypeptide fragments. Upon introduction into a host, the 
recombinant vaccinia vims expresses the immunogenic peptide, and thereby elicits an 

1 5 immune response. Vaccinia vectors and methods useful in immunization protocols are 
described in, e.g., U.S. Patent No. 4,722,848. Another vector is BCG (Bacille Cahnette 
Guerin). BCG vectors are described in Stover a/., iVatore 351:456-460 (1991). A wide 
variety of other vectors usefril for therapeutic administration or inmiunization e.g. adeno and 
adeno-associated virus vectors, retroviral vectors. Salmonella typhi vectors, detoxified 

20 anthrax toxin vectors, and the like, will be apparent to those skilled in the art from the 

description herein (see, e.g., Shata et al (2000) Mol Med Today, 6: 66-71; Shedlock et aL, J 
Leukoc Biol 68,:793-806, 2000; Hipp et al. In Vivo 14:571-85, 2000). 

Methods for the use of genes as DNA vaccines are well known, and include 
placing an angiogenesis gene or portion of an angiogenesis gene under the control of a 

25 regulatable promoter or a tissue-specific promoter for expression in an angiogenesis patient. 
The angiogenesis gene used for DNA vaccines can encode full-length angiogenesis proteins, 
but more preferably encodes portions of the angiogenesis proteins including peptides derived 
from the angiogenesis protein. In one embodiment, a patient is immunized with a DNA 
vaccine comprising a plurality of nucleotide sequences derived from an angiogenesis gene. 

30 For example, angiogenesis-associated genes or sequence encoding subfiiugments of an 
angiogenesis protein are introduced into e3q>ression vectors and tested for their 
inununogenicity in the context of Class I MHC and an ability to generate cytotoxic T cell 
responses. This procedm^e provides for production of cytotoxic T cell responses against cells 
which present antigen, including intracellular epitopes. 
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In a preferred embodiment, the DNA vaccines include a gene encoding an 
adjuvant molecule with the DNA vaccine. Such adjuvant molecules include cytokines that 
increase the immunogenic response to the angiogenesis polypeptide encoded by the DNA 
vaccine. Additional or alternative adjuvants are available. 
5 In another preferred embodiment angiogenesis genes find use in generating 

animal models of angiogenesis. When the angiogenesis gene id^tified is repressed or 
diminished in angiogenesic tissue, gene therapy technology, e.g., wherein antisense RNA 
directed to the angiogenesis gene will also diminish or repress expression of the gene. 
Animal models of angiogenesis find use in screening for modulators of an angiogenesis- 

10 associated sequence or modulators of angiogenesis. Similarly, transgenic animal technology 
including gene knockout technology, for example as a result of homologous recombination 
with an appropriate gene targeting vector, will result in the absence or increased expression 
of the angiogenesis protein. When desired, tissue-specific expression or knockout of the 
angiogenesis protein may be necessary. 

15 It is also possible that the angiogenesis protein is overexpressed in 

angiogenesis. As such, transgenic animals can be generated that overexpress the 
angiogenesis protein. Depending on the desired expression level, promoters of various 
strengths can be employed to express the transgene. Also, the number of copies of the 
integrated transgene can be determined and compared for a determination of the expression 

20 level of the transgene. Animals generated by such methods find use as animal models of 

angiogenesis and are additionally usefiil in screening for modulators to treat angiogenesis or 
to evaluate a therapeutic entity. 

Kits for Use in Diagnostic and/or Prognostic Applications 

25 For use in diagnostic, research, and therapeutic applications suggested above, 

kits are also provided by the invention. In the diagnostic and research applications such kits 
may include any or all of the following: assay reagents, buffers, angiogenesis-specific nucleic 
acids or antibodies, hybridization probes and/or primers, antisense polynucleotides, 
ribozymes, dominant negative angiogenesis polypeptides or polynucleotides, small molecules 

30 inhibitors of angiogenesis-associated sequences etc. A therapeutic product may include 
sterile saline or another pharmaceutically acceptable emulsion and suspension base. 

In addition, the kits may include instructional materials containing directions 
(i.e., protocols) for the practice of the methods of this invention. While the instmctional 
materials typically comprise written or printed materials they are not limited to such. Any 
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medium capable of storing such instructions and conununicating them to an end user is 
contemplated by this invention. Such media include, but are not limited to electronic storage 
media (e.g., magnetic discs, tapes, cartridges, chips), optical media (e.g^., CD ROM), and the 
like. Such media may include addresses to internet sites that provide such instructional 
S materials. 

The present invention also provides for kits for screening for modulators of 
angiogenesis-associated sequences. Such kits can be prepared from readily available 
materials and reagents. For example, such kits can comprise one or more of the following 
materials: an angiogenesis-associated polypeptide or polynucleotide, reaction tubes, and 

10 instructions for testing angiogenic-associated activity. Optionally, the kit contains 

biologically active angiogenesis protein. A wide variety of kits and components can be 
prepared according to the present invention, depending upon the intended user of the kit and 
the particular needs of the user. Diagnosis would typically involve evaluation of a plurality 
of genes or products. The genes will be selected based on correlations with important 

1 5 parameters in disease which may be identified in historical or outcome data. 

It is understood that the examples described above in no way serve to limit 
the true scope of this invention, but rather are presented for illustrative purposes. All 
publications, sequences of accession numbers, and patent applications cited in this 
20 specification are herein incorporated by reference as if each individual publication or patent 
application were specifically and individually indicated to be incorporated by reference. 

EXAMPLES 

25 Example 1: Tissue Preparation, Labeling Chips, and Fingerprints 
Purify total RNA from tissue using TRIzol Reagent 

Homogenize tissue samples in 1ml of TRIzol per 50mg of tissue using a 
PolytronSlOOhomogenizer. The generator/probe used depends upon the tissue size, A 
generator that is too large for the amount of tissue to be homogenized will cause a loss of 

30 sample and lower KNA yield. TRIzol is added directly to frozen tissue, which is then 

homogenize. Following homogenization, insoluble material is removed by centrifiigation at 
7500 x g for 15 min in a Sorvall superspeed or 12,000 x g for 10 nnin. in an Eppendorf 
centrifiige at A^C. The clear homogenate is transferred to a new tube for use. The samples 
may be frozen now at -60° to -TO^'C (and kept for at least one month). The homogenate is 
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mixed with 0.2ml of chlorofomi per 1ml of TRIzol reagent used in the original 
homogenization and incubated at room temp, for 2-3 minutes. The aqueous phase is then 
separated by centrifugation and transferred to a firesh tube and the RNA precipitated using 
isopropyl alcohol. The pellet is isolated by centrifugation, washed, air-dried, resuspended in 
5 an appropriate volume of DEPC H2O, and the absorbance measured. 

Purification of poly A+ mRNA firom total RNA is performed as follows. Heat 
an oligotex suspension to 37°C and mixing immediately before adding to RNA. The 
Elution Buffer is heated at 70°C. Warm up 2 x Binding Buffer at 65°C if there is precipitate 
in the buffer. Mix total RNA with DEPC-treated water, 2 x Binding Buffer, and Oligotex 

10 according to Table 2 on page 16 of the Oligotex Handbook. Incubate for 3 minutes at 65°C. 
Incubate for 10 minutes at room temperature. Centrifuge for 2 minutes at 14,000 to 18,000 g. 
Remove supernatant without disturbing Oligotex pellet. A little bit of solution can be left 
behind to reduce the loss of Oligotex. Gently resuspend in Wash Buffer OW2 and pipet onto 
spin column. Centrifuge the spin column at full speed for 1 minute. Transfer spm column to 

15 a new collection tube and gently resuspend in Wash Buffer OW2 and centrifuge as describe 
herein. Transfer spin column to a new tube and elute with 20 to 100 ul of preheated (70oC) 
Elution Buflfer. Gently resuspend Oligotex resin by pipetting up and down. Centrifuge as 
above. Repeat elution with fresh elution buffer or use first eluate to keep the elution volume 
low. Read absorbance, using diluted Elution Buffer as the blank. Before proceeding with 

20 cDNA synthesis, precipitate the mRNA as follows: add 0.4 vol. of 7.5 M NH40Ac -f 2.5 vol. 
of cold 100% ethanol. Precipitate at -20oC 1 hour to overnight (or 20-30 min. at -TOoC). 
Centrifuge at 14,000-16,000 x g for 30 minutes at 4oC. Wash pellet with 0.5ml of 
80%ethanol (-20oC) then centrifuge at 14,000-16,000 x g for 5 minutes at room temperature- 
Repeat 80% ethanol wash. Air dry the ethanol from the pellet in the hood.. Suspend pellet m 

25 DEPC H2O at lug/ul concentration. 

To further Clean up total RNA using Qiagen*s RNeasy kit, add no more than 
lOOug to an RNeasy column. Adjust sample to a volume of lOOul with RNase-free water. 
Add 350ul Buffer RLT then 250ul ethanol (100%) to the sample. Mix by pipetting (do not 
centrifuge) then apply sample to an RNeasy mini spin column. Centrifuge for 15 sec at 

30 >10,000rpm. Transfer column to a new 2-ml collection tube. Add 500ul Buffer RPE and 
centrifuge for 15 sec at >10,000rpm. Discard flowthrough. Add 500ul Buffer RPE and 
centrifuge for 15 sec at >10,000rpm. Discard flowthrough then centrifuge for 2 min at 
maximum speed to dry column membrane. Transfer column to a new 1.5-ml collection tube 
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and apply 30-5 Oul of RNase-free water directly onto column membrane. Centrifuge 1 min at 
>10,000rpm. Repeat elution. and read absorbance. 

cDNA synthesis using Gibco's "Superscript Choice System for cDNA Synthesis" kit 
5 First Strand cDNA synthesis is performed as follows. Use 5ug of total KNA 

or lug of poly A+ mRNA as starting material. FortotalRNA, use 2ul of Superscript RT. For 
polyA+ mRNA, use lul of Superscript RT. Final volume of first strand synthesis mix is 
20uL RNA must be in a volume no greater than lOul. Incubate RNA with lul of lOOpmol 
T7-T24 oligo for 10 min at 70C. On ice, add 7 ul of: 4ul 5X 1st Strand Buffer, 2ul of O.IM 

1 0 DTT, and 1 ul of 1 OmM dNTP mix. Incubate at 37C for 2 min then add Superscript RT. 
Incubate at 37C for 1 hour. 

For the second strand synthesis, place 1st strand reactions on ice and add: 91ul 
DEPC H2O; 30ul 5X 2nd Strand Buffer; 3ul lOmM dNTP mix; lul lOU/ul E.coli DNA 
Ligase; 4ul lOU/ul E.coli DNA Polymerase; and lul 2U/ul RNase H, Mix and incubate 2 

15 hours at 16C. Add 2ul T4 DNA Polymerase. Incubate 5 min at 16C, Add lOul of 0.5M 
EDTA. A further clean-up of DNA is performed using phenol:chloroform:isoamyl Alcohol 
(25:24:1) purification. 

In vitro Transcription (IVT) and labeling with biotin is performed as follows: 
Pipet 1 .5ul of cDNA into a thin-wall PCR tube. Make NTP labeling mix by combining 2ul T7 

20 lOxATP (75mM) (Ambion); 2ul T7 lOxGTP (75mM) (Ambion); 1.5ul T7 lOxCTP (75mM) 
(Ambion); 1.5ul T7 lOxUTP (75mM) (Ambion); 3.75ul lOmM Bio-ll-UTP (Boehringer- 
Mannheim/Roche or Enzo); 3.75ul lOmM Bio-16-CTP (Enzo); 2ul lOx T7 transcription 
buffer (Ambion); and 2ul lOx T7 enzyme mix (Ambion). The final volume is 20ul. Incubate 
6 hours at 37°C in a PCR machine. The RNA can be furthered cleaned. 

25 Fragmentation is performed as follows. 1 5 ug of labeled RNA is usually 

fragmented. Try to ndnimize the fragmentation reaction volume; a 10 ul volume is 
recommended but 20 ul is all right. Do not go higher than 20 ul because the magnesium in 
the fragmratation buffer contributes to precipitation in the hybridization buffer. Fragment 
RNA by incubation at 94 C for 35 minutes in 1 x Fragmentation buffer (5 x Fragmentation 

30 buffer is 200 mM Tris-acetate, pH 8. 1 ; 500 mM KOAc; 1 50 mM MgOAc), The labeled 
RNA transcript can be analyzed before and after fragmentation. Samples can be heated to 
65°C for 15 minutes and electrophoresed on 1% agarose/TBE gels to get an approximate idea 
of the transcript size range 
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For hybridization, 200 ul (lOug cRNA) of a hybridization mix is put on the 
chip. If multiple hybridizations are to be done (such as cycling through a 5 chip set), then it 
is recommended that an initial hybridization mix of 300 ul or more be made. The 
hybridization mix is: fragment labeled RNA (SOng/ul final cone); 50 pM 948-b control 
5 oligo; 1 .5 pM BioB; 5 pM BioC; 25 pM BioD; 100 pM CRE; O.lmg/ml herring sperm DNA; 
0,5mg/ml acetylated BSA; and 300 ul with IxMES hyb buffer. 

Labeling is performed as follows: The hybridization reaction includes non- 
biotinylated IVT (purified by RNeasy columns); IVT antisense RNA 4 |ag:jLil; random 
Hexamers (1 pg/|xl) 4 jiil and water to 14 ul. The reaciton is incubated at TO^'C, 10 min. 

10 Reverse transcriptionis performed in the following reaction: 5X First Strand (BRL) buffer, 6 
^il; 0.1 M DTT, 3 jxl; 50X dNTP mix, 0.6 ^il; H2O, 2.4 jil; Cy3 or Cy5 dUTP (ImM), 3 ^il; SS 
RT n (BRL), 1 |xl in a final volume of 16 |li1. Add to hybridization reaction. Licubate 30 
min., 42^C. Add I ul SSII and incubate another hour. Put on ice, 50X dNTP mix (25mM of 
cold dATP, dCTP, and dGTP, lOmM of dTTP: 25 jal each of lOOmM dATP, dCTP, and 

15 dGTP; 10 1^1 of lOOmM dTTP to 15 ^il H20. dNTPs from Pharmacia) 

RNA degradation is performed as follows. Add 86 |il H20, 1.5 \xl IM NaOH/ 
2mM EDTA and incubate at 65°C, 10 min.. For U-Con 30, 500 jil TE/sample spin at 7000g 
for 1 0 min, save flow through for purification. For Qiagra purification, suspend u-con 
recovered material in 500^1 buffer PB and proceed using Qiagen protocol. For DNAse 

20 digestion, add 1 ul of 1/100 dil of DNAse/30ul Rx and incubate at 37°C for 1 5 min. Incubate 
at 5 min 95*^0 to denature the DNAse/ 

For sample preparation, add Cot-1 DNA, 10 |j.1; 50X dNTPs, 1 20X SSC, 
2.3 IJ.I; Na pyro phosphate, 7.5 jal; lOmg/ml Heiring sperm DNA; lul of 1/10 dilution to 21.8 
final vol. Dry in speed vac. Resuspend in 15 fil H20. Add 0.38 |li1 10% SDS. Heat 95°C, 2 

25 min and slow cool at room temp, for 20 min. Put on sUde and hybridize overnight at 64°C. 
Washing after the hybridization: 3X SSC/0.03% SDS: 2 min., 37.5 mis 20X SSC+0.75mls 
10% SDS in 250mls H20; IX SSC: 5 min., 12.5 mis 20X SSC in 250mls H20; 0.2X SSC: 5 
min., 2.5 mis 20X SSC in 250mls H20. Dry slides and scan at ^propiate PMT's and 
channels. 

30 

Example 2. A model of angiogenesis is used to determine expression in angiogenesis 

In the model of angiogenesis used to determine expression of angiogenesis- 
associated sequences, human umbiUcal vein endotheUal cells (HUVEC) were obtained, e.g., 
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as passage 1 (pi) frozen cells from Cascade Biologies (Oregon) and grown in maintenance 
medium: Medium 199 (Life Technologies) supplemented with 20% pooled human serum, 
100 mg/ml heparin and 75 mg/ml endothelial cell growth supplements (Sigma) and 
gentamicin (Life Technologies). An in vitro cell system model was used in which 2x10^ 

5 HUVECs were cultured in 0.5 ml 3 mgs/ml plasminogen-depleted fibrinogen (Calbiochem, 
San Diego, CA) that was polymerized by the addition of 1 unit of maintenance medium 
supplemented with 100 ng/ml VEGF and HGF and 10 ng/ml TGF-a (R&D Systems, 
Minneapolis,MN) added (growth medixmi). The growth medium was replaced every 2 days. 
Samples for RNA were collected, e.g., at 0, 2, 6, 15, 24, 48, and 96 hours of culture. The 

10 fibrin clots were placed in Trizol (Life Technologies) and dismpted using a Tissuemizer, 
Thereafter standard procedures were used for extracting the RNA {e.g.. Example 1). 

Angiogenesis associated sequences thus identified are shown in Tables 1-8 . 
As indicated, some of the Accession numbers include expression sequence tags (ESTs). 
Thus, in one embodiment herein, genes within an expression profile, also termed expression 

1 5 profile genes, include ESTs and are not necessarily fiiU length. 
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TABLE 1: 



Pkey: Unique Eos probeset identifier number 

Accession: Accession number used for previous patent filings 

ExAccn: Exemplar Accession number, Genbanit accession number 

UnigenelD: Unlgene number 

UnigeneTitie: Unlgene gene title 
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Pkey 


Accession 


ExAccn 


UnigenelD 


134404 


AB0004SO 


AB000450 


Hs.82771 


121443 AB002380 


AF180681 


Hs.6582 


100082 AB003103 


AA1 30080 


Hs.4295 


132817 AB004884 


N27852 


Hs.57553 


130150 AF000573_ma1 


BE094848 


Hs.15113 


100104 AF008937 


AF008937 


Hs.102178 


130839 AF009301 


ABOillOg 


Hs.20141 


427064 AF009368 


AF029674 


Hs.173422 


100113 


D00591 


N1VL001269 


Hs.84746 


133980 


D00760 


AA294921 


Hs.250811 


100129 


D11139 


AA469369 


Hs.5831 


100154 


D14657 


H60720 


Hs.81892 


100169 


D14878 


AL037228 


Hs.82043 


101956 


D17716 


NM_002410 


Hs.121502 


100190 


D21090 


M91401 


Hs.178658 


134742 


D26135 


NM^001346 


Hs.89462 




D26528 


D26528 


Hs.123058 


100238 


D30742 


L24959 


Hs.348 


130283 


D31762 


NM 012288 


Hs.153954 


134237 


D31765 


D31765 


Hs.170114 


100248 


D31888 


NWL015156 


Hs.7e398 


100255 


D38128 


D25418 


Hs.393 


100262 


D38500 


D38500 


Hs.278468 


134329 


D38551 


N92036 


Hs.81848 


100281 


D42087 


AF091035 


Hs.184627 


100294 


D49396 


AA331881 




100327 


D55640 


D55640 




100335 


D63391 


AW247529 


Hs.6793 


134495 


D63477 


D63477 


Hs.84087 


100338 


D63483 


D86864 


Hs.57735 


135152 


D64015 


M96954 


Hs.182741 


134269 


D79990 


NM_014737 


Hs.80905 


100372 


D79997 


NWL014791 


Hs.184339 


134304 


D80010 


BE613486 


Hs.81412 


100394 


D84276 


D84284 


Hs.66052 


100405 


D86425 


AW291587 


Hs.82733 


100418 


D86978 


D86978 


Hs.84790 


133154 


D87012 


D87012 


Hs.194685 


134347 


D87075 


AF164142 


Hs.82042 


128653 


D87432 


D87432 


Hs.10315 


100438 


D87448 


AA013051 


Hs.91417 


134593 


D87845 


NM.000437 


Hs.234392 


100481 


HG1098-HT1098 X70377 


Hs. 121489 


100552 


H62167.HT2237 AA019521 


Hs.301946 


100591 


HG24154rr2511 NM_004091 




cds 

100652 


HG2825-HT2949 BE613608 


Hs.142653 


100662 


HG2887-HT3031 




A}368680 


100899 HG4660+rr5073 AL039123 


Hs.103042 


100905 


HG4704-HT5146 L12260 


Hs.172816 


100945 


HG884-HT884 


AF002225 


Hs.180686 


100950 


HG919-HT919 


AF128542 


Hs.166846 


100964 


J00212Lf 


J00212 




135407 


J04029 


J04029 


Hs.99936 


130149 J04031 


AW067805 


Hs.172665 


131877 


J04088 


J04088 


Hs.156346 


101016 J04543 


J04543 


Hs.78637 


134786 


L06139 


T29618 


Hs.89640 


134100 


L07540 


AA460085 


Hs.171075 


134078 L08895 


108895 


Hs.78995 


101132 


L11239 


L11239 


Hs.36993 


134849 


L11353 


BE409525 


Hs.902 


106432 


L13773 


AK000310 


Hs.17138 



UnigeneTitie 

vacdnia related kinase 2 

Rho guanine exchange factor (GEF) 12 

proteasome (piosome, macropain) 26S subunii. non-ATPase, 12 

tousled-like kinase 2 

homogentisate 1,2-dioxygenase (homogentisate oxkJase) 
syntaxin 16 

similar to S. cerevislae SSM4 
KIAA1605 protehi 
chromosome condensation 1 

v-ral simian leukemia viral oncogene homolog B (ras related; GTP binding protein) ' 
tissue inhibitor of metalloproteinase 1 (erythroid potentiating activity, collagenase inhibitor) 
K1AA0101 gene product 
D123 gene product 

mannosyl (a!pha-1.6-)^tycopnotein beta-1,6-N-acetyl^lucosaminyltransfeiase 
RAD23 (S. cerevislae) homolog B 
diacylglycerol kinase, gamma (90kD) 

DEAD/H (Asp-GIu-Ala-Asp/His) box polypeptide 7 (RNA helicase, 52kD) 
calcium/calmoduiin-dependent protein kinase IV 
TRAM-like protein 
KIAA0061 protein 
KIAA0071 protein 

prostaglandin 12 (prostacyclin) receptor (IP) 
postmeiotic segregation increased 2-Iike 4 
RAD21 (S. pombe) homolog 
K1AA01 18 protein 
peroxfredoxin 3 

gb:Human monocyte PABL (pseudoautosomal boundary-like sequence) mRNA. done Mo2. 
platelet-activating factor ace^hydrolase, isofomn lb. gamma subunit (29kD) 
KIAA0143 protein 
acetyl LDL receptor; SREC 

TIA1 cytotoxic granule^assodated RNA-binding protein-like 1 
Ras assodation (RalGDS/AF-6) domain family 2 
iaAA0175 gene produd 
lipin 1 

CD38 antigen (p45) 
nidogen 2 
KIAA0225 protein 
topoisomerase (DMA) III beta 

solute carrier family 23 (nucleobase transporters), member 1 

solute cam'er family 7 (cationic amino acid transporter, y-*- system), member 6 

topoisomerase (DMA) 11 binding protein 

ptatelet-activating fador acetylhydrolase 2 (40kD) 

cystatinD 

lysosomal 

Homo sapiens. Similar to hypothetical protein PR01722, done MGC:15692, mRNA, complete 
ret finger protein 

Hs.816 SRY (sex detemnining region Y)-box 2 
microtubule^ssodated protein IB 
neuregulin 1 

ubk|uitin protein ligase E3A (human pafulioma virus E6-assodated protein, Angelman syndrome) 

polymerase (DNA direded), epsiton 

Empirically selected from AFFX single probeset 

keratin 10 (epldermolytic hyperkeratosis; keratosis patmaris et plantaris) 

methylenetetrahydrofolate dehydrogenase (NADP-i- dependent], methenyltetrahydrofolate 

topoisomerase (DNA) 11 alpha (170kD) 

annexinA7 

TEK tyrosine kinase, endotheCal (venous malformations, multiple cutaneous and mucosal) 
repUcation fodor C (activator 1) 5 (36^kD) 

MADS box transcription enhancer factor 2, polypeptide C (myocyte enhancer fador 2C) 
gastrulation brain homeo box 1 
neurortbromin 2 (bilateral acoustic neuroma) 
hypottietical protein FU20303 
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101152 


L13800 


AI984625 


Hs.9884 


135397 


L14922 


L14922 


Hs.166563 


1316B7 


LI 5189 


BE297635 


Hs.3069 


101168 


L15388 


NM-005308 


Hs.211569 


421155 


L16895 


H87879 


Hs.102267 


101226 


L27476 


AF083892 


Hs.75608 


133975 


L27624 


C18356 


Hs.295944 


134739 


L32976 


NM.002419 


Hs.89449 


130155 


L33401 


AA101043 


Hs.151254 


440538 


L35263 


W76332 


Hs.79107 


132813 


L37347 


BE313625 


Hs.57435 


101294 


L40371 


AF168418 


Hs.116784 


101300 


L40391 


BE535511 


Hs.74137 


101310 


L41607 


L41607 


Hs,934 


130344 


L77566 


AW250122 


Hs.154879 


embryonic lethal 






101381 


M13928 


AW676039 


Hs.1227 


101668 


M14016 


AW005903 


Hs78601 


133780 


M14219 


AA557660 


Hs.76152 


101396 


M15796 


BE267931 


Hs.78996 


101447 


M21305 


M21305 




101458 


M22092 


M22092 




101470 


M22898 


NAL000546 


Hs.1846 


134604 


M22995 


NM.002684 


Hs.865 


101478 M23379 


NI\iL002890 


Hs.758 


406698 


M24364 


X03066 


H8.73931 


133519 


M24400 


AW583062 


Hs74502 


131185 


M25753 


BE280074 


Hs.23960 


134116 M27691 


R84694 


HS79194 


133999 


M28213 


AA535244 


Hs.78305 


130174 


M29550 


M29551 


Hs.151531 


129963 


M29971 


M29971 


Hs.1384 


132983 


M30269 


M30269 


Hs.62041 


133900 


M31158 


M31158 


Hs.77439 


101543 


M31166 


^431166 


Hs^OSO 


101545 


M31210 


BE246154 


. Hs.154210 


101620 


M55420 


S55271 


Hs.247930 


134691 


M59979 


AW382987 


Hs.88474 


133595 


M62810 


AA393273 


Hs.75133 


130425 


M63838 


AA243383 


Hs.155530 


101700 


M64710 


D90337 


Hs.247916 


101714 


M68874 


M68874 


Hs.211587 


134246 


M74524 


D28459 


Hs.80512 


101760 


M80254 


M80254 


Hs.173125 


133948 


M81780„cds3 


X59960 


Hs.77813 


101791 


M83822 


M83822 


Hs.62354 


101812 


M86934 


BE439894 


Hs.78991 


101813 


M87338 


NM.002914 


Hs.139226 


133396 


M96326.ma1 


M96326 


Hs.72885 


135152 


M96954 


M96954 


Hs.182741 


129026 M98833 


AL120297 * 


Hs.108043 


101901 


S66793 


H38026 


Hs.308 


134831 


S72370 


AA853479 


Hs.89890 


134039 


S78569 


NIVL002290 


Hs.78672 


134395 


S79873 


AA456539 


Hs.8262 


101975 


S83325 


AA079717 


Hs.283664 


101977 


S83364 


AF112213 


Hs.184062 


101978 


S83365 


BE561610 


H5.5809 


Interacli 
101998 


ng factor) 
U01212 


U01212 


Hs.248153 


102003 


U01922 


U01922 


Hs. 125565 


102007 


U02556 


U02556 


Hs.75307 


102009 


U02680 


BE245149 


Hs.82643 


416658 


U03272 


U03272 


Hs.79432 


132951 


UO4209 


AW821182 


Hs.61418 


135389 


U05237 


U05237 


Hs.99872 


102048 


U07225 


U07225 


Hs.339 


130145 


U07620 


U34820 


Hs.151051 


303153 


U09759 


U09759 




420269 


U09820 


U72937 


Hs.96264 


102095 


U11313 


U11313 


Hs.75760 


102123 


U14518 


NM_001809 


Hs.1594 


102126 


U14575 


AW950870 


Hs.78961 


102133 


U15173 


AU076845 


Hs.155596 


102139 


U15932 


NM_004419 


Hs^128 


102162 


U18291 


AA450274 


Hs.1592 



spindle pole body protein 
replication factor C (activator 1) 1 (145kD) 
heat shock 70kD protein 98 (mortalin-2) 
G protein-coupled receptor kinase 5 
lysyl oxidase 

tight junction protein 2 (zona ocdudens 2) 
tissue factor pathway inhibitor 2 
mitogen-activated protein kinase kinase kinase 11 
kallikrein 7 (chymotryptic. stratum comeum) 
mitogen-activated protein kinase 14 

solute carrier family 1 1 (proton-coupled divalent metal ion transporters), member 2 

thyroid homione receptor interactor 4 

transmembrane trafficking protein 

glucosaminyl (N-acetyl) transferase 2, l-branching enzyme 

DiGeoige syndrome critical region gene OGSI; likely ortholog of mouse expressed sequence 2 

amindevulinate, delta-, dehydratase 
uroporphyrinogen decartioxylase 
decorin 

proliferating cell nuclear antigen 

gbiHuman alpha satelRte and satellHe 3 Junction DMA sequence. 

gb:Human neural cell adhesion molecule (N-CAM) gene« axon SEC and partial cds. 

tumor protein p53 (U-Fraumeni syndrome) 

RAP1 A» member of RAS oncogene family 

RAS p21 protein activator (GTPase activating protein) 1 

fna]or histocompataiility complex, class II, DQ beta 1 

chymotrypsinogenBI 

cydin 81 

cAMP responsive element binding protein 1 
RAB2. member RAS oncogene family 

protein phosphatase 3 (formerly 28), catalytic subun'rt, beta isofbnm (calcineurin A beta) 
0-6-methylguanlne-DNA methyllransferase 
nidogen (enactin) 

protein kinase, cAMP-dependent. regulatory, type II. beta 
pentaxin-related gene, rapidly induced by IL-1 beta 
endothelial differentiation, sphingolipid G-protein-coupled receptor, 1 
Epsilon , (gE 

prostaglandin-endoperoxtde synthase 1 (prostaglandin G/H synthase and cydooxygenase) 
transcription factor 6-like 1 (mitochondrial transcnptlon foctor Mike) 
interferon, gamma-jnducible protein 16 
natriuretic peptide precursor C 

phospholipase A2, group IVA (cytosolic, caldunniependent) 
ubiquitin-conjugating enzyme E2A (RAD6 homolog) 
peptidylprolyl isomerase F (cyclophilin F) 

sphingomyelin phosphodiesterase 1, add lysosomal (acid sphingomyelinase) 
oeD diviston cyde 44lke 

DNA segment, numerous copies, expressed probes (GS1 gene) 

replteation factor C (activator 1) 2 (40kD) 

azuroddin 1 (cationic antimicrobial protein 37) 

TIA1 cytotoxic granule-associated RNA-Unding protein-like 1 

Friend leukemia virus integratkm 1 

anestln 3, retinal (X-arrestIn) 

pyruvate carboxylase 

lamfnin, alpha 4 

lysosomal 

aspartate beta-hydroxy!ase 
putative Rab5-interacting protein 

putative transmembrane protein; homolog of yeast Golgl membrane protein Vrflp (Yiplp- 
olfactory marker protein 

translocase of inner mitochondrial membrane 8 (yeast) homolog A 
t-complex-associated-testis-expressed 1-llke 
protein tyrosine kinase 9 

fibrillin 2 (congenital contracturat arachnodactyly) 

mk:rofibrillar>assodated protein 1 

fetal Alzheimer antigen 

purinergic receptor P2Y, G-protein coupled, 2 

mitogen-activated protein kinase 10 

mitogen-activated protein kinase 9 

alpha thalassemia/mental retaidatton syndrome X-linked (RADS4 (S. cerevisiae) homotog)* 

sterol carrier protein 2 

centromere protein A (17kD) 

protein phosphatase 1, regulatory (inhibitor) subunlt 8 

BCL2/adenovlrus E1B 19kD-lnteracting protein 2 

dual spedftcity phosphatase 5 

CDC16 (celt division <^e 16, S. cerevisiae, homolog) 
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102164 


1118300 


NM_000107 


Hs.77602 


damage-specific DNA binding protein 2 (48kD) 


427653 


U 18383 


AA159001 


Hs.180069 


nudear respiratory factor 1 


131817 


U20536 


U20536 


Hs.3280 


caspase 6, apoptosis-related cysteine protease 


102200 


U21551 


AA232362 


Hs.1 57205 


branched chain aminotransferase 1, cytosolic 


102210 


U23028 


BE619413 


Hs.2437 


eukaryotic translation initiation factor 2B, subunit 5 (epsllon, 82kD) 


102214 


U23752 


U23752 


Hs.32964 


SRY (sex determining region Y)-box 1 1 


132811 


U25435 


U25435 


Hs.57419 


CCCTC-binding fador (zinc finger protein) 


131319 


U25997 


NIVL003155 


Hs.25590 


stanniocaldn 1 


102256 


U28251_cds2 


U28251 


Hs.53237 


ESTs, Highly similar to Z169„HUMAN ZINC FINGER PROTEIN 169 [H,sapiens] 


132316 


U28831 


U28831 


Hs.44566 


KIAA1641 protein 


102269 
exon 1. 


U30245 


U30245 




gbiHuman myetomonocytic specific protein (MNDA) gene. 5* flanking sequence and complete 


134365 


U32315 


AA568906 


Hs.82240 


syntaxin 3A 


102293 


U32439 


AF0901 16 


Hs.79348 


regulator of G-protein signalling 7 


102298 


U32d49 


AA382169 


Hs.54483 


N-myc (and STAT) interador 


102325 


U35139 


Af815867 


Hs.50130 


necdin (mouse) homolog 


302344 


U36764 


BE303044 


Hs.192023 


eukaryotic translation initiation fador 3. subunit 2 (beta. 36kD) 


102361 


U39400 


AA223616 


HsJ5859 


chromosome 1 1 open reading frame 4 


102367 


U39857 


U39656 


Hs.118825 


mifogen-adivated protein kinase kinase 6 


102386 


U41344 


AA362907 


Hs.76494 


proline aiginine-rich end leudne-rfch repeat protein 


102394 


U41766 


NM.003816 


Hs.2442 


a disintegrin and metalloprotelnase domain 9 (meltrin gamma) 


129829 


U41813 


AF010258 


Hs.127428 


homeoboxA9 


102251 


U41815 


N^^004398 


Hs.41706 


DEAD/H (Asp-Gh^A1a<Asp/His) box polypeptide 10 (RNA heiicase) 


102409 


U43286 


BE300330 


Hs.118725 


selenophosphate synthetase 2 


133746 


U44378 


AW410035 


Hs.75862 


MAD (motheis against decapentaplegte, Drosophita) homobg 4 


102423 


U44754 


Z47542 


Hs.179312 


smaN nuclear RNA activating complex, polypeptide 1, 43kD 


132828 


U47011_cds1 


AB014615 


Hs.57710 


fibroblast growth fador 8 (androgen-Induced) 


130441 


U47077 


U63630 


Hs.155637 


protein kinase. DNA-adivated. catalytic polypeptide 


102450 


U48251 


U48251 


Hs.75871 


protein kinase C binding protein 1 


129350 


U50535 


U50535 


Hs. 11 0630 


Human BRCA2 regton. mRNA sequence CG008 


102534 


U56833 


U96759 


Hs,19B307 


von Hippel-Lindau binding protein 1 


130457 


U58091 


AB014595 


Hs.155976 


cuiiin 4B 


135065 


U58837 


AA019401 


Hs.93909 


cyclic nucleotide gated channel beta 1 


102560 


U59289 


R97457 


Hs.63984 


cadherin 1 3, H-cadherin (heart) 


102567 


U59863 


U63830 


Hs.146847 


TRAF family member-associated NFKB activator 


134305 


U67122 


U61397 


Hs.81424 


ubiquitin-like 1 (sentrin) 


102638 


U67319 


U67319 


Hs.9216 


caspase 7, apoptosis^fBlated cysteine protease 


132736 


U68019 


AW081883 


Hs.288261 


Homo sapiens cDNA: FLJ23037 frs, done LNG02036. highly similar to HSU68019 Homo 


sapiens mad protein homolog (hMAD-3) mRNA 


133070 


U69611 


U92649 


Hs.64311 


a disintegrin and metalloprotelnase domain 17 (tumor necrosis factor, alpha, converting enzyme) 


102663 


U70322 


NWL002270 


Hs.168075 


karyopherin (importin) beta 2 


134660 


U73524 


U73524 


Hs.87465 


ATP/GTP-binding protein 


102735 


U79267 


AF111106 


Hs.3382 


protein phosphatase 4, regulatory subunit 1 


102741 


U79291 


AW959829 


Hs,83572 


hypothetical protein MGC14433 


101175 


U82671.cds2 


U82671 


Hs.36980 


melanoma antigen, family A, 2 


132164 


U84573 


AI752235 


Hs.41270 


procollagen-lysine, 2-oxogIutarate &<lioxygenase (lysine hydroxylase) 2 


102823 


U90914 


085390 


Hs.5057 


carboxypeptidase D 


102626 


U91d16 


NIVL007274 


Hs.d679 


cytosolk; acyl coenzyme A tfiioester hydrolase 


102831 


U91932 


AA262170 


Hs.80917 


adaptor-related protein complex 3, sigma 1 subunit 


102846 


U96131 


BE264974 


Hs.6566 


thyroid honnone receptor mteractor 13 


129777 


U97018 


U97018 


Hs.12451 


echlnoderm microtubule-assodated protein-fike 


134161 


U97188 


AA634543 


Hs.79440 


IGF-II mRNA-blnding protein 3 


134854 


V00503 


J03464 


Hs.179573 


collagen, type 1. alpha 2 


302363 


X04327 


AW163799 


Hs.198365 


2,3-bisphosphogtyceFate mutase 


133708 


X06389 


Ai018666 


Hs.75667 


synaptophystn 


125701 


X07496 


T72104 


Hs.93194 


apolipoprotein A-l 


102915 


X07820 


X07820 


Hs.2258 


matrix metalloprotelnase 10 (stromelysin 2) 


134656 


X14787 


AI750878 


Hs.87409 


thromlwspondin 1 


413858 


X15525_nia1 


NIVL001610 


HS75589 


acid phosphatase 2, lysosomal 


102968 


X16396 


AU076611 


Hs.154672 


methylene tetrahydrofolate dehydrogenase (NACH dependent), methenyltetrahydrofolate 


cydohydrolase 






102971 


X16609 


X16609 


Hs.183805 


ankyrini.eiythrocylk: 


134037 


X53586^ma1 


AI808780 


Hs.227730 


integrin. alpha 6 


103023 


X53793 


AW500470 


Hs.1 17950 


multifunctional polypeptide similar to SAtCAR synthetase and AIR carboxylase 


103037 


X54936 


BE018302 


Hs.2894 


placental growth fador, vascular endothelial growth factor-related protein 


130282 


X55740 


BE245380 


Hs.153952 


5' nucleotidase (CD73) 


134542 


X57025 


M14156 


Hs.85112 


insulin-like growth fador 1 (somatomedin C) 


128568 


X60673^ma1 


H12912 


Hs.274691 


adenylate kinase 3 


103093 


X60708 


S79876 


Hs.44926 


dipeptidylpeptidase IV (CD26, adenosine deaminase comptexing protein 2) 


133606 


X62048 


U10664 


Hs.75188 


wee1+(S. pombe) homolog 


129063 


X63097 


X63094 


Hs.283822 


Rhesus blood group, D antigen 


424460 


X63563 


BE275979 


Hs.296014 


polymerase (RNA) 1) (DNA directed) polypeptide B {140kD) 


133227 


X64037 


AW977263 


Hs.68257 


general transcription fador IIF, polypeptide 1 (74kD subunit) 


103181 


X69636 


X69636 


Hs.334731 


Homo sapiens, done tMAGE:344€^06, mRNA, partial ods 


103184 


X6987d 


U43143 


Hs.74049 


fms-related tyrosine kinase 4 


103194 


X70649 


NM_004939 


Hs.78580 


DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 1 
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retinoblastoma-bfnding protein? 
ATP-blhding cassette. sul>-family E (OABP). member 1 
6MX non-feceptor tyrosine kinase 
acytphosphatase 1» erythrocyte (common) type 
cydin-dependent kinase 6 
hepatocyte nudear factor 4. alpha 
transient receptor potential channel 1 
uradl-DNAglyoosytase 

RAS p21 protein activator (GTPase activating protein) 3 (lns(1»3.4.5)P4^inding protein) 
endotheTial celt-specific molecule 1 
thtoredoxin reductase 1 
purine-nch eiement binding protein A 
coated veside membrane protein 
HCGVIII-1 protein 

RAB28, member RAS oncogene family 
DRI^associated protein 1 (negative cofactor 2 alpha) 
killer cell immunogbbuiin-IIke receptor, three domains, long cytoplasmk; tail, 1 
M-phase phosphoprotein 6 

ubiquitin spedfic protease 9, X chromosome (Drosophlla fat facets related) 
SMT3 (suppressor of mif two 3. yeast) homolog 1 
amyloid beta (A4) precursor protein (protease nexin-ll, Alzheimer disease) 
Ric (Drosophila)-iike. expressed in many tissues 
myosin VA (heavy polypeptide 12, myoxin) 
butyrophilin, subfamily 3, member A1 . 
Pirin 

alkylglycerone phosphate synthase 
spindlMke 

karyopherin alpha 3 (importin alpha 4) 
aconitase Insoluble 
mitogen-adivated protein kinase 1 
centromere protein E (312kD) 
H3 hlstone family, member A 
poly(rC)-binding proteki 2 

Homo sapiens diromosome X map Xpl 1 .23 L-type caldum channel alpha-1 subunit 
(CACNA1F) gene, complete cds; HSP27 pseudogene. complete sequence; and JMl protein, JW, protein, and Hb2E genes, complete cds 



103208 


X72841 


AW411340 


Hs.31314 


129698 


X74987 


BE242144 


Hs. 12013 


131486 


X83107 


F06972 


Hs.27372 


130729 


X84194 


AI963747 


Hs.18573 


103334 


XB5753 


NI\L001260 


Hs.25283 


132645 


X87870 


At654712 


Hs.54424 


135094 


yfi9066 


NM 003304 


Hs 250687 








Hs 78853 




yfiq^QQ 


XB9399 


Hs 119274 






X89426 


Hs 41716 


103371 


yQio47 


X91247 


Hs 13046 


131SfU 


AH lOHO 


AAsgssod 


Hs 29117 


103a7R 

1 D 


A9£USK> 


ALa36166 


Hs 323378 


103378 


A0£ 1 IV 


AL1 19690 


Hs.153618 






X94703 


Hs.296371 






AA158294 


Hs.334879 


133490 


yqyoqn f 

AUf C<J\J^l 


AF022044 


Hs.274601 




yqR?63 


AW175781 


Hs.152720 


103440 


XQ89Q6 


X98296 


Hs.77578 


103452 




NIVL006936 


Hs.85119 


133536 


Y00264 


W25797.comp Hs.177486 


1351fi5 


Y07566 


AW404908 


Hs.96038 


11RS93 


Y07759 


Y07759 


Hs.170157 


134RR? 


Yft7fl97 


NM.007048 


Hs.284283 




Yn7AR7 

1 Uf OOf 


BE386490 


Hs.279663 


1 UOoUU 




AW408009 


Hs.22580 


134389 


Y09858 


Y09858 


Hs.82577 


132084 


Y12394 


NM„002267 


Hs.3886 


103540 


211559 


NM_002197 


Hs.154721 


133152 


Z11695 


Z11695 


Hs.324473 


103548 


Z15005 


Z15005 


Hs.75573 


103612 


Z46261 


BE336654 


Hs.70937 


129092 


AA011243_s 


D56365 


Hs.63525 


103692 


AA018418 


AW137912 


Hs.227583 



103695 AA018758 
129796 AA018804 
132258 AA031993 
132683 AA044217 
131887 AA046548 
member 1 

103723 AA057447.S 
453368 AA058376 
133260 AA083572 

103765 AA085696 

103766 AA088744 

103767 AA089688 
132051 AA091284 
103773 AA092700 
[C.elegans) 
135289 AA092968 
132729 AA094800 
103794 AA100219 
131471 AA114885 
134319 AA129547 
103807 AA133016 
119159 AA149507 
129863 AA151005 
103850 AA187101 
103855 AA195179.S 
322025 AA203138 
135300 AA203645 
103861 AA206236 
130634 AA227621 
[C.elegans] 
447735 AA248283 
103909 AA24g611 
131236 AA282640 
mm AA287199 
129013 AA313990 
129435 AA314256 
103988 AA3143B9 
104000 AA324364 
425284 AA329211.S 
128629 AA399187 
133281 AA421079 



AW207152 H5.186600 ESTs 

BE218319 Hs.5807 GTPase Rab14 

AA306325 Hs.4311 SUMO-1 activating enzyme subunit 2 

BE264633 Hs.143638 WD repeat domain 4 

W17064 Hs.332848 SWI/SNF related, matrix associated, adin dependent regulator of chromatfoi, subfamfly e, 

BE274312 Hs.214783 Homo sapiens cDNA FU14041 fis. clone HEMBA1005780 

W202g6 Hs.288178 Homo sapiens cDNA FU11968 lis, clone HEMBB1001133 

AA403045 Hs.6906 Homo sapiens cDNA: FU23197 fis. ctone REC00917 

AA085696 Hs.169600 KIAA0826 protein 

AI920783 Hs.191435 ESTs 

BE244667 Hs.295155 CGl-100 protein 

AA393968 Hs.180145 HSPC030 protein 

A1219323 Hs.101077 ESTs, Weakly similar to T22363 hypothetical protein F47G9.4 - Caenorhabditis elegans 

AW372569 Hs.9788 hypothetfeal protein MGC10924 similar to Nedd4 WW-binding protein 5 

AW970843 Hs.55682 eukaryotic translation Initiation factor 3, subunit 7 (zela, 66/67kD) 

AF244 1 35 Hs.30670 hepatocellular candnoma-associated antigen 66 

AA164842 Hs.192619 KIAA1600 protein 

BE304999 Hs.75653 fumaiale hydratase 

AW958264 Hs.103832 simDar to yeast Upf3. variant B 

AF142419 Hs.15020 homolog of mouse quaking QKI (KH domain RNA binding protein) 

BE379765 Hs.129872 spenn associated antigen 9 

AA187101 Hs.213194 hypotheUcal protein IVIGC10895 

W02363 Hs.302267 hypothetical protein FU10330 

AW024973 Hs.283675 NPD009 protein 

AA142922 Hs.278626 Arg/AbMnteiacfing protein ArgBP2 

AA206236 Hs.4944 hypothetteal protein FU12783 

Ai769067 Hs.127824 ESTs, Weak^ sMar to T28770 hypothetical protein W03D2.1 - Caenorhabditis elegans 

AA775268 Hs.6127 Homo sapiens cDNA: FLJ23020 fis, done LNG00943 

AA249611 Hs.47438 SH3 domain binding glutamic acki-rich protein 

AF0431 17 Hs.24594 ubiquitinatkm factor E4B (homok)gous to yeast UFD2) 

D42039 Hs.78871 mesoderm development candidate 2 

AA371156 Hs.107942 DKFZP564M1 12 protein 

AF151852 Hs.111449 C6I-94 protein 

' AA314389 Hs.425O0 ADP-ribosylation factor-like 5 

AI146527 Hs.80475 polymerase (RNA) 11 (DNA directed) polypeptide J (13.3kD) 

AF155568 Hs.155489 NS1 -associated protein 1 

AL098748 Hs.102708 DKFZP434A043 protein 

AK001601 Hs.69594 high-mobility group 20A 
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104104 M422029 
[H.sapiens] 
108154 AA425230 
132091 AA447052 
135073 AA452000 
131367 M456687 
129593 AA487015«s 
135266 AB002326 
133505 C01527 
132054 C01714 
134393 C01811J 
131427 C02352^s 
133435 C02375 
104282 C14448 
134827 D16611_s 
130443 D25216 
131742 D31352 
132837 D58024 s 
130377 D80897 
104334 D82614 
134593 087845 
134731 D89377J 
129913 H08583 
131670 H40732 
104394 H46617 
104402 H56731 
129781 H75570 
129077 H78888 
104417 H81241 
134927 136531 
129280 M63154 
134498 M63180 
104460 M91504 
104488 N56191 
131248 N78483 

129214 N79268 
130017 R14652 
104530 R20459 
104534 R22303 
sequence. 
104544 R33779 
133328 R36553 
104567 R64534 
128562 R66475 
129575 R70821 
130776 R79356 
104599 R84933 
104660 RCJ^007160 
104667 RC^007234^s AI239923 
104718 RCJ\A018409 AI143020 
104764 RCJVA025351 AI039243 

104786 RCJ*kA027168 AA027167 

104787 RC_AA027317 AA027317 
similar to contains Alu repetitive element;, 
134079 RC_AA029423 AK001751 
104804 RCJ\A031357 
104865 RC_AA045136 
130828 RC^AA053400 
104907 RC__AA055829 
WARNING ENTRY [H.sapiens] 
104943 RC^AA065217 AF072873 
105013 RCJVA1 16054 
105024 RCJ^A126311 
132592 RCJ\A129390 
105038 RC_AA130273 
105077 RC.AA142919 
105096 RC.AA150205 

129215 RC.AA176867 
105169 RC„AA180321 
132796 RC^AA180487 
130401 RCJ\A187634 
105200 RC AA195399 
130114 RC_AA234717 
105330 RCJ\A234743 
105337 RCJ\A234957 
129385 RC_AA235604 



AA422029 Hs.143640 ESTs, Weakly similar to hyperpolarization-^ctivated cyclic nucleotide^ated channel hKCN2 



NM_005754 

AW954243 

W55956 

A1750575 

AI338247 

R41179 

AI630124 

AA121098 

W52642 

AF151879 

AI929357 

C14448 

BE314037 

D25216 

AA961420 

AA370362 

NM.014909 

D82614 

NM^000437 

D89377 

NML001310 

H03514 

AA129551 

H56731 

AA306090 

N74724 

AI819448 

L36531 

M63154 

AW246273 

AW955705 

N56191 

AI03898g 

AL044335 

AK000096 

AK001676 

R22303 

AI091173 

AW452738 

AA040620 

AA923382 

F08282 

AF167706 

AW815036 

BE298665 



AI858702 
T79340 
AW631469 
AA055829 



H63789 

AA126311 

AW803564 

AW503733 

W55946 

AL04250S 

AB040930 

BE245294 

NM_006283 

BE396283 

AA328102 

AA233393 

AW338625 

Af468789 

AA172106 



Hs.220689 Ras-GTPase-activating protein SH3-domain-binding protein 

Hs.170218 KIAA0251 protein 

Hs.94030 Homo sapiens mRNA; cDNA DKFZp586E1624 (from done DKFZp5B6E1624) 

Hs. 1 73933 nuclear factor I/A 

Hs.98314 Homo sapiens mRNA; cDNA DKFZp566L0120 (from done DKFZp586L0120) 

Hs.97393 KIAA0328 protein 

Hs.324504 Homo sapiens mRNA: cDNA DKFZp586J0720 (from done DKF2p585a0720) 

Hs.3838 seaim<lndudble kinase 

Hs.8261 hypottietical protein FU22393 

Hs.26706 CGM21 protein 

Hs.323966 Homo sapiens clone H63 unknown mRNA 

HS.33233B EST 

Hs.89866 coproporptiyrinogen oxidase (coproporphyria, harderoporphyria) 

Hs.155650 K1AA0014 gene product 

Hs.31433 ESTs 

Hs.57958 EGF-TM7-latrophi]in-fBlatBd protein 

Hs.155182 K1AA1036 protein 

HS.7B771 phosphoglycerale kinase 1 

Hs.234392 platelet-activating ^tor acetylhydrolase 2 (40kD) 

Hs.89404 msh (Drosophila) homeo Ixix homotog 2 

Hs.13313 cAMP responsive element binding protein-like 2 

Hs.10130 ESTs 

Hs.172129 Homo sapiens cDNA: FU21409 fis, clone COL03924 

HS.1329S6 ESTs 

Hs.124707 ESTs 

Hs.108479 ESTs 

Hs.320861 Kruppet-like factor 8 

Hs.91296 integrin,alpha8 

Hs.1 10014 gastric intrinsic factor (vitamin B synthesis) 

Hs.84131 threonyl-tRNA synthetase 

Hs.62604 Homo sapiens, clone iMAGE:4299322, mRlslA. partial cds 

Hs.10651 1 protocadherin 17 

Hs.332633 Bardet>Biedl syndrome 2 

Hs.109526 zinc finger protein 198 

Hs.143198 inhibitor of growth family, member 3 

Hs.12457 hypothetical protein FU10814 

gb7h26b09.r1 Soares placenta Nb2HP Homo sapiens cDNA clone liyMGE:130841 mRNA 

Hs.222352 ESTs, Weakly simHar to p40 (H.saplensl 

Hs.265327 hypolhetteal protein DKFZp761t141 

Hs.5672 hypothetical protein AF140225 

Hs.101490 ESTs 

Hs.278428 progestin Induced protein 

Hs.19280 cysteine-rich motor neuron 1 

Hs.151251 ESTs 

Hs.14846 Homo sapiens mRNA; cDNA DKFZp564D016 (from ctone DKFZp564D016) 

Hs.30098 ESTs 

Hs.36250 ESTs. Weakly similar b 138022 hypothetical protein [H.sapiens] 

Hs.278585 ESTs 

Hs.10031 K1AA0955 protein 

gbuse97d11^1 Soares.fetaLhearLNbHH19W Homo sapiens cDNA done 1MAGE:366933 3* 
mRNA sequence. 

Hs.171835 hypottietical protein FU10889 

Hs.31803 ESTs. Weakly simRar to N-WASP [H.sapiens] 

Hs.22575 B-cell CLLTIymphoma 6, member B (zinc finger protein) 

Hs.203213 ESTs 

Hs.196701 ESTs. Weakly similar to ALUI^HUMAN ALU SUBFAIWIILY J SEQUENCE CONTAMINATION 

Hs.1 14218 frizzled (Drosophila) homolog 6 

Hs.296288 ESTs. Weakly similar to K1AA0638 protein [H^sapiens] 

Hs.9879 ESTs 

Hs.288650 Homo sapiens cDNA: FLJ2252B fis. done HRC12825 

Hs.9414 KIAA1488 protein 

Hs.234863 Homo sapiens cDNA FU12082 fis, done HEMBB1002492 

Hs.21599 Kruppel-lilce factor 7 (ubiquitous) 

Hs.126085 K1AA1497 protein 

Hs.180789 S164 protein 

Hs.173159 transfomnlng, addle coiled^il containing protein 1 

Hs.173987 eukaryotic translation initiation factor 3. subunit 1 (alpha, 35kD) 

Hs.24641 cytoskeleton assodated protein 2 

Hs.14992 hypothetical protein FLJ11151 

Hs.22120 ESTs 

Ks.23200 myotubularin related protein 1 

Hs.1 10950 Rag C protein 
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105376 RCJ^236559 
105397 RC AA242868 
131962 RC AA251776 
131991 RCAA251909 
128658 RcIaA252672_s 
105489 RCJ\A256157 
105508 RC„AA256680 
105539 RcIaA258873 
135172 RC_AA262727 
131569 RCJ\A281451 
132542 RCJ^281545 
105643 RC AA282069 
105659 RCJ\A283044 
105666 RC.AA283930 
105674 RCJ\A284755 
105709 RCJ\A291268 
105722 RCJ\A291927 
105765 RCJl<V343514 
115951 RCJ^39B109 
105962 RCJ>iA405737 
105985 RCJ^406610 
gb*J(02067 

106008 RCJM411465 
131216 RCJVA4168B6 
134222 RC_AA424013 
113689 RCJ^424148 
106141 RCJ\A424558 
130839 RCJ\A424961j 
106157 RCJ^425367 
130777 RC.AA425921 
receptor 

130561 RCJ\A426220 
106196 RCJ\A427735 
WARNING 

131878 RCJ^430673 
133200 RC_AA432248 
106302 RC_AA435896 
106328 RCJ\A436705 
450534 RCJV^446561 
106423 RC AA448238 
133442 RC_AA448688 
439608 RCJ\A449756 
106477 RC_AA450303 
106503 RCJ^452411 
446999 RCJ\A454566 
106543 RCJ^454667 
130010 RCJ\A456437 
106589 RCJ^456646 
106593 RCJ^4S6826 
106596 RCJVA456981 
CONTAMINATION 
134655 RCJVA458959 
memberl 

106636 RC_AA459950 
106654 RCJ\A460449 
131353 RCJ\A463910 
106707 RCJ^464603 
131710 RCJ^464606 
106717 RC_AA465093 
131775 RC^AA465692 
106747 RCJ^476473 
106773 RC AA478109 
106781 RC_AA478474 
106817 RC_AA480889 
106846 RCJ\A485223 
106848 RC„AA485254 
106856 RCJ\A486183 
418699 RC_AA496936 
WARNING 

107001 RCJ\A598589 
130638 RCJ\A598831_ 
107054 RC_AA600150 
107059 RCJ\A608545 
107080 RCJ^609210 
107115 RC^AA610108 
107130 RCJV\620582 



AW994032 


Hs.8768 


AA814807 


Hs.7395 


AK000046 


Hs.267448 


AF053306 


Hs. 36708 


; BE397354 


Hs.324830 


AA256157 


Hs»24115 


AA1 73942 


Hs.326416 


AB040884 


Hs. 109694 


AB028956 


Hs.12144 


AL389951 


Hs.271623 


AL137751 


Hs.263671 


BE621719 


Hs.173802 


AA283044 


Hs.25525 


AA426234 


Hs.34906 


AI609530 


Hs.279789 


At928962 


Hs.26761 


Ai922821 


Hs.32433 


AA299688 


Hs.24183 


BE546245 


H3.301048 


AW880358 


Hs.339808 


AA406610 




AB033888 


H5.8619 


A1815486 


Hs^43901 


AW855861 


HS.8Q25 


AB037650 


Hs.16621 


AF031463 


Hs.9302 


5 AB011169 


Hs.20141 


W37943 


Hs.34892 


A W1 35049 


Hs.285418 


AB011095 


Hs.16032 


AA525993 


Hs.173699 


AA083764 


Hs.6101 


AB037715 


Hs.183639 


AA398859 


Hs.18397 


AL079559 


Hs.28020 


A1570189 


Hs.25132 


AB020722 


Hs.16714 


AL137653 


Hs.7378 


AW664696 


Hs.301732 


R23324 


Hs.41693 


AB033042 


Hs.29679 


AA151520 


Hs.334822 


AA676939 


Hs.69285 


AA301116 


Hs.142838 


AK000933 


HsJ26661 


AW296451 


Hs^4605 


AA452379 


Hs.293552 


AF265208 


Hs.123090 


AW958037 


Hs.286 


AW075485 


Hs.286049 


AW754182 




AK000566 


Hs.98135 


NM_015368 


Hs.30985 


AA600357 


Hs.239489 


AB014548 


Hs.31921 


NM_007118 


Hs.171957 


AA478109 


Hs.188833 


AA330310 


Hs.24181 


D61216 


Hs.18672 


AB037744 


Hs.34892 


AA449014 


Hs.121025 


W58353 


Hs.285123 


BE539639 


HS.17303D 


At926520 


Hs.31016 


f AW021276 


Hs,17121 


AI076459 


Hs.15978 


BE614410 


Hs.23044 


AL122043 


Hs.19221 


BE379623 


Hs.27693 


AB033106 


Hs.12913 



hypothetical protein FLJ10849 
hypotheUcal protein FU23182 
hypothetical protein FLJ20039 

budding uninhibited by benzlmidazoles 1 (yeast homolog), beta 

diplheria toxin resistance protein required for diphthamide biosynthesis (Saccharomyces)-Hke 2 

Homo sapiens cDNA FU14178 fis, clone NT2RP2003339 

Homo sapiens mRNA; cDNA DKF2p564H1916 (from clone DKFZp564H1916) 

KIAA1451 protein 

KIAA1033 protein 

nudeoporin 50I(D 

Homo sapiens mRNA; cONA DKF2p434l0812 (from clone DKFZp434I0812); partial cds 
KIAA0603 gene product 
hypothetical protein FU11323 

ESTs. Weakly similar to T17210 hypothetical protein DKFZp434N041.1 {H^piensl 

histonedeacetyiase3 

DKFZP586L0724 protein 

ESTs 

ESTs 

sec13-llke protein 
hypothelteal protein FLJ10120 

gb*^15b10.s1 Soares.NhHMPu.S1 Homo sapiens cDNA done IMA6E:753691 3* similar to 

SRY (sex detemilnb^ region Y)-box 18 

Homo sapiens cDNA RJ20738 fis, done HEP08257 

Homo sapiens done 23767 and 23782 mRNA sequences 

PKFZP434I116 protein 

phosdudn-like 

similar to S. cerevlsiae SSM4 
KIAA1323 protein 

Homo sapiens cONA FU10643 fis. done ^rr2RP2005753. highly similar to Homo saptens H 
KIAA0523 protein 

ESTs, Weakly similar to ALU1.HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 

hypothetical protein MGC3178 
hypothetical protein aJ10210 
hypothetical protein FU23221 
KtAA0766 gene product 
KIAA0470gene product 
Rho guanine exchange fador (GEF) 15 

Homo sapiens mRNA; cONA DKFZp434G227 (from done DKFZp434G227) 

hypothetical protein MGC5306 

DnaJ (Hsp40) homolog, subfamily B, member 4 

cofador required for Spl transcriptional activation. subunit3 (130kD) 

hypothetteal protein MGC4485 

neuropilinl 

nudedar phosphoprotein Nopp34 

Homo sapiens d)NA FU10071 fis, done HEMBA1001702 

ESTs 

ESTs. Moderately similar to ALU7.HUMAN ALU SUBFAMILY SO SEQUENCE 

SWl/SNF related, matrix assodated, actin dependent regulator of diromatln, subfamily f. 

ribosomat protein L4 
phosphoserine aminotransferase 

gb:RC2-CT0321-131 19901 1-C01 CT0321 Homo sapiens cONA, mRNA sequence 
hypothetical protein FLJ20559 
pannexin 1 

TIA1 cytotoxic granule-assodated RNA-blnding protein 
K1AA0648 protein 

triple fundtonal domain (PTPRF interacting) 

ESTs 

ESTs 

ESTs 

KIAA1323 protein 

chromosome 11 open reading frame 5 

Homo sapiens mRNA full length insert cDNA done EUROIMAGE 2005779 

ESTs. Weakly sindlar to ALU8.HUMAN ALU SUBFAMILY SX SEQUENCE CONTAMINATION 

putative DNA binding protein 
ESTs 

K1AA1272 protein 

RA051 {S, cerevislae) homotog (E coii RecA homolog) 
hypothetical protein DKFZp566G1424 
peptidylprolyl isomerase (cydophinn)-iike 1 
K1AA1280 protein 
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107156 RC_AA621239 
107174 RC_AA621714 
130621 RC_AA621718 
107190 RC_D19673 
132626 RC D25755 s 
107217 RC D51095 
131610 RC_D60272^i 
129604 T08879 
107295 T34527 
(GalNAc-TI) 
107299 rA0327_s 

107315 T62771.S 

107316 T63174 s ' 
107328 T83444 
107334 T93641 
134715 U48263 
128636 U49065 
129938 U79300 
107375 U68573 
130074 U93BS7 
107387 W01094 
132036 W01568 
107426 W26853 
113857 W27179 
135388 W27g&5 
130419 W36280.S 
107469 W47063 
132616 W79060 
107506 W88550 
132358 X60486 
107522 X78931_s 
125827 Z14077_s 
107582 RC_AA002147 
107609 RCJVA004711 
107661 RC_AA010383 
107714 RC.AA015761 
107775 RC^AA018772 
107832 RCJ!^021473. 
sequence. 

107859 RC AA024a35 
124337 RC_AA025858 
107914 RCJ\A027229 
[C.eiegans] 

107935 RCJ\A029428 
116262 RC„AA035143 
131461 RC_jAA035237 
108007 RCJWV039347 
108029 RCJ^040740 
108040 RCJ^041551 
member 1 

108084 RC.AA045513 
108088 RCJkA045745 
108168 RCJkA055348 
130719 RCJU056582. 

108189 RCJVA056697 

108190 RCJ\A056746 
RCJ^057678 
RCJVA058681 
RC_AA058685 
RCJVA062840 
RCJ\A064859 



108203 
108216 
108217 
108245 
108277 
mRNA 
108280 
108309 
133739 
108340 
108403 



RCJ^065069 
RCJVA069923 
RC_AA070799. 
RC^AA070815 
RCJVA075374 
3', mRNA sequence. 
108427 RC_AA076382 
3', mRNA sequence. 
108435 RC^AA078787 
108439 RC.AA078986 
3*. mRNA sequence. 
108465 RC_AA079393 
108469 RCJ\A079487 
sequence 



AA 1 37043 Hs.9663 programmed cell death 6^nteracting protein 

BE1 22762 Hs.25338 ESTs 

AW513087 Hs.16803 LUC7 (S. cerevisiae).like 

AA836401 Hs.5103 ESTs 

AW504732 Hs.21275 hypothetical protein FU11011 

AL080235 Hs.35861 DKFZP586E1621 protein 

AA357879 Hs.29423 scavenger receptor with C-type lectin 

AF088886 Hs.11590 cathepsinF 

AA186629 Hs.80120 UDP-N-acetyI-alpha>D^alactosamine:polypeptjde N-acetylgatactosaminyltransferase 1 

BE277A57 Hs.30651 hypothetical protein MGC4606 

AA316241 Hs.90691 nucleophosmin/nudeoplasmin 3 

T63174 Hs,1 93700 Homo sapiens mRNA; cDNA DKF2p586l0324 (from clone DKFZp586l0324) 

AW959891 Hs.76591 KIAA0887 protein 

T93597 Hs.187429 ESTs 

U48263 Hs.89040 prepronodceptin 

U4g065 Hs.102865 interieulctn 1 receptor-like 2 

AW003668 Hs.135587 Human clone 23629 mRNA sequence 

BE011845 Hs.251064 highnnobility group (nonhistone chromosomal) protein 14 

AL038596 Hs.250745 polymerase (RNA) 111 (DMA directed) (62kD) 

D86983 Hs.1 18893 Melanoma associated gene 

AL157433 Hs.37706 hypothetical proteli DKFZp434E2220 

W26853 Hs.291003 hypothetical protein MGC4707 

AW243158 88.5297 DKFZP564A2416 protein 

W27985 Hs.99865 epinriorphin 

AF037448 Hs.155489 NSl-assodated protein 1 

W47063 Hs.94668 ESTs 

BE262677 Hs.283558 hypothetical protein PR01855 

AB028981 Hs.8021 KIAA1 058 protein 

NM-003542 Hs.46423 H4 histone family, member 6 

X78931 Hs.99971 zinc finger protein 272 

NM„003403 Hs.97496 YY1 transcripUon factor 

AA002147 Hs.59952 EST 

R75654 Hs.164797 hypothetical protein FU 13693 

AA010383 HS.603B9 ESTs 

AA015761 Hs.60642 ESTs 

AW008846 Hs.60857 ESTs 

r AA021473 gb:ze66c11.s1 Soares retina N2b4HR Homo sapiens cDNA done IMAGE:363956 3\ mRNA 

AW732573 Hs.47584 potassium voltage-gated channel, delayed-reclifier, subfamily S, member 3 

N23541 Hs.281561 Homo sapiens cDNA: FU23582 fis, done U^G13759 

AA027229 Hs.61329 ESTs. Wealdy similar to T16370 hypothefical protein F45E12.5 - Caenorhabdltis elegans 

AA029428 Hs.61555 ESTs 

AI936442 Hs.59838 hypothetical protein FU10808 

AA992841 Hs.27263 KIAA1458 protein 

AA039347 Hs.61916 EST 

AA040740 Hs.62007 ESTs. 

AL121031 Hs.159971 SWI/SNF related, matrix associated, adin dependent regulator of chromatin* subfamily b. 

AA058944 Hs.116602 Homo sapiens, done IMAGE:4154008, mRNA. partial cds 

AA045745 Hs.62686 ESTs 

AI453137 Hs.63176 ESTs 

.s AA679262 Hs.14235 hypothetical protein FU20008; KIAA1839 protein 

AW376061 Hs.63335 ESTs. Moderately simOar to A46010 X-finked retinopathy protein (H.saplens} 

AA056746 Hs.63338 EST 

AW847814 Hs.289005 Homo sapiens cDNA: FU21532 fe, done COIJ06049 

AA524743 Hs.44883 ESTs 

AA058686 Hs.62588 ESTs 

BE410285 Hs.89545 proteasome (prosome, macropain) subunit. beta type, 4 

AA064859 gb:zm50(03.s1 Stratagene fibroblast (937212) Homo sapiens cDNA done IMAGE:529085 3\ 

AA065069 gb:zm1 2e1 1 .si Stratagene pancreas (937208) Homo sapiens cDNA done 3\ mRNA sequence 

AA069818 gb:zm67e03.r1 Stratagene neuroepitheiium (937231) Homo sapiens cDNA done 5" similar to 

.s BE536554 Hs.278270 unactive progesterone receptor, 23 kD 

AA069820 Hs.180909 peroxiredoxin 1 

AA075374 gb:zm87a01.s1 Stratagene ovarian cancer (937219) Homo sapiens cDNA done IMAGE:544872 

AA076382 gb:zm91g08.s1 Stratagene ovarian cancer (937219) Homo sapiens cDNA done 1MAGE:545342 

T82427 Hs.194101 Homo sapiens cDNA: FU20869 fis, done ADKA02377 

AA078986 gb:zm92h01.s1 Stratagene ovarian cancer (937219) Homo sapiens cDNA done 1MAGES45425 

AA079393 Hs.3462 cytochrome c oxidase subunit Vllc 

AA079487 gbam97f08.s1 Stratagene colon HT29 (937221) Homo sapiens cDNA done 3*. mRNA 
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108500 RC.AA083207 

108501 RCJ\A083256 
gb:M33308 

108533 RCJ^0B4415 
mRNA 

108562 RCJVV085274 
gb:X15341 

108589 RCJ\A088678 
RC^AA100925 
RCJ^101255 
RCJ\A126474 
RCJ^127017 
RCJ\A129968 
RCJ^130240 
RC.AA131866 
RC^AA132039 
RCJ^132983 
RCJU133250 
131474 RCJ\A133583. 
108894 RCJ\A135941 
108941 RCJ^A148650 
IMAGE:587202 3\ 
108968 RC.AA151110 
RCJ\A155754 
RCJ\A156125 
RCJ^156289 
RC AA156997 
RC.AA157291 
RC AA157293 
RCJ\A164293„ 
RC_AA164676 
RCJ^167375 
RC J^167550 
RCJVA176589 
RC JU180448 
RC AA187144, 
RC J\A189170, 
RC_AA192757 
RCJS^205650 
RCJ\A233342 
RCJ^233472 
RC_AA234110 
RC.D80981 
RC.F01660 
RC,F02206 
RC,F0220B 
RC.F02544 
RC.F03918 
RC_F04258.s 
RC^F04600 
RC F08998 
RC_F09605 
RC^F11115 
RC_H06371 
RC H10995 
RC^H11938 
RC^H16558 
RC^H16772 
RC_H18951 
RC^H20859 
RC_H23747 
RC^H38087 
RC.H40331 
RC^H40567 
RC.H46955 
RC.H56640J 
RC H57154 
RC^H96712 
RC^N20814 
RC_N25249 
RC,N27100 
RC^N39616 
RC.N48982 
RC^N51957 
RC.N52Z71 
RC_N59435 



130890 
134585 
130385 
108749 
108807 
108808 
108833 
107290 
108846 
108857 



108996 
109001 
131183 
109019 
109022 
109023 
109066 
109072 
129021 
130346 
109146 
109172 
131080 
129208 
109222 
109300 
109481 
109485 
109516 
109537 
109556 
109577 
109578 
109595 
109625 
131983 
109848 
109671 
109699 
109820 
109933 
110014 
110039 
110099 
110107 
110155 
110197 
110223 
110306 
110335 
110342 
110395 
110511 
110523 
110715 
110754 
130132 
131135 
134263 
110938 
110983 
115062 
111081 



AA083207 
AA083256 

M084415 

AA100796 

AI732404 
AI907537 
D14041 
AW067800 
AA127017 
AI652236 
AA045088 
AF188527 
W27740 
AL117452 
AK001468 
s L46353 
AK001431 
AA148650 

AI304870 
AW995610 
AI056548 
AI611807 
AA156755 
AA1 57291 
AA157293 
.f AA164293 
A1732585 
AL044675 
H05769 
AA176589 
AA180448 
s NM_001955 
.f AI587376 
AA192833 
AA418276 
AA878923 
BE619092 
AI471639 
Ai858695 
A1925294 
F02206 
F02208 
AA078629 
H29490 
AF119665 
H17800 
R59210 
H18013 
AW016809 
R52417 
AL109666 
H11938 
R44557 
AW151660 
AI559626 
AW090386 
H19836 
H38087 
H65490 
H40g61 
AA025116 
H56640 
A1040384 
H9B712 
AW302200 
U55936 
NM_016569 
AW973443 
N46962 
N015367 
AA253314 
AI146349 



Hs.68270 EST 



Hs.68846 

Hs.76698 

Hs.278573 

Hs.155223 

H8.71052 

Hs.49376 

Hs.62738 

Hs.61661 

Hs.323780 

Hs.44155 

Hs.62180 

Hs.2726 

Hs.5105 



Hs.188680 

Hs.332436 

Hs.72116 

Hs.285107 

Hs72150 

Hs.21479 

Hs72168 

Hs.72545 

Hs.22394 

Hs.173081 

Hs.188757 

Hs.142078 

Hs.144300 

Hs.2271 

Hs.109441 

Hs.333512 

Hs.170142 

Hs.289069 

Hs.28465 

Hs,71913 

Hs^898 

Hs.87385 

Hs^98639 

Hs.27214 

Hs^301 

Hs.22697 

Hs.184011 

Hs.7154 

Hs.26634 

Hs.167483 

Hs.323795 

Hs.20945 

Hs.7242 

Hs.21907 

Hs.23748 

Hs.31444 

Hs.93522 

Hs.112278 

Hs.31697 

Hs.105509 

Hs.18845 

Hs.33008 

Hs,33333 

Hs.221460 

Hs.19102 

Hs.269029 

Hs.6336 

Hs.184376 

Hs.267182 

Hs.8086 

Hs^8034 

Hs.10267 

Hs.154103 

Hs.271614 



gb2n08g12^1 Stratagene hNT neuron (937233) Homo sapiens cDNA clone 3^ similar to 
gb:zn06g09.s1 Stratagene hNT neuron (937233) Homo sapiens cDNA done 1MAGE:546686 3', 
gb:zm26c06^1 Stratagene pancreas (937208) Homo sapiens cDNA clone 3' similar to 
ESTs 

stress-associated endoplasmic reticulum protein 1; ribosome associated membrane protein 4 

H-2K binding factor.2 

5tannk)caldn2 

ESTs 

hypothetical protein FU20644 
ESTs 

ESTs. Wealdy simitar to AF174e05 1 F-box protein Fbx25 [Haptens] 
ESTs 

DKFZP586G1517pro^n 

aniUin (Drosophila Scraps homotog), actin bihding protein 
high-mobili^ group (nonhistone chromosomal) protein isoform l-C 
hyi)othetica1 protein FU10569 

gb:zo09e06^1 Strategene neuroepithelium NT2RAMI 937234 Homo sapiens cDNA done 

ESTs 
EST 

hypothetical protein FU20992 similar to hedgehog-interacting protein 

hypothetical protein FLJ13397 

ESTs 

ubinude'm 1 

ESTs 

ESTs 

hypothetical protein FU10893 
KIAA0530 protein 

Homo sapiens, done MGC:5S64, mRNA, oomplete cds 

EST 

EST 

endothelin 1 
(\4STP033 protein 
similar to rat myomegann 
ESTs 

hypothetzca) protein FU21016 

Homo sapiens cDNA: FU21869 fis. done HEP02442 

ESTs 

ESTs 

ESTs 

Homo sapiens potassium channel subunit (HERG-3) mRNA, complete cds 

ESTs 

ESTs 

ESTs 

pyrophosphatase (Inorganic) 

ESTs 

ESTs 

ESTs 

ESTs 

Homo sapiens done 24993 mRNA sequence 

Homo sapiens mRNA full length Insert cDNA done EUROIMAGE 35907 

histone acetylfransferase 

ESTs 

ESTs 

Homo sapiens mRNA for KIAA1647 protein, partial cds 

arrestin. beta 1 

ESTs 

CTl2gene 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs, WealQly similar to organic anion transporter 1 |H.sapiens) 
ESTs 

K1AA0672 gene product 
synaptosomal-assodated protein. 23kD 
TBX3-tso protein 

RNA touanine-7-) methyltranslerase 

Homo sapiens cDNA FU12924 fis. done NT2RP2004709 

Mill protein 

LIM protein (similar to rat protein kinase C-bind&ig enigma) 
CGM12 protein 
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111128 


RC_N64139 


AW505364 


Hs.19074 


135244 


RC N66981 


AI834273 


Hs.9711 


111216 


RC_N6a640 


AW1 39408 


Hs. 152940 


437562 


RC.N69352 


AB001636 


HS.56B3 


131002 


RC^N95226 


AL050295 


Hs.22039 


111399 


RC R00138 


AW270776 


Hs.18857 


111514 


RC^R07998 


R07998 




similar to 






130182 


RC„R08929 


BE267033 


Hs. 192853 


111574 


RC_R10307 


AI024145 


Hs. 188526 


111804 


RC_R33354 


AA482478 


Hs.181785 


111831 


RC_R36083 


R36095 


Hs.268695 


129675 


RC_R37938J 


NIVL015556 


Hs.172180 


111904 


RC.R39330 


241572 




sequence 






133868 


RC_R40816_s 


AB012193 


Hs.183874 


112033 


RC_R43162LS 


R49031 


Hs^627 


130987 


RC_R45698 


BE613269 


Hs.21893 


112300 


RC_R54554 


H24334 


Hs.26125 


112513 


RC.R68425 


R68425 


Hs.13809 


112514 


RC^R68568 


R68568 


Hs.183373 


112522 


RC R68763 


R68857 


Hs.265499 


112540 


RC^R70467 


R69751 




130346 


RC_.R73565 


H05769 


Hs. 188757 


129534' 


RC.R73640 


AK002126 


Hs. 11260 


112597 


RC^R78376 


R78376 


Hs.29733 


112732 


RC^R92453 


R92453 


Hs.34590 


131458 


RC_T03865 


BE297567 


Hs.27047 


112888 


RC_T03872 


AW1 95317 


Hs.107716 


131863 


RC_T10072 


AI656378 


Hs.33461 


112911 


RC_T10080 


AW732747 


Hs.13493 


132215 


RC_T10132 


AL035703 


Hs.4236 


112931 


RC^T15343 


T02966 


Hs.167428 


112984 


RC_T23457 


T16971 


Hs.289014 


112998 


RC_T23555 


H11257 


Hs.22968 


133376 


RC_T23670 


BE618768 


Hs.7232 


113026 


RC^T23948 


AA376654 


Hs. 183684 


113070 


RC_T33464 


AB032977 


Hs.6298 


128970 


RC^T34413 


A1375672 


Hs.165028 


113074 


RC^T34611 


AK001335 


Hs.31137 


113095 


RC„T40920 


AA828380 


Hs.126733 


113179 


RC^T55182 


BE622021 


Hs.152571 


113337 


RC_T77453 


T77453 


Hs.302234 


113421 


RC.T84039 


AI7G9400 


Hs.189729 


113454 


RCjr86458 


AI022166 


Hs.16168 


113481 


RC.T87693 


T87693 


Hs.204327 


131441 


RC.T89350_s 


AA302862 


Hs.90063 


113557 


RC^T90945 


H66470 


Hs.16004 


113559 


RC_T90987 


T79763 


Hs.14514 


113589 


RC^T91863 


AI078554 


Hs.15682 


113591 


RC T91881 


T91881 


Hs.200597 


113619 


RC_T93783_S 


R08665 


Hs. 17244 


113683 


RC^T96687 


AB035335 


Hs.144519 


113692 


RC^T96944 


AL360143 


Hs.17938 


113702 


RC^T97307 


T97307 




mRNA 








113717 


RC,T97764 


T99513 


Hs.187447 


113824 


RC.W48817 


At631964 


Hs.34447 


113840 


RC.W58343 


R72137 


Hs.7949 


113844 


RC W59949 


AI369275 


Hs.243010 


PROTEIN TCIO 






113902 


RC W74644 


AA340111 


Hs.100009 


113904 


RC.W74761 


AF125044 


Hs.19196 


113905 


RC^W74802 


R81733 


Hs.33106 


113931 


RC^W81205 


BE255499 


Hs.3496 


113932 


RC^W81237 


AA256444 


Hs.l 26485 


131965 


RC^W90146J 


W792o3 


Hs.35962 


114035 


RC^W92798 


W92798 


Hs.269181 


114106 


RC^Z38412 






133593 RC^Z38709 


AI416g88 


Hs.238272 


114161 


RC^Z38904 


BE54B222 


Hs.299883 


424949 


RC^Z39103 


AF052212 


Hs.153934 


129059 RC_Z39930J 


AWG69534 


Hs.279583 


128937 


RC,.239939 


AA251380 


H&10726 


WARNING 






130983 RC_Z40012J 


AI479813 


Hs.278411 



LATS (large tumor suppressor, Drosophila) homolog 2 

novel protein 

ESTs 

DEAD/H (Asp-GIU'Ala-Asp/Hls) box polypeptide 15 

KtAA0758 protein 

ESTs 

gb:yf16g1 1.s1 Scares fetal liver spleen 1NFLS Homo sapiens cDNA clone IMAGE:127076 3' 

ublquitin-conjugating enzyme E2G 2 (homologous to yeast UBC7) 

ESTs 

ESTs 

ESTs 

KIAA0440 protein 

gb:HSCZYB122 normalized infant brain cDNA Homo sapiens cDNA done c-zyb12. mRNA 

cullin4A 
ESTs 

hypothetical protein DKFZp761N0624 
ESTs 

hypothetical protein FLJ10648 

src homology 3 domain-containing protein HIP-55 

ESTs 

gb:yi40a10.s1 Soares placenta Nb2HP Homo sapiens cDNA done 3\ mRNA sequence 

Homo sapiens, done MGC:5564, mRNA, complete cds 

hypothetical pnotein FLJ11264 

EST 

ESTs 

hypothetical protein FU20392 
hypothetical protein FU22344 
ESTs 

like mouse brain protein E46 
KIAA0478 gene product 
ESTs 

ESTs. Weakly similar to A43932 mudn 2 precursor, intestinal [H.sapiens] 

Homo sapiens clone 1MAGE:451939, mRNA sequence 

acetyl-Coenzyme A carboxylase alptia 

eukaryotic translation initiation factor 4 ganfnna, 2 

K1AA1151 protein 

ESTs 

protein tyrosine phosphatase, receptor type, E 
ESTs 

ESTs, Highly similar to IGF-II mRNA-bindlng protein 2 [H.sapiens) 

ESTs 

ESTs 

ESTs 

EST 

neurocaldn delta 
ESTs 
ESTs 
ESTs 

KIAA0563 gene product 
hypothetical protein FU13605 
T-cell leukemiayiymphoma 6 
DKFZP434H132 protein 

gb:ye53h05.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA done 1MAGE:121497 3', 

ESTs 
ESTs 

DKFZP586B2420 protein 

Homo sapiens cDNA FU14445 fis. done HEMBB1001294, highly similar to GTP^INDING 

acykCoenzyme A oxidase 1. patmitoyi 
ublquitin-conjugating enzyme HBUCE1 
ESTs 

hypothetical protein MGC15749 

hypothetical protein FU12604; K1AA1692 protein 

ESTs 

ESTs 

gb:RC5-BT0552-260100-011-A02 BT0562 Homo sapiens cDNA, nnRNA sequence 
inositol 1 ,4.5-triphosphate receptor, type 2 
hypothetical protein FU23399 

core4>indin9 fador, runt domain, alpha subunit 2; translocated to. 2 
' CGI-81 protein 

ESTs, Weakly similarto ALU1 J^UJWVN ALU SUBFAMILY J SEQUENCE CONTAMINATION 
NCK-assodated protein 1 
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114364 
132900 
129034 
1316B1 
452461 
114465 
131376 
101567 
431555 
132944 
114818 
130274 
110330 
114648 



114277 RC_Z40377„s 
[Cetegans] 
114304 RC.240820 
RC.Z41680 
RCJ\A005112 
RCJVA005432 
RCJ\A010163 
RCJW026356 
RC_M026901 
RCJV\036867 
RCJ^044644 
RCJ^046426 
RCJVA054515 
RCJ\A084162 
RCJVA085749 
RCJ\A098874 
RCJ\A101056 
1MAGE:548429 3' 
114658 RC_AA102746 
132456 RCJ\A114250. 
131319 RCJ\A126561. 
132225 RCJ\A128980. 
IMAGE:567164 3' 
132669 RCJ\A129757 
RC_AA129921 
RCJ\A133331 
RCJ\A135958 
RCJVA136524. 
RC_M147044 
RC_AA1488B5 
RC_AA150043 
RC_AA151621 
RC^AA155743 
RCJ^I 56335 
RCJ\A155336 
RC_AA159181 
RC_M159825 
[Celegans] 

114828 RC.AA234185 
RCJ\A234929 
RCJSA234935 
RCJ\A236359 
RCJV^236466 
RC_AA236535 
RCJ'^36935. 
RCJiiA236942 
RCJVA237018 
RC_AA237025 
RCJ^42751 
RC^242760 
RCJU242763 
RCJ\A242809 
WARNING 

132454 RC_AA243133 
RCJ^243495 
RCJVA243706 
RCJ\A250848 
RC^AA250868 
RC_AA251152 
RCJ\A251544. 
RCJ\A251792 
RCJ\A252063 
RCJ^252144 
RC_AA252524 
RCJVA253461 
RCJ^55522 
RECEPTOR. 
115114 RCJ^256468 
RCJ\A256528 
RCJV^257976 
RCJ\A258296 
RCJ\A258409 
RCJ\A258421 
RCJ\A262077 
RCJ\A276650 
RCJVA278766 



114709 
131973 
114750 
115714 
114763 
114767 
114774 
129388 
129163 
128869 
130207 
114798 
114800 



114846 
114848 
114902 
132271 
114907 
135159 
132204 
114928 
132481 
114932 
314162 
131006 
114935 



437754 
114957 
114974 
114977 
114995 
115005 
417177 
131889 
115026 
115045 
115068 
133138 



129584 
115137 
134312 
115166 
115167 
129807 
115239 
115243 



AI052229 H$.25373 ESTs. Weakly similar to T20410 hypothetical protein E02A10.2 - Caenorhabditis elegans 

Ar934204 Hs.16129 ESTs 

AL117427 Hs.172778 Homo sapiens mRNA; cDNA DKFZp566P013 (from clone DKFZp566P013) 

AA777749 Hs.5978 UM domain only 7 

AA481157 Hs.108110 OKFZP547E21 10 protein 

AW361016 Hs.3363 upstream regulatory element binding protein 1 

N78223 Hs.108106 transcription factor 

BE621056 Hs.131731 hypothetical protein FU 11 099 

AK001644 Hs.26156 hypothetical protein FU 10782 

M33552 Hs.56729 lysosomal 

AI815470 Hs.260024 Cdo42 effector protein 3 

T96641 Hs.6127 . Homo sapiens cDNA: FU23020 fis. done LNG00943 

AW979261 Hs.291993 ESTs 

AA128376 Hs.153884 ATP binding protein associated with cell dlfTerenffatfon 

AI288666 Hs.16621 DKFZP434I1 16 protein 

AA101056 gbum25b03^1 Stratagene neuroepithelium NT2RAMI 937234 Homo saptens cDNA done 

AA1 02363 Hs.2491 90 tumor necrosis factor receptor superfamily, member 10a 

.s AB01 1084 Hs.48924 KIAA0512 gene product; ALEX2 

.s Ni\/L003155 Hs.25590 stanniocalcin 1 

} AA128980 gb:zo09a1 1.s1 Stratagene neuroepithetium NT2RAM1 937234 Homo sapiens cDNA done 

W38586 Hs.293981 guanine nudeotide binding protein (G protein), gamma 3, linked 

AA397651 Hs.301959 proline synthetase co-transcribed (bacterial homolog) 

AB018264 Hs.158688 KIAA0741 gene produd 

AA887211 Hs.129467 ESTs 

.3 T19228 Hs.172572 hypothetical protein FU20093 

AA810755 Hs.88977 hypothetical protein dJ511E16.2 

AI859865 Hs.154443 mfnichromosome maintenance defident (S. cerevisiae) 4 

AV656017 Hs.184325 CGI-76 protein 

AA662477 Hs.110964 hypothetical protein FU23471 

BE561 824 Hs^73369 uncharacterized hematopoietic stem/pnsgenitor cells protein MDS027 

AA768242 Hs.80618 hypothetical protein 

AF044209 Hs.l 44904 nudear receptor compressor 1 

AA1 59181 Hs.54900 serologically defined colon cancer antigen 1 

Z19448 Hs.131d67 ESTs, WeaMy similar to T24396 hypothetical protein T03F6^ - Caenoriiabditis elegans 

AA252937 Hs^63522 Homo sapiens mRNA; cDNA DKFZp434J1912 (from done DKFZp434J1912) 

BE016662 ' H^166196 ATPase. Class I. type 88, member 1 

BE614347 Hs.169815 hypothetical protein FU20989 

AW275480 Hs.39504 hypothetical protein M6C4306 . 

A6030034 Hs.115175 sterile^lpha motif and teudne zipper containing kinase AZK 

N29390 Hs.13804 hypothetical protein dJ462023^ 

s U43374 Hs.95631 Human nonnal keratinocyte mRNA 

AA235827 Hs.42265 ESTs 

AA237016 Hs.94869 ESTs 

W93378 Hs.49614 ESTs 

AA971436 Hs.16218 KIAA0903 protein 

6E041820 Hs.38516 Homo sapiens, done MGC:158d7, mRNA, complete cds 

AF064104 Hs.221 16 CDC14 (cell division cyde 14. S. cerevisiae) homolog B 

H23329 Hs.290d80 ESTs. Weakly similar to ALU1.HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 

BE296227 H5.250822 serine/threonine l^inase 15 

R60366 Hs.5822 Homo sapiens cDNA: FU22120 fis, clone HEP18874 

AW170425 Hs.87680 ESTs 

AW966931 Hs.179662 nudeosome assembly protein 1-like 1 

AW296978 Hs.87787 ESTs 

AA769266 Hs.193657 ESTs 

s AI760825 Hs.111339 ESTs 

NM.004458 Hs.81452 fotty-add-Coenzyme A ligase. kmg-chain 4 

NIVL002589 Hs.34073 BH-protocadherin (brain-heart) 

AA251972 Hs.l88718 ESTs 

AW014549 Hs.58373 ESTs 

AW512260 Hs.87767 ESTs 

AV657594 Hs.181161 Homo sapiens cDNAFU14643 fis. done NT2RP200159A weakly stmilarto RYANODINE 

AA527548 Hs.7527 small fragment nuclease 

AV656017 Hs.184325 CGI-76 protein 

AWg68304 H&56156 ESTs 

AB011151 Hs.334659 hypothetical protein M6C14139 

AF095727 Hs.287832 myelin protein zero-like 1 

AA749209 Hs.43728 hypothetical protein 

Y11192 Hs.5299 aldehyde dehydrogenase 5 femily, member A1 (sucdnate-semlaldehyde dehydrogenase) 

BE251328 Hs73291 hypothetical protein FU10681 

AA806600 Hs.116665 KIAA1842 protein 
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100850 RC^AA279667_s 
126884 RC_AA280791 
115322 RC_AA280819 
133626 RC_AA280828 
115372 RC_AA282195 
132825 RC^AA283127_s 
130269 RC^AA284694 
129192 RC^AA291137 
452598 RC_AA291708 
WARNING 

132131 RC_AA293495 
115536 RC_AA347193 
132411 RC_AA398474_s 
115575 RC_M398512 
115601 RC_AA400277 
WARNING 

103928 RC_AA40D896 
125819 RC_AA404494 
115683 RC^AA410345 
115715 RC_AA416733 
132952 RC.AA425154 
115819 RC_AA426573 
132525 RCJ\A431418 
115895 RCJ\A436182 
132333 RCJ\A437099 
115962 RC_AA446585 
115967 RC_AA446887 
115974 RC^AA447224 
115985 RC_AA447709 
129254 RC_AA453624 
133071 RCJ\A455044 
116095 RC_AA456045 
122691 RC_AA460454_s 
116210 RC_AA476494 
116213 RC_AA476738 
134585 RC_AA481422 
134790 RC_AA482269 
116265 RC_AA482595 
129334 RC_AA485084„s 
116274 RCJUV485431,s 
303150 RC_AA489057 
129945 RC_AA489638 
116331 RC^AA491000 
116333 RC_AA491250 
132994 RC_AA505133 
134577 RC^AA598447 
116391 RC_AA599243 
116394 RC.AA599574J 
134531 RCJVA600153 
116417 RCJ\A6093D9 
116429 RCJ^e09710 
116439 RCJU610068 
116459 RCJU\621399 
427505 RCJ\A621752 
132699 RC_C21523 
116541 RC_D12160 
132557 RC_D19708 
112259 RC^D25801 
116571 RC^D45652 
sequence. 

129815 RC_D60208_f 
421919 RC_O80504^s 
116643 RC_F03010 
116661 RC„F04247 
repetitive 

116715 RC_F10966 
116729 RC^F13700 
318709 RC^H05063 
134760 RC.H16758 
116773 RC^H17315_s 
106425 RC,H22556 
116780 RC.H22566 
131978 RC^H48459_s 
116819 RC.H53073 
111428 RC_H56559_s 
133175 RC.H57957_S 



AA836472 

U49436 

L08895 

AW836130 

AW014385 

U82671 

F05422 

AA286914 

A1831594 

AF069291 
AK001468 
AA059412 
AA393254 
AA148984 

D14540 

AA044840 

AF255910 

BE395161 

AI658580 

AA486620 

AW292809 

AB033035 

AA192669 

AI636361 

AI745379 

BE513442 

AA447709 

AA252468 

BE3d4932 

AA043429 

R19768 

BE622792 

AA292105 

D14041 

BE002798 

BE297412 

AW1 57022 

AI129767 

AA887146 

BE514376 

N41300 

AF1 55827 

AA1 12748 

BE244323 

T86558 

NM.006033 

AI742645 

AW499664 

AF191018 

AA251594 

R80137 

AA361562 

AW449822 

D12160 

AAl 14926 

AA337548 

D45652 

BE565817 
AJ224901 
AI367044 

ALl 17440 

BE549407 

R52576 

NNL000121 

A1823410 

H24201 

H22566 

AA355925 

H53073 

AL031428 

AW955632 



Hs.297939 

Hs.286236 

Hs.78995 

Hs.75277 

Hs.88678 

Hs.57698 

HS.16B352 

Hs.183299 

Hs.68647 

Hs,40539 
Hs.62180 
Hs.47986 
Hs.43619 
Hs.48849 

Hs.199160 

Hs.251871 

Hs.54650 

H5.1390 

Hs.61426 

Hs.41135 

Hs.50727 

Hs.51965 

Hs.45032 

Hs.179550 

Hs.42911 

Hs^8944 

Hs.268115 

Hs.1098 

Hs.64313 

Hs.62618 

Hs.172788 

Hs.172788 

Hs.326740 

Hs.278573 

Hs.287850 

Hs.55189 

Hs.4947 

Hs.162874 

Hs.8217 

Hs.165998 

Hs.71616 

Hs.203963 

Hs.279905 

Hs.85951 

HS75113 

Hs.65370 

Hs.110713 

Hs.12484 

Hs.279923 

Hs.43913 

Hs.302738 

Hs.178761 

Hs.55200 

Hs.249212 

Hs.5122 

Hs.333402 



Hs.26498 

Hs.109526 

Hs.153638 



Hs.170263 

Hs.1 15823 

Hs.285280 

Hs.89548 

Hs.1 69149 

Hs.247423 

Hs.30098 

Hs.36232 

Hs.93698 

Hs.174174 



cathepsin B 
KIAA1856 protein 

MADS box transcription enhancer factor 2. polypeptide C (myocyte entiancer factor 2C) 

hypothetical protein FU13910 

ESTs, Weakly similar b Unknown [H.sapiens] 

Empirically selected from AFFX single probeset 

nucleoporin-like protein 1 

ESTs 

ESTs. Weakly similar to AI.U7_HUMAN ALU SUBFAMILY SQ SEQUENCE CONTAMINATION 

chromosome 8 open reading frame 1 

anillin (Drosophila Scraps homolog). actin binding protein 

hypothetical protein MGC10940 

ESTs 

ESTs, Weakly similar to ALU4_HUMAN ALU SUBFAMILY SB2 SEQUENCE CONTAMINATION 

myeloid/lymphoid or mixed-Itneage leukemia (trithorax (Drosophila) homolog) 
CTP synthase 

junctional adhesion molecule 2 

proteasome (prosome, macropain) subunit, beta type, 2 

Homo sapiens mesenchymal stem cell protein DSC96 mRNA, partial cds 

endomudn>2 

N*acetytgtucosaminklase. alpha- (Sanfilippo disease illB) 

KIAA1209 protein 

ESTs 

hypotheOcal protein M6C10702 
ESTs 

hypothetical proteni FLJ10631 

ESTs, Weakly similar to T08599 probable transcription factor CA15D {H.sapiens] 
DKFZp434J1813 protein 

ESTs. Weakly similar to AF257182 1 G-protein-coupled receptor 48 [Helens] 
ESTs 

ALEX3 protein 
ALEX3 protein 

hypothetical protein M6C10947 

H-2K binding factor-2 

integral membrane protein 1 

hypothetical protein 

hypothetical protein FLJ22584 

guanine nucleotide binding protein (G protein) alpha 12 

stromal antigen 2 

PAI-1 mRNA-blnding protein 

Homo sapiens mRNA; cDNA DKFZp586N1720 (from clone OKFZp586N1720) 

hypothetical protein FU10339 

clone HQ0310PRO0310p1 

exportin, tRNA (nuclear export receptor for tRNAs) 

general transcription factor illA 

lipase, endothelial 

DEK oncogene (DNA binding) 

Human done 23826 mRNA sequence 

putative nucleotide binding protein, estradtoMnduoed 

PI6F1 gene product 

Homo sapiens cDNA: FU21425 fis. ck)ne COL04162 
268 proteasome-assodated padi homolog 
ESTs 

polymerase (RNA) III (DNA direded) (155kD) 
ESTs 

hypothetical protein MGC12760 

gb:HUMGS02d48 Human adult lung 3' directed Mbot cDNA Homo sapiens cONA 3', mRNA 

hypothetical protein FLJ21657 
zinc finger protein 198 

myelokl/lymphoid or mixed-lineage leukemia 2 

gb7h1^03^1 Soanes in^t brain INfB Homo sapiens cDl^ done 3' similar to contains Mu 

tumor protein p53-binding protein, 1 

n'bonudease P, 40kD subunit 

Homo sapiens cONA: FLJ220g6 fis, done HEP16953 

erythropoietin receptor 

kaiyopherin alpha 1 (importin alpha 5) 

addudn 2 (beta) 

ESTs 

KIAA0186 gene produd 
EST 

KIAA0601 protein 

ESTs» Weakly similar to S19560 proline-rich prote&n MP4 - mouse [Mjnuscutus] 
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116B44 RC.H64938_s 


H64938 


Hs.337434 


116845 RCJ64973 


AA649530 




116B92 RC.H69535 


AI573283 


Hs.38458 


116925 RC H73110 


H73110 


Hs.260603 


116981 RC H81783 


N29218 


Hs.40290 


131768 RC H86259 


AC005757 


Hs.31809 


117031 RCJ88353 


H88353 




contains LI 






117034 RC H88639 


U72209 


Hs.1 80324 


132542 RC H88675 


AL137751 


Hs.263671 


134403 RC.H93708.S 


AA334551 


Hs.82767 


117280 RC N22107 


M18217 


Hs.172129 


117344 RC_N24046 


R19085 


Hs.210706 


117422 RC N27028 


A1355562 


Hs.43880 


117475 RC_N30205 


N30205 


Hs.93740 


117487 RC.N30621 


N30621 


Hs.44203 


130207 RC^N33258 


AF044209 


Hs.144904 


117549 RC.N33390 


N33390 


Hs.44483 


117683 RC N40180 


N40180 




IMAGE:276387 3*sImaarto 




117710 RC_N45198 


N45198 


Hs.47248 


104514 RC N45979^s 


AF164622 


Hs.182982 


117791 RC^N48325 


^M8325 


Hs.93956 


117822 RC„N48913 


AA706282 


Hs.93963 


129647 RC_N49394 


AB018259 


Hs.1 18140 


117895 RC_N50656 


AW450348 


Hs.93996 


[H.sapiens] 






131557 RC_N50721 


AA317439 


Hs.28707 


133057 RC_N53143 


AA465131 


Hs.64001 


118103 RC_N55326 


AA401733 


Hs.184134 


118111 RC,N55493 


N55493 




mRNA 






118129 RC N57493 


N57493 




IMAGE:2773583',mRNA 






118278 RC_N82955 


N62955 


Hs.316433 


118329 RC.N63520 


N63520 




3*, mRNA 






118336 RC.N63604 


BE327311 


Hs.47166 


132457 RC.N64166 


AB017365 


Hs.173859 


118363 RC.N64168 


An83838 


Hs.48938 


118364 RC.N64191 


N46114 


Hs.29169 


118475 RC^N66845 


N66845 




sfmilar to 






118491 RC.N67135 


AV647908 


Hs.90424 


118500 RC_N67295 


W32889 


Hs.154329 


101663 RCJ468399 


NM»003528 


Hs.2176 


118584 RC^N68963 


AW136928 





sequence 

421983 RC^N69331 A1252640 

118661 RCJ17Q777 AL137554 

118684 RC_N71364_s N71313 

118689 RC N71545_s AW390601 

118690 RC_N71571 N71571 
118765 RC^N74456 N74456 
118793 RC_N75594 N75594 
118817 RC N79035 AI668658 
118844 RC N80279 AL035364 
118919 RC^N91797 AW452696 
129558 RC N92454 AW580922 
132692 RC.N94581 AW191962 
118996 RC_N94746 N94746 
119021 RC.N98238 N98238 
119039 RC.R02384 AI160570 
119063 RC_R16833 R16833 
WARNING 

118523 RC.R41828.S Y07759 

119111 RC.R43203 T02865 

133970 RC_R46395 AA214228 

119146 RC.R56863 R58863 

120296 RC.R78248 AW995911 

119239 RCjri1483 T11483 
sequence. 

119281 RC_T16896 A1692322 

119298 RC^T23820 NM^001241 

126502 RC_T30222 T10077 

135073 RC.W15275_s W55956 



Hs.1 10364 

Hs.49927 

Hs.163986 

Hs.184544 

Hs.269142 

Hs.50499 

Hs.285921 

Hs.50797 

Hs.50891 

Hs.130760 

Hs.180446 

Hs.249239 

Hs.274248 

Hs.55185 

Hs^52097 

Hs.53106 

Hs.170157 

Hs.328321 

Hs.127751 

Hs.91815 

H5299883 



Hs.65373 
Hs.155478 
Hs.13453 
Hs.94030 



ESTs, Weakly similar to A46010 X-linked reOnopathy protein [H^piens] 
gbuis44105.s1 NCLCGAP J^tvl Homo sapiens cDNA done. mRNA sequence 
ESTs 

ESTs, l\^oderately similar to A47582 B^II growth ^ctor precursor [H.saptens] 
ESTs 

hypothetical protein 

gb:yw21a02.s1 Morton Fetal Cochfea Homo sapiens cONA done f MAGE*.252842 3' similar to 
YYI-assodated factor 2 

Homo sapiens mRNA; cDNA DKF2p43410812 (from done DKFZp434l0812); partial cds 
spemf) spedTic antigen 2 

Homo sapiens cDNA: FIJ21409 tis. done COL03924 

Homo sapiens cDNA FU13182 fis, done NT2RP3004070 

ESTs. Wealdy similar to A46010 X-linked retinopathy protein IH^sapiensJ 

ESTs. Weakly similar to 138022 hypothetkal protein IH.sapiens] 

ESTs 

nudear receptor compressor 1 
EST 

gb7y44d023l Soares_multiple_sderosls.2Nt)HMSP Homo sapiens cDNA done 

ESTs. Higlily simOar to simitar to Cdc14B1 phosphatase [H.sapiens] 

golgin-67 

EST 

ESTs 

KIAA0716gene produd 

ESTs, Highly similar to SORU.HUMAN SORTILIN-RELATED RECEPTOR PRECURSOR 

signal sequence receptor, gamma (translocon-associated protein gamma) 

Homo sapiens done 25218 mRNA sequence 

ESTs 

gb7v50d)2.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA done IMAGE246146 3', 

gb7yS4ti08,s1 Soare8.multlple.sderosls.2NbHMSP Homo sapiens cDNA ctone 

Homo sapiens d3NA FU11375 fis, done HEMBA100041 1 , weakly similar to ANKYRIN 
gb:yy62fOU1 SoaresjTUjitiple.sclerosts^bHMSP Homo sapiens cDNA done IMAGE:278137 

HT021 

frizzled (Drosophila) homotog 7 
hypothetk:al protein FLJ21802 
hypothetical protein FU22623 

gb'^6c11.s1 Soares fetal Dver spleen 1NFLS Homo sapiens cDNA done IMA6E:295604 3* 

Homo sapiens cDNA: Fy23285 fis, done HEP09071 
ESTs 

H2B histone family, member Q 

gb;UI-H-BJ1-adp^8^J.s1 NCLCGAP_Sub3 Homo sapiens cDNA done 3', mRNA 

peptidylprolyl teomerase C (cydophilin C) 
protein kinase NYD-SP15 

Homo sapiens cONA: FU22765 fis, done KAtA1180 
Homo sapiens, done IMAGE:3355383. mRNA, partial ods 
ESTs 
EST 

ESTs. Moderately similar to T47135 hypothetical protein DKFZp761L0812.1 tH.sapiensl 
ESTs 

hypothetical protein 

myosin phosphatase, target subunit 2 

karyopherin (tmporlin) beta 1 

collagen, type VIII. alpha 2 

hypothetical protein FU20758 

ESTs 

pregnancy specific beta-1-g)yooproteln 6 

ESTs, Moderately similar to ALU1.HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 

myosin VA (heavy polypeptkie 12, myoxin) 
EST 

hypothetteal protein 
ESTs 

hypothetteal protein FL J23399 

gb:CHR90049 Chromosome 9 exon Homo sapiens cDNA done 111-1 5* and 3*. mRNA 

ESTs. Weakly similar to T02345 hypofiiettoal protein K1AA0324 [H.sap!ens] 
cydinT2 

hypothetteal protein FLJ14753 

Homo sapiens mRNA; cDNA DKFZp586E1624 (from done DKFZp586E1624) 
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119558 


RC W38194 


Woo 194 




132736 


RC_W42414^S 


AVvOo1oo3 


ns2882Dl 


sapiens mad protein 






132173 


RC W46577^s 


Xo942o 


Hs.41716 


134873 


RC^W49632_s 




nS.90449 


119650 


RC^W57613 


HoZMZ 


nS.79o5o 


119654 


RC W57759 


Vvo//by 




similar to 






119683 


RC.W61118 




rtS.O/ooo 


119694 


RC.W65344 


AA04ii»0 


rlS.a/o47 


119718 


RC^W69216 


Wo92lD 


Ks.92848 


133010 


RC_W69379 


Ai287518 


Ks.62669 


119938 


RC.W86728 


AW014862 


Hs.58885 


120128 


RC_Z38499 


BE379320 


Hs.91448 


120130 


RC_Z38630 


AA045767 


ns.5300 


120148 


RC.Z39494 


rUZoQo 


Hs.65765 


120155 


RC_239623 


Z39623 


ns.o57o3 


131486 


RC Z40071^s 


F06972 


Hs.27372 


120183 


RC_240174 


AW082866 


nS.D5oo2 


120184 


RC Z40182 


Z40182 


nS.ooooS 


120211 


RC Z409D4 


240904 


HS.DDUiZ 


120245 


RC_AA166955 


AW959615 


Lin •I 4 \r\Ac 


120247 


RCJ\A167500 


AA167500 


nS. 103903 


120254 


RCJV^169599_s W90403 


nS.1110o4 


120259 


RC^AA171724 


AW014786 


HS. 19274^ 


120260 


RC AA171739 


AK000061 


Lt^ Ai\A er\i\ 

Hs.101590 


120275 


RC^AA177105 


AA177105 


HS.78457 


120284 


RC_AA182626 


AA179656 




to contains 






114056 


RC^AA186324 


AA188175 


Hs.82506 


129507 


RCJW192099 


AJ236885 


Hs. 11 2180 


120302 


RC AA192173 


AA837098 


Hs.269933 


120303 


RC AA192415 


AI216292 


Hs.96184 


120305 


RC_AA192553 


AW295096 


Hs. 101 337 


120319 


RC AA194851 


T57776 


Hs.191094 


133389 


RC_AA195520^s AA195764 


ns.72oo9 


120326 


RCJ\A196300 


AA196300 


Hs.21145 


134272 


RCJ\A198517 


X76040 


Hs.278614 


133145 


RC__AA196549 


H94227 


lis.6592 


120327 


RCJ\A196721 


AK000292 


HS.27B732 


106688 


RG_AA196729J N66397 


1 an J n Air 

Hs.334825 


120328 


RC_AA196979 


AA923278 


Hs.290905 


120340 


RCJ\A206828 


AA206828 




similar to 






134292 RCJkA207123 


Aig06291 


Hs.81234 


131522 


RC_AA214539J AI380040 


Hs.239489 


129051 


RC^AA226914_s AA227068 


Hs. 108301 


120375 


RC AA227260 


AF028706 


Hs.111227 


120376 


RC_AA227469 


AA227469 




IMAGE:663732 3', mRNA sequence. 




120390 


RC_AA233122 


AA837093 


U_ AAA At!f\ 

nS.m4D0 


303876 


RC JVA233334^S U64820 


rlS.DD521 


dominant, ataxin 3) 






132038 


RC^AA233347 


A{825842 


Hs.3776 


104463 


RC AA233519 


T85825 


Hs.246885 


125750 


RC_AA233714 


AA018515 


Hs,264482 


120396 


RCJ\A233796 


AA134006 


Hs.79306 


120409 


RC_AA235050J AA235050 




gb:L07077 






120414 


RC_AA235704 


AW137156 


Hs.181202 


120420 


RC^AA236031 


AI128114 


Hs. 112885 


120422 


RC_AA236352 


AL133097 


Hs.301717 


132221 


RCJ\A236390_s W94915 


Hs.42419 


120423 


RCJ\A236453 


AA236453 


Hs.18978 


120435 


RCJU^43370 


AA243370 


LI^ f\OACf\ 

nS.96450 


120453 


RCJ^A250947 


AA250947 


Urn, 4TrAoeo 

nS.1 70263 


120455 


RC_AA251083 


AA251720 


HS. 104347 


120456 


RCJ\A251113 


AA488750 


ij_ aaA 4 A 

Hs.88414 


120473 


RCJ\A251973 


AA251973 




128922 


RCJ\A252023 


AI244901 


Hs.9589 


120477 


RCJ\A252414 


AA252414 


Hs.43141 


120479 


RCJ\A252650 


AF006689 


Hs.1 10299 


120488 


RC_AA255523 


AWg52916 


Hs.63510 


120510 


RCJVA25&128 


AI796395 


Hs.111377 


120527 


RCJkA262105 


AA262105 


Hs.4094 


120528 


RCJ\A262107 


At923511 


H5.104413 



Empirically selected from AFFX single probeset 

Homo sapiens cDNA: FU23037 fis. clone LNG02036, highly similar to HSU68019 Homo 

endothelial cell-specific molecule 1 
Human clone 23908 mRNA sequence 

ESTs. Weakly similar to S65657 aipha-IC-adrenergic receptor splice form 2 [H.sapiens] 
gb:zd20g11.s1 SoaresJetaLhearLNbHHIOW Homo sapiens cDNA clone IMAGE:341252 3' 

ESTs 

ESTs. Moderately similar to 1CE4_HUMAN CASPASE-4 PRECURSOR [Rsapiens] 
ESTs 

Homo sapiens mRI4A; cDNA DKFZp586D0923 ((rom done OKFZp586D0923) 
ESTs 

MKP-1 like protein tyrosine phosphatase 
bladder cancer associated protein 
ESTs 
ESTs 

BMX non-receptor tyrosine kinase 

ESTs 

EST 

EST 

ESTs 

EST 

ESTs 

hypothetical protein FU12785 
hypothetical protein 

solute carrier family 25 (mitochondrial canier, ornithine transporter) member 15 

gb:zp54e1 1 .si Stratagene NT2 neuronal precursor 937230 IHomo sapiens cONA done 3* similar 

KIAA1254 protein 

zinc finger protein 148 (pHZ-52) 

ESTs 

ESTs 

uncoupling protein 3 (mitochondrial, proton carrier) 

ESTs 

ESTs 

hypothetical protein RG083M05.2 
protease, serine. 15 

Homo sapiens, done IMAGE:2961368. mRI^JA, partial cds 
hypothetical protein FU20285 
Homo sapiens cDNA FU14752 fts, done NT2RP3003071 
ESTs. Weakly similar to protease [H.sapiens] 

gb:zq80b08.s1 Stmtagene hNT neuron (937233) Homo sapiens cDNA done iMAGE:647895 3" 

irranunoglobulin superiiamily, member 3 

T(A1 cytotoxic granule-assodafed RNA-bfnding protein 

nudear receptor subfamily 2, group C, member 1 

Zic family member 3 (odd-paired Drosophila homoiog, heterotaxy 1) 

gb:zr18a07.s1 Stratagene NT2 neuronal precursor 937230 Homo sapiens cDNA done 

caldum/calmodulin^tependent protein kinase (CaM kinase) 11 delta 

Machado-Joseph disease (spinocerebellar ataxia 3, onvopontocerebeDar ataxia 3. autosomal 

zinc finger protein 216 
hypothetical protein FU20783 

Homo sapiens mRNA; cDNA DKFZp761A0411 (from ctone DKFZp761A0411) 
eukaryotic translation initiation factor 4E 

gb2s38e04.s1 Soares_NhHMPu_S1 Homo sapiens cDNA done IIVIAGE:687486 3' similar to 

hypothetical protein FLJ10038 
spinal cord-derived growth factor-6 
hypothetteal protein DKFZp434N1928 
ESTs 

Homo sapiens cDNA: FU22822 fis, done KA1A3968 
EST 

tumor protein p53-binding protein. 1 

ESTs. Weakly similar to ALUCJiUlVIAN llli ALU CUSS C WARNING ENTRY III [H.sapiensl 
BTB and CNC homology 1, basic leucine zipper transcription factor 2 
ESTs 
ubiquiHn 1 

DKFZP727C091 protein 
mitogen-activated protein kinase kinase 7 
K1AA0141 gene product 
ESTs 

Homo sapiens dDNA FU14208 lis, done NT2RP3003264 
ESTs 
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120529 RC_AA262235 AI434823 

120541 RCJ^27829B W07318 

131445 RC AA278529J NM.014264 

120544 RCJ^A278721 BE548277 

5 120562 RC AA280036 BE244580 

120569 RC_AA2B0648 AA807544 

120571 RC_AA280738 AB037744 

120572 RCJ\A280794 H39599 
129434 RCJVV280837 AW967495 

10 130529 RCJ\A2B0886 AA178953 



120575 RCJVA280934 AW978022 
132635 RCJVA281535 AB020686 
120591 RC_jAA281797„s AF078847 

15 120593 RCJ\A282047 AA748355 
430275 RC_AA283a02 Z11773 
117729 RC AA283709 AA306166 
120609 RC_AA283902 AW978721 
132754 RC AA2B4108 AI752244 

20 130315 RCJ\A284109 A1241084 
132614 RC AA284371 AA284371 
447503 RC.AA284744J AA1 15496 
cds 

135376 RC_AA284784 BE617856 
25 120621 RC_AA2M840 AW961294 
107868 RCJ\A286844 AA286844 
129868 RCJ\A287032 AW172431 
120644 RC J^87038 AI869129 

120660 RCJ^287546 AA286785 
30 135370 RCJ^87553^s BE622187 

120661 RC AA287556 AA287556 
129116 RC^AA287564 AB019494 
131567 RCJ\A291015.s AF015592 
120699 RCJW291716 AI683243 

35 100690 RCJ\A291749_s AA383256 
120726 RC AA293656 AA293555 
120737 RC^AA302430 AL049176 
120745 RCJVA302809 AA302609 
135192 RCJ^A302820.s U83993 

40 120750 RCJ^310499 AI191410 
120761 RCJ\A321890 AA321890 

120768 RCJ\A340569 AA340589 

120769 RCJ\A340622 AI769467 
135232 RCJVA342457J AL038812 

45 CONTAMINATION 

133439 RCJ^342828_s Z23091 
120793 RCJ\A342864 AA342864 
120796 RCJ^342973 AI247356 
120809 RCJ^346495 AA346495 

50 repeat mRNA sequence. 

132459 RC^AA347573 AL1 20071 
120825 RC„AA347614 AI280215 
120827 RC_AA347717 AA382525 
120839 RC_AA348913 AA348913 

5 5 repeat mRNA sequence, 

120850 RCJ\A349647 AA349647 
120852 RC_AA349773 AA349773 
128852 RC_AA350541^s R40622 
135240 RC.AA357159J AA357159 

60 120870 RCJ\A357172.i AA357172 
WARNING 

134637 RC AA369856.S U87309 
120894 RCIaA370132 AA370132 
131854 RC_AA37047^.s AF229839 
65 120897 RC_AA370867 AA370867 
120915 RC AA3772g6 AL135556 

120935 RC.AA383g02 AL04B409 
WARNING 

120936 RCJ\A385934 AA385934 
70 120937 RCJVA386255 AA386255 

120938 RCJ^386260 AA388260 
129722 RCJ^6266 R20855 
120960 RCJVA398014 AA398014 
120985 RCJ^8222 A1219896 
75 120988 RCJ^8235 AA398235 



Hs.104415 ESTs 

H5.240 M-phase phosphoprotein 1 

Hs.172052 serine/threonine kinase 18 

Hs.103104 ESTs 

Hs.302267 hypothetical protein FU10330 

Hs.24970 ESTs. Weakly sMar to B34323 GTP-bnidIng protein Rab2 (H.sapiens] 

Hs.34692 KIAA1323 protein 

Hs.294008 ESTs 

Hs.186644 ESTs 

gb2p39e03.s1 Stralagene muscle 937209 Homo sapiens cDNA ctone 3' similar to contains Ahi 

Hs.238911 hypothetical protein DKFZp762E1511; KIAA1816 protein 

Hs.54037 ectonudeotide pyrophosphatase/phosphodiesterase 4 (putative functton) 

Hs.191356 general transcription factor IIH, polypeptide 2 (44kD subunil) 

Hs.193522 ESTs 

Hs.237786 zinc finger protein 187 

Hs7145 calpain7 

Hs.266076 ESTs, Weakly similar to A46010 X-linked retinopathy protein [H.sapiens] 

Hs.75309 eukaryotic translation elongation factor 2 

Hs.1 54353 nonselective sodium potassium/proton exchanger 

Hs.1 18064 similar to rat nuclear ubiquitous casein kinase 2 

Hs.336898 Homo sapiens, Similar to RIKEN cDNA 1810038N03 gene, done MGC:9890, mRNA. complete 

Hs.99756 mitochondrial ribosome recycling factor 

Hs.143818 hypothetical protein FU23459 

Hs.61260 hypothetical protein FU13164 

Hs.13012 ESTs 

Hs.96616 ESTs 

Hs.99677 ESTs 

Hs.99670 ESTs, Weakly similar to 138022 hypotheUcal protein lH.saplens) 

Hs.263412 ESTs, Weakly similar to ALUB^HUMAN Illl ALU CLASS B WARNING ENTRY !l! [Haptens] 

Hs.225767 IDN3 protein 

Hs.28853 CDC7 (cell division cycle 7, S. cerevislae, homolojg)-like 1 

Hs.97258 ESTs. Moderately simflar to S29539 ribosomal protein L13a. cytosollc [H.saplens] 

Hs.1657 estrogen receptor 1 

Hs.97293 ESTs 

Hs.82223 chordln-fike 

gb£ST10426 Adipose tissue, white I Homo sapiens cDNA 3^ end, nnRNA sequence. 

Hs.321709 purinergbs receptor P2X, l^and^ated ion channel, 4 

Hs.96693 ESTs, l^toderately siiralar to 2109260A B cell growth factor [H.sapiens] 

Hs.1265 branched chain keto add dehydrogenase El, beta polypeplWe (maple synip urine disease) 

Hs.104560 EST 

Hs.96769 ESTs 

Hs.96800 ESTs. Moderately similar to ALU7.HUMAN ALU SUBFAMILY SO SEQUENCE 

Hs.73734 glycoprotein V (plateleQ 

Hs.96812 ESTs 

Hs.96820 ESTs 

gb:EST52657 Fetal heart II Homo sapiens cDNA 3' end similar to EST containing 0 family 

Hs.48998 fibronectin leudne rich transmembrane protein 2 

Hs.96885 ESTs 

Hs.132967 Human EST done 122887 mariner transposon Hsmarl sequence 

gb£ST55442 Infant adrenal gland tl Homo sapiens cDNA 3* end similar to EST containing Alu 

Hs.96927 Homo sapiens cDNA FU12573 fis, done NT2RM4000979 

Hs.191564 ESTs 

Hs.106601 ESTs 

Hs.96986 EST 

Hs.292581 ESTs, Moderately similar to ALU1.HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 

Hs.1 80941 vacuolar protein sorting 41 (yeast homok)g) 

Hs.97063 ESTs 

Hs.173202 l-kappaB-^teracUr^ Ras-Eke protein 1 

Hs.97079 ESTs. Moderately similar to AF174605 1 F-box protein F\sx25 [H^piens] 
Hs.97104 ESTs 

Hs.97177 ESTs. Weakly similar to ALU1J1UMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 

Hs.97184 EST, Highly similar to (defline not available 7499603) [C.elegans] 

Hs.97186 EST 

Hs.104632 EST 

Hs.5422 glycoprot^ MOB 

Hs,104684 EST 

Hs.97592 ESTs 

Hs.97631 ESTs 
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121008 RC_AA398348 
GSSs and a CpG 
121029 RC.AA398482 
RCJ^398504 
RC_AA398505 
RCJVA398507 
RCJVA398523 
RCJVA398625 
RC_AA398632 
RCJ^8633 
RCJ^8894 
CONTAMINATION 
121092 RCJ^8895 
RCJ\A398900 
RCJU398904 
RCJ^9122 
RC_AA399371 
RCJ\A399373 
RC_AA399441 
RCJ^399636 
RC_AA399640 
RC_AA399680 
RC_AA400080 
RC„AA400252 
RC_AA400725 
RC^AA400748 
RC_AA400780 
RC_AA401631 
RC_AA401688 
RCJ\A401695 
RC_AA402227 
RCJVA402329 
RCJ^402398 
RC.AA402449 
RC_AA402468 
RC^AA403268 
RCJ^403314 
RC_AA404229 
RC_AA404260 
RCJ\A404271 
RCJ>iA405026 
RCJ\A405182 
RCJ>kA405237 
contains Alu 
121400 RCJ\A406061 
RC_AA406063 
RC^AA406070 
RC_AA406137 
RC^406335 
RC_AA411801 
RCJ\A411804 
RC_AA411833 
RCJVA412219 
RC.AA412259 
RC_AA412497 
contains L1.t3 LI 
121559 RC^AA412498 
RCJ\A416586 
RCJ\A416867 
RCJ^416874 
RC_AA421133 
RC_AA421138 
RC.AA422079 
RC_AA423837 
RC_AA424328 
RCJ^424339 
RCj\A424469 
RCJ\A424502 
RCJVV425004 
RC_AA425734 
CX3MTAMINATI0N 
121853 RC_AA425887 
121691 RCJW^426456 
121895 RCJVA427396 
similar to GontaBis 
121899 RCJVA427555 



AA398348 Hs.301 720 Human DNA sequence from clone RP1 1-251 J8 on chromosome 13 Contains ESTs. STSs. 



121032 
121033 
121034 
121035 
121058 
121060 
121061 
121091 



121094 
121096 
121115 
121121 
121122 
121125 
121151 
121153 
121163 
121176 
121192 
121223 
121227 
121231 
121278 
121279 
121282 
121299 
121301 
121302 
121304 
121305 
134721 
121323 
121324 
129047 
131074 
121344 
121348 
121350 



121402 
121403 
121408 
121431 
132936 
121471 
121474 
121526 
. 121530 
121558 



121584 
121609 
121612 
121737 
121740 
129194 
121784 
121802 
121803 
135286 
121806 
129517 
121845 



AA398482 
AA393037 
AA398505 
AL389951 
AA398523 
AA398625 
AA398632 
AA393288 
AA398894 

AA398895 
AA402505 
AA398904 
AA398187 
AA399371 
AI126713 
AL042981 
AA399636 
AA399640 
AI676062 
AL121523 
AA400262 
AI002110 
AA400748 
AA814948 
AA037121 
AA292873 
AA401695 
AA402227 
NIVL006202 
AA402587 
AA293863 
AA402468 
s AK000112 
AA291411 
AA404229 
AI768623 
U16125 
AA405026 
AA405182 
AA405237 

AA406061 

AA406063 

AA406070 

AA406137 

AA035279 

AL120659 

AA411804 

AA402335 

AW665325 

AA778658 

AA412497 

AI192044 

A1024471 

AA416867 

AA416874 

AA421133 

AA421138 

. AA150797 
T90789 
AI251870 
AI338371 

s AW023482 
AA424313 
AW972853 
AI732692 

AA425887 
AA426456 
AA427396 

R55341 



Hs.97641 EST 

Hs.161798 ESTs 

Hs.97360 ESTs 

Hs.271623 nucleoporinSOkD 

Hs.210579 ESTs 

Hs.97391 ESTs 

Hs.97395 ESTs 

Hs.97396 ESTs 

Hs.97657 ESTs. Moderately simitar to ALU8JHUMAN ALU SUBFAMILY SX SEQUENCE 

Hs.97658 EST 

gb2t62h10.r1 Soaresjestis.NHT Homo sapiens cDNA done 5\ mRNA sequence 

Hs.3326g0 ESTs 

Hs.104682 ESTs. Weatdy similar to mitochondrial citrate transport protein [H^apiens] 

Hs.189095 similar to SALL1 (sal (On)sophiia)4ike 

Hs.192233 ESTs. Highly sM3rtoT00337 hypothetical protein KIAA0568 [H.sapiens] 

Hs.251278 KIAA1201 protein 

Hs.143629 ESTs 

Hs.97694 ESTs 

Hs,111902 ESTs 

Hs.97774 ESTs 

Hs.190093 ESTs 

Hs.97169 ESTs. Weakly similar to dJ667H12.2.1 [H.sapiens] 

Hs.97823 Homo sapiens mRNA; cDNA DKFZp434D024 (from clone DKFZp434D024) 

Hs.96343 ESTs. Weakly similar to ALUC.HUMAN !!ll ALU CLASS C WARNING ENTRY II! [H.saF»ens] 

Hs.98518 Homo sapiens cDNA FU1 1490 lis. done HEMBA1001918 

Hs.177996 ESTs 

Hs.97334 ESTs 

Hs.22826 tropomodulin 3 (ubiquitous) 

Hs.89901 phosphodiesterase 4A, cAMP-specific (dunce (Drosophila)-homolog phosphodiesterase E2) 

Hs.325520 UT1-3TM protein 

Hs.97316 EST 

Hs.291557 ESTs 

Hs.89306 hypothetical protein FU20105 

Hs.97247 ESTs 

Hs.97842 EST 

Hs.108264 ESTs 

Hs.181581 glutamate receptor, lonotroplc. itainate 1 

Hs.193754 ESTs 

Hs 97973 ESTs 

gb:zt06e10^1 NCLCGAP.GCBI Homo sapiens dDNA done IMAGE:712362 3' simitar to 

Hs.98001 EST 

Hs.98003 ESTs 

Hs.98004 EST 

Hs.98019 EST 

Hs.176731 ESTs 

Hs.61 1 1 aryl-hydrocarfaon receptor nudear translocator 2 

Hs.261575 ESTs 

Hs.188760 ESTs. Highly similarto Trad [H^aplens] 

Hs.98120 ESTs 

Hs.98122 ESTs 

gb:zl95g12.s1 Soares_testis_NHT Homo sapiens cDNA clone IMAGE:730150 3' similarto 

Hs.104778 ESTs 

Hs.98232 ESTs 

Hs.98185 EST 

Hs.98168 ESTs 

Hs.1 04671 erythrocyte transmembrane protein 

Hs.98334 EST 

Hs.109276 latexin protein 

Hs.94308 RAB35. member RAS oncogene family 

Hs.188898 ESTs 

Hs.157173 ESTs 

Hs.97849 ESTs 

Hs.98402 ESTs 

Hs.112237 ESTs 

Hs.165066 ESTs. Moderately similar to ALU2.HUMAN ALU SUBFAMILY SB SEQUENCE 

Ks.98502 hypothetical protein FU14303 

Hs.96469 ESTs 

gb'.zw33a02.s1 Soares ovary tumor NbHOT Homo sapiens cDNA done IMAG£:771050 3* 

Hs.50421 iaAAQ203geneprodud 
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121917 RC_AA428218 AA406397 

121918 RCJ^428242 BE274689 

121919 RCJVA426281 AA428261 
121941 RCJ\A428865 AA428865 

5 121942 RCJ^428994 AW452701 
121970 RC_AA429666 AA429866 
121993 RC_AA430181 AW297660 
134660 RCJ\A43D184_s U73524 
126753 RCJVA431288_s AA306478 
10 122022 RCJ\A431293 AA431293 

122050 RCJ^431478 

122051 RCJ^431492 
122055 RCJ\A431732 
122105 RC_AA432278 

15 122125 RCJVA434411 

135235 RCJ<A435512J AW298244 

122162 • RC^AA435698 AA628233 

129406 RC_AA435711 AB018255 

318801 RC_AA435815^s U40763 
20 122186 RC_AA435842 AA398811 

122235 RC_AA436475 

129131 RC_AA436489 

134664 RC^AA442060 

122310 RCJVK442079 
25 122334 RC_AA443151 

122382 RCJ\A446133 

122425 RCJ^447145 

122431 RCJ\A447398 

122450 RCJVA447643 
30 302653 RCJ\A447742„8 AJ404468 

122477 RCJ\A448226 AA448226 

122500 RC_AA448825 

122522 RC_AA449444 

122536 RCJ^450087 
35 122538 RCJ\A450211 

122540 RCJ^450244 

122560 RCJ\A452123 

421919 RCJ^A452155 

122562 RCJ\A452156 
40 mRNA 

122585 RCJ\A453036 

122608 RCJ\A453526 

122635 RC_AA454085 

similar to 
45 122636 RCJ^A454103 

122653 RC_AA454642 

122660 RCJ^454935 

122703 RC_AA456323 

122724 RCJVA457395 
50 122749 RC_AA458850 

122772 RCJ^A459662 

131098 RC_AA459568 

129045 RCJ\A459679.s A1082883 

122777 RC_AA459702 AK001022 
55 135362 RC^AA460017J AA97812B 

122798 RC.AA460324 AW366286 

122837 RC_AA461509 AA461509 

122860 RCJVA4M414J AA464414 

mRNA sequence. 
60 122861 RCJikA454428 AA335721 

122910 RCJ\A470084 AA470084 

132899 RCJSJ^476606_S AA476606 

122967 RCJ^478521 AA808187 

129560 RCJ\A476523 
65 123009 RCJW\479949 

128917 RCJ^461252 

123081 RCJM^485351 

123133 RCJVA487264 

123184 RCJ^489072 
70 129671 RCJ\A489630 

123233 RCJM490225 
^.sapiens] 

123234 RCJ\A490227 
123236 RCJ^490255 

75 123255 RCJ\A490890 



AI453076 

AA431492 

AA431732 

AW241685 

AK000492 



AA435475 
AB026436 
AA256106 
AW192803 
BE465894 
AA446440 
AB007859 
AA447398 
AA447M3 



AA448825 

AA299607 

AF060877 

AA450211 

AA476741 

AW392342 

AJ224901 

AA452156 

At681654 

AA453525 

AA454085 

AW651706 

AW009166 

A1816827 

AA456323 

AA457395 

AA458850 

AW1 17452 

U66669 



Hs.98038 

Hs.184175 

Hs.98560 

Hs.98563 

Hs.293237 

Hs.98617 

Hs.98661 

H$.87465 

Hs.95327 

Hs.98716 

Hs.166109 

Hs.98742 

Hs.98747 

Hs.98699 

Hs.9d606 

Hs.293507 

Hs.79946 

Hs.111138 

Hs.77965 

Hs.104673 

Hs.112227 

Hs.177534 

Hs.87507 

Hs.98974 

Hs.98365 

Hs.98643 

Hs.100955 

Hs.99104 

Hs.112095 

Hs.284259 

Hs.324123 

Hs.99190 

Hs.98969 

Hs.99236 

Hs.99239 

Hs.98279 

Hs.283077 

Hs.109526 



Hs.170737 
Hs.143077 



Hs.99519 

Hs.99376 

Hs.1 80069 

Hs,269369 

Hs.99457 

Hs.293372 

Hs.99489 

Hs.236642 

Hs.30732 

Hs.214397 

Hs.99513 

Hs.145696 

Hs.2g3565 



Hs.119394 
Hs.98358 



AA3i7841 
AA535244 
AI365215 
At815486 
AA487264 
BE247767 
NM.014700 
AW974175 

NM_001938 
AW968504 
AA830335 



129503 RCJ\A490916^s AW768399 



Hs.289101 

Hs.7845 

Hs.78305 

HS208097 

Hs.243901 

Hs.154974 

Hs.18166 

Hs.119004 

Hs.188751 

Hs.16697 
Hs. 123073 
Hs.105273 
Hs.1 12157 



ESTs 

chromosome 2 open reading Irame 3 

EST 

ESTs 

ESTs 

EST 

ESTs 

ATP/GTP-binding protein 

CD3D antigen, delta polypeptide (TiT3 complex) 

ESTs. Moderately similar to T42650 hypothetical protein DKFZp434D0215.1 IH.sapiens) 

ELAV (embryonic lethal, abnormal vision. DrosophilaHHce 2 

EST 

EST 

ESTs 

hypothetical protein 
ESTs 

cytochrome P450. subfamily XIX (aromatization of androgens) 

KIAA0712 gene product 

peptidyl-prolyl isomeiase G (cydophifin G) 

ESTs 

membrane-associated nucleic acid binding prote&i 

dual specificity phosphatase 10 

ESTs 

ESTs. Weakly similar to S65824 reverse transcriptase homolog [H.sapiens] 

ESTs. Weakly similar to LB4D J^UIWAN NADP-DEPENDENT LEUKOTRIENE B4 12- 

ESTs 

KIAA0399 protein 
ESTs 

hypothetical protein DKFZp434F1819 

dynein. axonemal, heavy polypeptide 9 

ESTs 

ESTs 

ESTs 

regulator of G-protein signalling 20 
ESTs 

ESTs. Weakly similar to A43932 mudn 2 precursor, Intestinal [H.sapien5] 
centrosomat P4.1-assocIa(ed protein; uncharaderized bone marrow protein BM032 
zinc finger protein 198 

gb2x29o03.s1 SoaresJotaLfetus.Nb2HF8_9w Homo sapiens cDNA done IIVIAGE:787876 3*. 

hypothetical protein FU23251 
ESTs 

gb-^33a08.s1 Soares_totaLfetus«Nb2HF8_9w Homo sapiens cDNA done »^GE:788246 3' 

hypothetical protein FLJ14007 
ESTs 

nudear respiratory factor 1 

ESTs 

ESTs 

ESTs. Weakly similar to B34087 hypothetteal protein [H^apiens] 
ESTs 

3-hydroxytsobutyryl-Coenzyme A hydrolase 

hypothetical protein FU13409: K1AA1711 protein 

hypothetical protein FLJ10160 similar to Insulin related protein 2 

ESTs. Weak^ similar to T17454 diaphanous-related fonmin - mouse [iVl.musculus] 

splicing factor (CC1.3) 

ESTs. Weakly similar to putative p150 [H.sapiens] 

gb2x78gOU1 Soares ovary tumor NbHOT Homo sapiens cDNA done IMAGE:809904 3*. 

ESTs 
ESTs 

SMAD in the antisense orientation 
glucose regulated protein, 58kD 
hypothetical protein M6C2752 
RAB2. member RAS oncogene family 
oncogene TC21 

Homo sapiens d)NA FU20738 fis, done HEP08257 

Homo sapiens mRNA; cDNA DKFZp667N084 (from done DKFZp667N064) 

K1AA0870 protein 

KIAA0665 gene product 

ESTs. Weakly similar to MAPBJiUMAN MICROTUBULE-ASSOCIATED PROTEIN IB 

down-regulator of transcription 1, TBP-bindIng (negative cofoctor 2) 

CDC2-re!ated protein kinase 7 

ESTs 

ESTs 
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131043 RC^AA490925 
123259 RCJ\A490955 

[H. sapiens) 

123284 RC_AA495812 
123286 RC_AA495824 
123315 RCJ\A496369 
to contains 

129179 RC_AA5(M125^s 
131612 RCJVA521473 
123421 RCJ\A598440 
123449 RC AA598899J 
123021 RcIm599244 
RCJ\A599694.s 
RCJU600037 
RC_AAe09135 
RCJVA609582 
RC_AA609684 
RCJ^609839 



132830 
123497 
123604 
129539 
123712 
123731 
similar to 
130725 RCJ\A609862 
123800 RC_AA620423 
123841 RCJ\A620747 
123929 RC^AA621364 
123978 RC_C20653 
133184 RC_D20085 
132835 RC D20749 
132406 RC_D51285_s 
128695 RC_D59972.i 
124028 RC.F04112J 
sequence. 

124057 RC_F13604 
134899 RC^H01662 
130973 RC_H05135 i 
124105 RC_H12245 
124136 RC^H22842 
124165 RC„H30894 
131229 RC„H4344^s 
124178 RC^H45996 
129948 RC.H89281J 
134374 RC„H69485J 
124254 RC.H6g899 
similar to 

129056 RC_H70627.s 
100919 RC_H73050.s 
130724 RC^H73260 
100716 RC.H77531,S 
124274 RC^H80552 
129078 RC^H80737^s 
124828 RC„H93412 
124315 RC_H9489^s 
100747 RC^H95643_s 
124324 RC_H96552 
452933 RC^H97146 
132231 RC.H99131_S 
129170 RC^H99462^s 
133143 RC H99837_s 
132963 RC_N22140 
135297 RC_N22197 
134347 RC_N23756_s 
130365 RC^N24134 
421642 RC^N24195 
439311 RC„N26739 
124383 RC^N27098 
124387 RC^N27637 
129341 RC_N33090 
129081 RC_N35967 
102827 RC_.N38959_f 
124433 RC.N3g069 
124441 RC.1^6441 
132338 Ra.N48270J 
131403 RC_N48365_s 
124466 RC.N51316 
132210 RC_N51499^s 

124483 RC.N53976 

124484 RC_N54157 

124485 RC.N54300 



AF084535 Hs.22464 epilepsy, progressive myocionus type 2, Lafora disease (laforin) 

AI744152 Hs^83374 ESTs. Weakly similar to CA15_HUMAN COLUGEN ALPHA 1(V) CHAIN PRECURSOR 

AA48B988 Hs.293796 ESTs 

AA495824 Hs.188822 ESTs. Weakly similar to A46010 X-ilnked retinopathy protein [H.sapiens] 

AA496369 gb:zv37d10.s1 Soares ovary tumor NbHOT Homo sapiens cDNA done 1MAGE:75S827 3* simHar 

AW969025 Hs.1091S4 ESTs 

AU076668 Hs.334884 SEC10 (S. cerevisiaeH'ke 1 

AA598440 Hs2911S4 EST, Weakly simitar to 138022 hypothetical protein [H^apiens] 

AL049325 Hs.112493 Homo sapiens mRNA; cDNA DKFZp564D036 (from done DKFZp564D036) 

AL044675 Hs.173081 KIAA0530 protein 

NIVL014777 Hs*57730 KIAA01 33 gene product 

AA765256 Hs.135191 ESTs, Weakly similar to unnamed protein product [H^apiens] 

AA609135 Hs^076 ESTs 

T47614 Hs.323022 ESTs. Highly simitar to p60 katanin [H.sapiens] 

AA609684 Hs.1 12748 Homo sapiens cDI4A: FU21543 fis. done COU)6171 

AA609839 gb:ae62f01 .s1 Stratagene lung cardnoma 937218 Homo sapiens dDNA done IMAGE:951481 3' 

T98807 Hs.80248 RNA-brnding protein gene with multiple splksing 

AA620423 Hs.112862 EST 

AA620747 Hs.112896 ESTs 

AA621364 Hs.1 12981 ESTs 

T89832 Hs.1 70278 ESTs 

AA001021 Hs.6685 thyroid homione receptor interactor 8 

Z83844 Hs.5790 hypothetical protein dJ37E1 6.5 

AL1 33731 Hs.4774 Homo sapiens mRNA; cDNA DKFZp761C1712 (from done DKFZp761C1712) 

NIVL003478 Hs.101299 cullinS 

F041 12 gb:HSC2JH062 normalized infant brain cDNA Homo sapiens cDNA done c-2jh06 3'. mRNA 

AA902384 Hs73853 bone morphogenetic protein 2 

A1609045 Hs.321775 hypothetical protein DKFZp434D1428 

AI6384 18 Hs.78580 DEAD/H (Asp-Glu-AIa^sp/His) box polypeptide 1 

H1 2245 gb:ym17a12.r1 Soares infant brain 1NIB Homo sapiens cONA done 3', mRNA sequence 

H22842 Hs.101770 EST 

H30039 Hs.107674 ESTs 

NM_015340 Hs.2450 leucyl-tRNA synthetase, mitochondrial 

BE463721 Hs.97101 putative G protein-coupied receptor 

AI537162 Hs.263988 ESTs > 

N22687 H5.8236 ESTs 

H69899 gb:yu70c12^1 Weizmann Olfactory Epithelium Homo sapiens cDNA done IMAGE*.239158 3* 

AI769958 Hs.108336 ESTs. Weakly similar to ALUEJ^UMAN IIII ALU CUSS E WARNING ENTRY III [H.sapfen5] 

X54534 Hs.278994 Rhesus blood group. CcEe antigens 

AK001507 Hs;306084 . Homo sapiens done FUB6914 PR01821 mRNA. complete cds 

X89887 Hs.172350 HIR (histone cell cyde regulation defective. S. cerevlsiae) homolog A 

H80552 Hs.102249 EST 

AI351010 Hs.102267 lysosomal 

AW952124 Hs.1 3094 presenilins associated rhomboid-like protein 

NI^005402 Hs.288757 v-ral simian leukemia viral oncogene homolog A (ras related) 

X04588 Hs.85844 neurotrophic tyrosine kinase, receptor, type 1 

H96552 Hs.159472 Homo sapiens cONA: FU22224 fis, done HRC01703 

AW391423 Hs,288555 Homo sapiens cDNA: RJ22425 fis, done HRC08686 

AA662910 Hs.42635 hypothetical protein DKFZp434K2435 

AW250380 Hs.109059 mitochondrial ribosomal protein L12 

AA094538 Hs.272808 putative transcription legulatton nudear protein; K1AA1669 protein 

AA0g9693 Hs.34851 epsilon-tubulin 

AL1 18782 HS.3C0208 Sec23-intBfacting protein p125 

API 641 42 Hs.82042 solute carrier family 23 (nudeobase transporters), member 1 

W561 19 Hs.1 55103 eukaryotic translation initiation factor 1 A. Y chromosome 

API 72066 Hs.1 08346 retinok: add repressible protein 

BE270668 Hs.151945 mitochondrial ribosomal protein L43 

N27Q98 Hs.102463 EST 

N27637 Hs.109019 ESTs 

AI193519 H&226396 hypothetical protein PU 11126 

AI364933 Hs.168913 serine/threonine kinase 24 (Ste20, yeast homolog) 

BE244588 Hs.6456 chaperonin containing TCP1 . subunit 2 (beta) 

A/^80319 Hs.288840 PR01 575 protein 

AW450481 Hs.161333 ESTs 

AA353868 Hs.1^982 go|gin-67 

AI473114 Hs.26455 ESTs 

R10084 Hs.1 13319 kinesin heavy chain member 2 

NIVL007203 Hs.42322 A kinase (PRKA) anchor protein 2 

AI821780 Hs.179864 ESTs 

H661 18 Hs.285520 ESTs. Weakly similar to 210926QA B cell growth factor [H.sapiens] 

AB040933 Hs.15420 K1AA1500 pmtein 
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N54831 Hs.271381 ESTs, Weakly similar to 138022 hypothetical protein [H-sapiens] 

N59849 Hs.13565 Sam68-!ike phosphotyroslne protein. T-STAR 

N79264 Hs.269104 ESTs 

N62375 Hs.102731 EST 

AA903424 Hs.6786 ESTs 

D54120 Hs.146409 cell division cycle 42 (GTP-binding protein, 25kD) 

AI301740 Hs.173381 dihydropyrimidinase-like 2 

BE296313 Hs.265592 ESTs, Weakly similar to 138022 hypothetical protein [H.sapiensJ 

N68168 gbizal 1cOU1 Scares fetal liver spleen 1NFLS Homo sapiens cDNA done S", mRNA sequence 

N68201 H5.269124 ESTs, Weakly similar to 138022 hypothetical protein [H.sapiens] 

N68300 gb:za12g07^1 Scares fetal Uver spleen 1NFLS Homo sapiens cDNA done IMAGE:292380 3\ 

N68321 Hs.231500 EST 

N69575 Hs.102788 ESTs 

AL133572 Hs.199009 protein containing CXXC domain 2 

AI680737 Hs.289068 Homo sapiens cDNA FU11918 fo, done HEMBB1000272 

T78277 Hs.100293 O-iinked N-acetylgluoosamine (GlcNAc) transferase (UDP-N-acetylglucosamlne:polypeplide-N- 

AW582962 Hs.102897 CGM7 protein 

W19407 Hs.3862 regulator of nonsense transcripts 2; DKFZP434D222 protein 

AB002316 Hs.65746 K1AA031 8 protein 

AK001357 H5.102951 Homo sapiens cDNA FU10495 fis, done NT2RP2000297. moderately sMar to ZINC FINGER 

AA464836 Hs.291079 ESTs^ Weakly similar to T27173 hypothetical protein Y54G11A.9 - Caenorhabditis elegans 

J00077 Hs.155421 alpha-fetoprotein 

R05283 gb:ye91o08^1 Scares fetal Bver spleen 1NFLS Honrn sapiens cDNA clone IMAGE:125102 3* 

T97733 Hs.185685 ESTs 

R23053 gb7h31a05.r1 Soares placenta Nb2HP Homo sapiens cDNA done 5' similar to contains LI 

RC_R23930^s AL137586 Hs.52763 ar^phase-promoting complex subunlt 7 

AI248173 Hs.191460 hypothetical protein MGC12936 

AW162919 Hs.170160 RAB2. member RAS oncogene family^ltke 

AL134275 Hs,6434 hypothetical protein DKFZp761F201 4 

H11368 Hs.141055 Homo sapiens clone 23758 mRNA sequence 

AA553722 Hs.92096 ESTs, Moderately similar to A46010 X-linked retinopathy protein [H.sapiens] 

R45154 Hs.106604 ESTs 

R41179 Hs.97393 KIAA0328 protein 

AW293165 Hs.143134 ESTs 

NM_004427 Hs. 1 652S3 early development regulator 2 (homoJog of poJyhomeotlc 2) 

AW297929 Hs.328317 EST 

W38537 Hs^80740 hypothetical protein MGC3040 

R44357 HS.4B712 hypothetical protein FU20736 

R44519 gb7g24h04.s1 Soares infant brain 1NIB Homo sapiens cDNA done IMAGE:33350 3', mRNA 

R45088 gb:yg38g04.s1 Soares infant brain 1N1B Homo sapiens cDNA done IMAG£:34896 3\ mRNA 

R47948 Hs.188732 ESTs 

H87832 Hs.7388 keich (DrosophiiaHike 3 

AW966158 Hs.58582 Homo sapiens cDNA FU12789 fis. done NT2RP2001947 

R55241 Hs.101214 EST 

R59585 Hs.101255 ESTs 

W07701 Hs.304177 Homo sapiens done FLB8503 PR02286 niRNA. confv»tetB cds 

BE561430 Hs.239388 Human DNA sequence from done RP1-304B14 on chromosome 6. Contains a gene for a novel 



124661 RC.R66690 
130141 RC„R67266_s 
124879 RC_R7358B 
124892 RC.R79403 
124906 RC_R87647 
124922 RC.R93622 

124940 RC.R99599_s 

124941 RC.R99612 
124943 RC.T0288B 
WARNING ENTRY [H.sapiensl 
124947 RC_T03170 
124954 RC.T10465 
132924 RC_T15418_f 
133113 RC_T15597 f 
132975 RC.T1565^I 
133235 RC_T16898 s 
131082 RC_T26644J 
124980 RC.T40841 
124984 RC_T47566 i 
124991 RC.T50116 



R67567 Hs.107110 ESTs 

NM.004455 Hs.150956 exostoses (multipleHike 1 

R73588 Hs.101533 ESTs 

A1970003 Hs.23756 hypothetical proteirislrnilar to svvineacylneuraminate lyase 

H75964 Hs.107815 ESTs 

R93622 Hs.12163 eukaryotic translation initiation fador 2, subunit 2 (beta, 38kD ) 

AF068846 H3.103604 heterogeneous nudear ribonudeoprotein U (scaffold attachment factor A) 

AI76G681 Hs.27774 ESTs. Highly similar to AF161349 1 HSPC086 IH.sapiens] 

AW983279 Hs.123373 ESTs. Weakly simHar to ALU1J1UMAN ALU SUBFAMILY J SEQUENCE CONTAI^INATION 

T03170 Hs.100165 ESTs 

AW964237 Hs.6728 KIAA1548 protein 

U55184 Hs.154145 hypothetical protein FU11585 

BE383768 Hs.65238 95 kDa retinoblastoma protein binding protein; K1AA0661 gene produd 

R43504 Hs.6181 ESTs 

AW960782 Hs.6856 ash2 (absent small, or homeotic. Drosophila* homolog)-like 

At091121 Hs.246218 Homo sapiens cDNA: FU21781 fls, done HEP00223 

T40841 Hs,98681 ESTs 

BE313210 Hs.223241 eukaryotk; transtetion elongation factor 1 delta (guanine nucleotide exchange protein) 

T501 16 gb:yb77c10.s1 Stratagene ovary (937217) Homo sapiens cDNA done IMAGE:77202 3* similar 



to similar to SP:VE22 iMABD P03756 EA22 GENE , mRNA sequence. 
129475 RC_T50145j3 NI\/L004477 Hs.203772 FSHD region gene 1 



99 



3NSDOCID: <WO ^02079492A2_L: 



wo 02/079492 



PCT/US02/04915 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



125000 RC T58615 
132932 RC_T59940_f 
129534 RC^T63595 

125008 RCjr64891 

125009 RC^T64924 
132940 RC,T64933.r 

125017 RC_T68875 
sequence. 

125018 RC.T69027 
125020 RC^T69924 
129891 RC.770353 
134204 RC.T79780_s 
125050 RC.T79951 
125052 RC^T80174_s 
125054 RC.T80622 
125063 RC.T85352 



T58615 

AW1 18826 

AK002126 

T91251 

T64924 

T79136 

T68875 



Hs.1 10640 

Hs.6093 

Hs.11260 

Hs.303046 
Hs.127243 



ESTs 

Homo sapiens cONA: FU22783 fis. done KAIA1993 
hypothetical protein FLJ11264 

gb:yd60a10.s1 Soaies fetal liver spleen 1NR.S Homo sapiens cDNA clone 3^. mRNA sequence 
ESTs 

Homo sapiens mRNA for KIAA1724 protein, partial cds 

gb:yc30f05.s1 Stratagene liver (937224) Homo sapiens cDNA clone IMAGE:82209 3', mRNA 



T69027 Hs.57475 sex comb on midleg homotog 1 

T69981 gb:yc19d03.r1 Stratagene lung (937210) Homo sapiens cDNA done 5*. mRNA sequence 

Aia84813 Hs.13197 ESTs 
AI873257 Hs.7994 hypothetical protein FU20551 
AW970209 Hs.1t1805 ESTs 

T85104 Hs.222779 ESTs, Moderately simitar to similar to NEOD-4 [H.sapiens] 
T80622 Hs.268601 ESTs, Weakly similar to envelope [H.sapiens] 

T85352 gb:yd82(f01^1 Scares fetal llverspteen 1NFLS Homo sapiens cDNA done IMAGE:H4721 y 

similar to contains Alu repetitive element:contatns LI repetitive element u mRNA sequence. 

125064 RC^T85373 T85373 gb:yd82ID7.s1 Scares fetal Twer spleen 1NFLS Homo sapiens cDNA done IMAGE:1 14757 3* 

similar to contains Alu repetitive element;contaHi5 MER3 repetitive element mRNA sequence. 

125066 RC^T86284 T86284 gb:yd77b07.s1 Scares fetalBver spleen 1NFLS Homo sapiens cDNA done 3' similar to contains 

Alu repetitive element, mRNA sequence 
112264 RC^T89579^s AL045364 Hs.79353 
125080 RC^T90360 T90360 Hs.268620 
WARNING ENTRY ^sapiens) 

125097 RCjr94328J AW576389 Hs.335774 EST, Moderately similar to S65657 alpha-IC-adrenergic receptor splice forni 2 [H.sapiens) 
125104 RC^T95590 T95590 gb:ye40a03.s1 Scares fetal liver spleen 1NFLS Homo sapiens cDNA done 3' similar to 

gb|M10817|IGURRAA Iguana iguana 5S (rRNA );, mRNA sequence 

135107 RCjr97257 J T97257 Hs.337531 ESTs, Moderately similar to 138022 hypothetical protein [H.sapiens) 
129550 RC^T97599J AA845462 Hs.124024 dellex (Drosophila) homolog 1 

125118 RC.T97620 R10606 gb:yf35f11.sl Scares fetal Twer spleen 1NFLS Homo sapiens cDNA done IMAGE:128877 3' 

similar to contains Alu repetitive element;, mRNA sequence. 



transcription fector Dp-1 

ESTs, Highly similar to ALU6_HUMAN ALU SUBFAMILY SP SEQUENCE CONTAMINATION 



125120 RC.T97775 
134160 RC^T98152 
125136 RC W31479 
125144 RC^W37999 
125150 RC_W38240 
104180 RC^W40150 
131987 RC_W45435 
125178 RC_W58202 
125180 RC.W58344 
125182 RC_W58650 
130588 RC_W68736 
125197 RC.W69106 
133497 RC_W69111 
100562 RC^W59385.s 
125639 RC_W69399^s 
129232 RC_W69459 
101495 RC_W72424 
125209 RC_W72724 
125212 RC.W72834 
129132 RC^W73955 
125223 RC_W74701 



T97775 
T98152 
AW962364 
AB037742 
W38240 
AA247778 
AW453069 
W93127 
W56469 
AA451755 
AL030996 
AF08S270 
BE617303 
NM.006185 
Z97630 
R98881 
W72424 
W72724 
AA746225 
BE383436 
AI916269 



WARNING ENTRY [H.sapiens] 



125225 RC_W76540 
125228 RC_W79397 
132393 RC_W85888 
125238 RC_W86038 
125247 RC^W86881 
129296 RC^W87804 
125263 RC_W88942 
125266 RC_W90022 
PRECURSOR [H.sapiens] 
131321 RC_W92272 
131601 RC^W92764^s 
131677 RC„W93040 
120837 RC.W93092 

125277 RC„W93227 

125278 RC.W93523 
125280 RC_W93659 
131856 RC.W94003.S 
131844 RCJW94401_s 
125284 RC.W94688 
313447 RC_W94787^s 
130799 RC.Z38294^s 
125289 RC.Z38311 
128874 RC_Z38465.s 



W74169 
AA033982 
AL1 35094 
N99713 
AA694191 
AI051967 
AA098878 
W90022 

U91543 

NM-007115 

H05317 

BE14g656 

W93227 

AI216439 

AI123705 

W93949 

Ai419294 

NIVL002666 

AW016321 

AB028945 

T34530 

H06245 



Hs,100717 
Hs.79432 
Hs.129051 
Hs.24336 

Hs.119155 

Hs.3657 

Hs.31845 

Hs.103120 

Hs.263560 

Hs.16411 

Hs.278554 

Hs.74266 

Hs.301512 

Hs.226117 

Hs.109655 

Hs.112405 

Hs.103174 

Hs.103173 

Hs.108847 

Hs.109057 

Hs.16492 
Hs.1 10059 
Hs.47334 
Hs.109514 
Hs.163914 
Hs.1 10122 

Hs.186809 

Hs.25601 

Hs.29352 

Hs.283549 

H&308621 

Hs.103245 

Hs.129998 

Hs.106932 

Hs.33245 

Hs.324342 

Hs.103253 

HS.823Q6 

Hs.12696 

Hs.4210 

Hs.106801 



EST 

fibrillin 2 (congenital contradural arachnodactyly) 
ESTs 

KIAA1321 protein 

Empirically seleded from AFFX single probeset 

Homo sapiens mRNA full length Insert cDNA done EUROIMAGE 814975 

activity-dependent neuroprotecGve protein 

ESTs 

ESTs 

ESTs 

hypothetical protein LOC57187 . 

heten3chromatIn4ike protein 1 

hypothetical protein MGC4251 

nuclear mitotic apparatus protein 1 

HI histone femily, member 0 

sex comb on midleg (Drosophila)-like 1 

S100 caicium-binding protein A9 (calgranulin B) 

ESTs. Weakly similar to TSP2.HUMAN THROMBOSPONDIN 2 PRECURSOR {H.saplensl 
ESTs 

hypothetical protein MGC2749 

ESTs. Weakly similar to ALU5.HUMAN ALU SUBFAMILY SC SEQUENCE CONTAMINATION 
DKFZP564G2022 protein 

ESTs, Weakly similar to 138022 hypothetical protein (H.sapiens] 

hypothetical protein FLI14495 

ESTs 

ESTs 

ESTs 

gb:zn45g10.r1 Stratagene HeLa cell s3 937216 Homo sapiens cDNA done 5*. mRNA sequence 
ESTs, Highly similar to LCT2_HUMAN LEUKOCYTE CEa-DERIVED CHEMOTAXIN 2 

chromodomain helicase DNA binding protein 3 
tumor necrosis fector. alpha-induced protein 6 
ESTs 

Homo sapiens dDNA FU1 1963 fis. done HEMBB1001051 
EST 

enhancer of polycomb 1 

ESTs 

ESTs 

ESTs 



destrin (acOn depolymerfzing fector) 

cortactin SH3 domain-binding protein 

Homo sapiens cDNA FU13069 fis. done NT2RP3001752 

ESTs, Weakly similar to PC4259 femtin assodated protdn [H^tens) 
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130966 RC.Z38525„s 
128875 RC_Z38538J 
133200 RC.^51.S 
130158 RC„Z38783.s 
125295 RC_Z39113 
domain, (semaphorin)4F 
125298 RC^9255J 
125300 RCJ39591 
323122 RC_Z39783^s 
RC.Z39920 
RC_Z40166J 
RC_Z4038a„s 
RC^Z40646 
RC_Z41697 
RC_Z99349 
RC.Z99394_s 
RC_AA027168 
O58024_S 
RC_AA251113 
132459 RCJU347573 
101545 M31210 
CGI 527 
RC„N62948_s 
RC_W42674 
RC^W43000_s 
RC N3175a_s 
AF009301 
L37347 
M99564 
RC^AA430673 
RC_AA251297 
RCJ\A620722 
132173 RC.W46577.S 
113332 RC,W81237 
114452 RCJ>iA020825 
PROTEIN TC10 
115243 RCJ>iA27B766 
134403 RCJ93708_S 
129647 RC.N49394 
111428 RC_H56559„s 
115987 RCJ\A446887 
120726 RC_AA293656 
114995 RCJ^251152 
303876 RCJ\A233334. 
dominant, ataxin 3) 
311463 RC_Z39920 
RCJ\A192173 
RCJ^455D44 
RCJ\A398504 
U41813 
RCJ\A166965 
RCJ^A398222 
RC.Z39095 
RCJ^A284744. 



311463 
130882 
128888 
125310 
125315 
125317 
135098 
104785 
132837 
120456 



133505 
132360 
132738 
119586 
129914 
130839 
132813 
134342 
131878 
105426 
132968 



120302 
133071 
121032 
129829 
120245 
120985 
114184 
447503 
cds 

132837 
121034 
119718 
120455 



RC.AA428201 
RCJ\A398507 
RC_W69216 
RCJ\A251083 
125260 RC_W93659 
132155 RCJVA227903 
120609 RCJ\A283902 
121278 RCJ^401631 
109023 RCJ\A157293 
129815 RCJ)60208J 
108061 RCJ^043979 
113287 RC_T66847 
114082 RC_Z38239 
116334 RCJ^491457 
131486 RC_Z40071^s 
107860 RCJ\A024961 
131263 RC_AA443826 
132207 RCJ^443294 
129183 RCJ^155743 
408431 RC«T23708 
120575 RCJ^280934 



AW971018 Hs.21659 ESTs 

AB040923 Hs.106808 kelch (DrosophUa^like 1 

AB037715 Hs.183639 hypothetical protein FU10210 

AB032947 Hs.151301 Ca2-MJependent activator protein forsecretion 

AB022317 H5.25887 sema domain, Immunogtdbunn domain (Ig). transmembrane domain (W) and short cytoplasmic 

AW972542 Hs.289008 Homo sapiens cDlsiA: R.J21B14 fis. done HEP01068 

Z39591 Hs.101376 EST 

BE622770 Hs.264915 Homo sapiens cDI^ FU12908 fis, clone NT2RP2004399 

R55344 Hs.22142 cytochrome b5 reductase b5R.2 

AA497044 Hs.20887 hypothetical protein FU10392 

AI760853 Hs.24155a ariadna (Drosophila) homolog 2 

R59161 Hs.1 24953 ESTs 

R38110 Hs.106296 ESTs 

Z99348 Hs.1 12461 ESTs. Weakly similar to 138022 hypothetical protein [Haptens] 

AA081258 Hs.132390 zinc finger protein 36 (KOX 18) 

AA027167 Hs.10031 KIAA0955 protein 

AA370362 Hs.57958 EGF-TM7-latfophain-related protein 

AA488750 Hs.88414 BTB and CNC homology 1, basic (eudne zipper transciiptton factor2 

AL120071 Hs.48998 fibronectin leucine rich transmembrane protein 2 

BE246154 Hs.154210 endothelial differentiation, sphingoiipid G-protein-coupled receptor. 1 

AI630124 Hs.324504 Homo sapiens mRNA; cDNA DKFZp586J0720 (from clone DKFZp586J0720) 

AW893660 Hs.46440 solute canier family 21 (organic anion transporter), member 3 

AK000738 Hs.264636 hypothetical protein FLJ20731 

AF088033 Hs.159225 ESTs 

NM_012421 Hs.13321 rearranged L-myc fusion sequence 

AB011169 Hs.20141 similar to S. cerevisiae SSM4 

BE313625 Hs.57435 solute carrier family 1 1 (proton-coupled divalent metal ion transporters), member 2 

NM_000275 Hs.82027 oculocutaneous albinism II (pinl(-eye dilution (murine) homolog) 

AA083764 Hs.6101 hypothetical protein M6C31 76 

W20027 H5^39 ESTs 

AF234532 Hs.61638 myosin X 

X89426 Hs.41716 endothelial cell-specific molecule 1 

AA256444 Hs.126485 hypothetical protein FLJ12604: KIAA1692 protein 

AI369275 Hs.243010 Homo sapiens CDNA FU14445 fis, done HEMBB1001294, highly similar to GTP-BINDING 

AA806600 Hs.116665 KIAAI 842 protein 

AA334551 Hs.82767 sperm specific antigen 2 

AB018259 Hs.1 18140 K1AA0716 gene product 

AL031428 Hs.174174 KIAA0601 protein 

AI745379 Hs.42911 ESTs 

AA293655 Hs.97293 ESTs 

AA769266 Hs.193657 ESTs 

.s U64820 Hs.66521 Machadd^oseph disease (spinooeiebellar ataxia 3, olivopontocerebellar ataxia 3, autosomal 

R55344 Hs.22142 cytochrome b5 reductase b5R^ 

AA837098 Hs.269933 ESTs 

BE384932 Hs.64313 ESTs. Weakly similar to AF257182 1 G-proteh-coupIed receptor 48 [H.sapiensl 

AA393037 Hs.1 61798 ESTs 

AF010258 Hs.127428 homeobQxA9 

AW959615 Hs,111045 ESTs 

A1219896 Hs.97592 ESTs 

R56434 Hs.21062 ESTs 

J AA1 15496 Hs.336898 Homo sapiens. Similar to RIKEN cDNA 1810038N03 gene, done MGC:9890. mRNA. complete 

AA370362 Hs.57958 ^GF-TM7-latR)phHln-related protein 

AIJ389951 Hs.271623 nudeoporin SOkD 

W69216 Hs.92848 ESTs 

AA251720 Hs.104347 ESTs, Weakly similar to ALUC_HUMAN Illl ALU CLASS C WARNING ENTRY II! [H.sapiensl 

AI123705 Hs.106932 ESTs 

AK001607 Hs.41127 hypothetical protein FU1 3220 

AWg78721 Hs266076 ESTs, Weak^ slntflar to A46010 X-Unked retinopathy protein [H^sapiens] 

AA037121 H5.98518 Homo sapiens cDNA FU114gO fis, done HEMBA1001918 

AA157293 HsJ2168 ESTs 

BE565817 Hs.26498 hypothefical protein FU21657 

AA043979 Hs.62651 EST 

T66847 Hs.194040 ESTs. Weakly similar to 138022 hypothetical protein IH.sapiens] 

AK001612 Hs.26962 Homo sapiens dDNA FU10750 fis. done NT2RP3001929 

AL038450 Hs.48948 ESTs 

F06972 Hs.27372 BMX non-reoeptor tyrosine kinase 

AA024961 Hs.50730 ESTs 

AU077002 Hs.24950 regulator of G-protein signalling 5 

BE206939 Hs.42287 E2F transcription fador 6 

BE561824 Hs.273369 uncharacterized hematopoiefic stem/progenitor ceils protein MDS027 

AI338631 Hs.43266 Homo sapiens cDNA: FLJ22536 fis, done HRC13155 

AW978022 Hs.238911 hypothetical protein DKFZp762E151 1; KIAA1816 protein 
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132121 
117657 
134922 
118523 
116845 
115291 
120326 
130174 
129131 
129868 
118661 
129829 
115985 
•134637 
132714 
129771 
123360 
132902 
113716 
113825 
130367 
120541 
116727 
118219 
119767 
128917 
451553 
132716 
118525 
114618 
119743 
108154 
122798 
133746 
119822 
122186 
114941 
118053 
123234 
129280 
118995 
116750 
129028 
105127 
114513 
411856 
132036 
130091 



RC_AA443284_s 

RC N39074 

RC_W04507_s 

RC^R41828_s 

RC_H64973 

RC_AA279943 

RC_AA196300 

M29550 

RCJ\A436489 

RC_AA287032 

RC_N70777 

.RC^AA496921 

RC_AA447709 

RC_AA369856_s 

RC_AA252598 

RC_H73237 . 

RC_AA504784 

RC_AA490969 

RC_T97750 

RC.W48860 

RC.238501 

RC_AA278298 

RC^13684 

RC_N62231 

RC^W72562 

RCJ\A481252 

RC_AA020928 

RC_AA251288 

RC_N67861 

RCJVA084162 

RC_W70242 

RC^AA425151_s 

RC_AA460324 

U44378 

RC W74471 

RC_AA435842 

RC_AA243017 

RC_N53367 

RC_AA490227 

M63154 

RC.N94591 

RC_H05960 

M98833 

RC.AA158132 

RCJ\A044825 

RC_T35697 

W01568 

RC_W88999 



NM_004529 

N39074 

AI718295 

Y07759 

AA649530 

BE545072 

AA196300 

M29551 

AB026436 

AW172431 

AL137554 

AF010258 

AA447709 

U87309 

W39388 

AL096748 

AA532718 

A1936442 

AA001356 

AW014486 

AL135301 

W07318 

R76472 

AA862391 

W72562 

AI365215 

AA018454 

BE379595 

N67861 

AW979261 

AA947552 

NM_005754 

AW366286 

AW410035 

AF086409 

AA398811 

AA236512 

N53391 

NWL001938 

M63154 

N94591 

AA760689 

AL120297 

AA045648 

AA044873 

H67899 

AL157433 



A1267592 

W81486 

AW957372 

AI247568 

AI567972 

AA225048 

X03068 

AA814948 

AA478486 

AB000115 

W93227 

AA188175 

AA399640 

AA416B67 

AA287556 

AA349647 

T03170 

AA178953 



AA385934 
R78376 
AW082866 
AI869129 



Hs.404 • 
Hs.44933 
Hs.91161 
Hs.170157 

Hs.122579 

Hs.21145 

Hs.151531 

Hs.177534 

Hs.13012 

Hs.49927 

Hs.127428 

Hs.268115 

Hs.180941 

Hs.55336 

Hs.102708 

Hs.178604 

Hs.5g838 

Hs.18159 . 

Hs.22509 

HS3768 

Hs.240 

Hs.65646 

Hs.48494 

Hs.58119 

Hs^06097 

Hs.269211 

Hs.283738 

Hs.49390 

Hs.291993 

Ks.58086 

Hs.220689 

Hs.145696 

Hs.75862 

Hs.301327 

Hs.104673 

Hs.87331 

Hs.47629 

Hs.16697 

Hs.1 10014 

Hs.323056 

Hs,92418 

Hs.108043 

Hs.301957 

Hs.103446 

Hs.4190 

Hs.37706 



sequence 
414108 U09564 
119881 RC.W81456 
117770 RC^N47953 
119850 RC_W80447 
115439 RCJ0^284561 
123107 RC^AA486071 
406698 M24364 
121231 RCJiiA400780 
132074 AB002366 
413670 AB000115 
125277 RC^W93227 
114056 RCJ^186324 
121153 RCJ\A399640 
121609 RC_AA416867 
120661 RC_AA287556 
120850 RC^AA349647 
124947 RC_T03170 
130529 RC_AA280886 
repetitive element;, mRNA sequence 
117683 RC^N40180 N40180 
IMAGE:276387 3' similar to contains Ll.tl 
120745 RCJSLA302809 AA302809 
120936 RCJU385934 
112597 RC_R78376 
120183 RC^40174 
120644 RCJ\A287038 



Hs.75761 

Hs.58848 

Hs.46791 

Hs.56452 

Hs.193090 

Hs.104207 

Hs73931 

Hs.96343 

Hs.3852 

Hs.75470 

Hs.103245 

Hs.82506 

Hs.97694 

Hs.98185 

Hs.263412 

Hs.96927 

Hs.100165 



LI 



Hs.97184 
Hs^33 
Hs.65882 
Hs.96616 



myeloid/lymphoid or mixed-lineage leukemia (trithorax (Drosophila) homolog); translocated to, 3 

ESTs 

prefoldin 4 

myosin VA (heavy polypeptide 12, myoxin) 

gbais44fD5.s1 NCLCGAP_A(v1 Homo sapiens cDNA clone. mRNA sequence 
hypothetical protein FLJ10461 
hypothetical protein RG083M05.2 

protein phosphatase 3 (formerly 28). catalytic subunit, beta isofbrm (caldneurin A beta) 

dual specificity phosphatase 10 

ESTs 

protein Icinase NYD-SP15 
homeo box A9 

ESTs, Weakly sM}ar to T08599 probable transcription factor CA150 (H.sapiens] 

vacuolar protein sorting 41 (yeast homolog) 

Homo sapiens, clone MGC:17421 , mRNA, complete cds 

DKFZP434A043 protein 

ESTs 

hypothetical protein FLJ10808 

ESTs 

ESTs 

hypothetical protein FLJ10849 
M^hase phosphoprotein 1 
ESTs 

ESTs. I^Aodeiatety similar to A46010 X-linked retinopathy protein [H.saplens] 
ESTs 

oncogene TC21 
ESTs 

casein idnase 1, alpha 1 

ESTs 

ESTs 

ESTs 

Ras-GTPase-activating protein SH3-domain-binding protein 
splicing factor (CC1.3) 

MAD (mothers against decapentaplegic. Drosophila) homolog 4 

ESTs 

ESTs 

ESTs 

ESTs 

down-regulator of transcription 1. TBP-binding (negative cofactor 2) 

gastric intrinsic factor (vitamin B synthesis) 

ESTs 

ESTs 

Friend leukemia vims integration 1 

nudix (nucleoside diphosphate linked moiety X)-type motif 5 

ESTs 

Homo sapiens cDNA: FLJ23269 fis. done COL09533 
hypothetical protein DKFZp434E2220 

gbzh70h03.s1 Soares_fetaLliver.spleen_1NFLS.S1 Homo sapiens cDNA done 3'. nf^NA 

SFRS protein kinase 1 
ESTs 

ESTs. Weakly similar to 138022 hypothetical protein {H^piens] 
ESTs 

ESTs. Highly similar to AF161437 1 HSPC319 [H.saplens] 

ESTs 

major histocompatibility complex, class II, DQ beta 1 

ESTs. Weakly similar to ALUC_HUMAN "I! ALU CLASS C WARNING ENTRY 111 (H.sapiensl 
KIAA0368 protein 

hypothetical protein, expressed in osteoblast 
EST 

KIAA1254 protein 

ESTs 

EST 

ESTs, Weakly similar to ALUB_HUMAN 11!! ALU CLASS B WARNING ENTRY III (H^apfensJ 

Homo sapiens cDNA FLJ12573 fis, clone NT2RM4000979 

ESTs 

gb:zp39e03.s1 Stratagene musde 937209 Homo sapiens cDNA done 3* similar to contains Aiu 

gb7y44d02.s1 Soares_mu!tiple_sderosls_2NbHMSP Homo sapiens cDNA done 
element ;, mRNA sequence. 

gb:EST10426 Adipose tissue, white I Homo sapiens cDNA 3* end, mRNA sequence. 

EST. Highly similar to (defllne not available 7499603) [Celegans] 

EST 

ESTs 

ESTs 
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119023 RC.N98488 • N984a8 
IMAGE:3101293', mRNA sequence. 

107582 RCJVA002147 AA002147 

118249 RC.N62560 N62580 

115022 RCJ^252029 AA252029 

117710 RC_N45198 N45198 

115341 RCJ^81452 AA281452 

118896 RC_N90680 N46213 

121121 RC.AA399371 AA399371 

1 18329. RC.N63520 . N63520 
3\ mRNA sequence. 

119496 RC„W35416 W35416 

118111 RC_N55493 N55493 
mRNA sequence. 
119062 RC.R16698 
116710 RC_F10577J 
119261 RC^T15956 
122723 RCJ\A457380 



AW444881 
F10577 
T15956 
AA457380 



gb:zb82h01.s1 Soares_senescenLfibroblasts_NbHSF Homo sapiens cDNA clone 
Hs.59952 EST 

Hs.322925 EST, Weakly similar to putative p150 [H.sapiens] 
Hs.87935 ESTs 

Hs.47248 ESTs, Higttly similar to similar to Cdc14B1 ptiosphatase [H.sapiens] 
Hs.88840 EST. Weak^ shniiar to granule cell marker protein [M.musculus] 
Hs.54642 methionine adenosyitransferase II, beta 
Hs.189095 similar to SALL1 (sal (DrosophilaHike 

gb:yy62f01.s1 Soares_multiple„sclerosis_2NbHMSP Homo sapiens cDNA clone IMAGE:278137 

Hs.156861 ESTs, Moderately similar to A46010 X-linked retinopathy protein [H.sapiens] 

gb:yv50c02.s1 Soaies fetal liver spleen 1 NFLS Homo sapiens cDNA done 1MAGE:246146 3', 

ESTs 



HS77829 
Hs.30608d 
Hs.65289 



v-crk avian sarcoma vims CTIO oncogene homolog 
EST 

gb:aa86b10^1 Stratagene fetal retina 937202 Homo sapiens cDNA done IMAGE:838171 3' 
similar to contains L1.b3 LI repetitive element mRNA sequence. 

1 17732 RC_N46452 N46452 gb:yy76h09.s1 Soares_mulliple_sclerosis„2NbHMSP Homo sapiens cDNA done 

IMAGE:279521 3' similar to contains L1,t2 LI repetitive element mRNA sequence. 

104787 RC^AA027317 AA027317 gb.ze97d11.s1 SoaresJetaLhearLNbHH19W Homo sapiens cDNA done IMAGE:366933 3" 

similar to contains Alu repetitive element;, mRNA sequence. 



100071 A28102 
115819 RCJVV426573 
130882 RC^Z40166J 
125225 RC^W76540 
108339 RCJ^070801 
WARNING ENTRY [H.sapiensl 



100338 D63483 

121636 RCJVA417027 

103875 RCJVA418387 

118716 RC.N73460 

119763 RC^W72450 

121917 RCJ\A428218 

132806 M91488 

130949 Y10659 

108806 RQ_AA129933 

133276 RCJ^490478 

134760 RC.H16758 

132867 AA121287 

132051 AA091284 

114208 RC Z39301 

104094 AA418187 

128718 AA426361 

302032 RC.N20407 

115501 RCJ\A291553 

101997 U01160 

103708 AA037206 

101899 S59184 

115839 RCJ\A429038 

409459 D50678 

103563 Z22534 

123233 RC_AA490225 
(H.sapiens] 

121305 RCJ^402468 

114798 RC_AA159181 

133145 RC_AA196549 

131567 RCJ\A291015.S AF015592 

112300 RC.R54554 

129507 RC_AA192099 

121033 RCJ^398505 

121151 RCJ^399636 

121402 RCJ\A406063 

123203 RCJ^489671 

132271 RCJ\A236466 

125197 RC_W69106 

114935 RC AA242809 
WARNING ENTRY [H.sapiens] 



125279 
108778 
108087 
132466 
133328 
124057 
124800 



RC W93640 

RC_AA128548 

RCJW^045709 

RC^N56810_s 

R36553 

RC F13604 

RC_R45115 



A281 02 Human GABAa receptor alphas subunit 

AA486620 Hs.41l35 endomudn-2 

AA497044 Hs.20887 hypotheUcal protein FLJ10392 

W74169 Hs.16492 DKFZP564G2022 protein 

AW151340 Hs.51615 ESTs, Weakly similar to ALU7.HUMAN ALU SUBFAMILY SQ SEQUENCE CONTAMINATION 

Hs.57735 acetyl LDL receptor; SREC 

Hs.306654 Homo sapiens cDNA FU13574 fis. done PUCE1008625 

Hs.48802 Homo sapiens done 23632 mRNA sequence 

Hs.1 18722 fucosyttransferase 8 (alpha (1 ,6) fuoosyltransferase) 

Hs.10450 Homo sapiens cONA: FLJ22063 fis. done HEP10326 

Hs.g8038 ESTs 

H5.278619 . hypothetical protein FU10099 

Hs.285115 interleukin 13 receptor, alpha 1 

Hs.71 168 Homo sapiens done 24674 mRNA sequence 

Hs.69504 ESTs 

Hs.89548 erythropoietin receptor 

Hs.58553 CTP synthase II 

Hs.180145 HSPC030 protein 

Hs.7859 ESTs 

Hs.330515 ESTs 

Hs.281706 sortTin 1 

Hs.128087 coagulatton feK^torll (thrombin) receptor 

Hs.190086 ESTs 

Hs.50984 sarcoma amplified sequence 

Hs72071 hypothetical protein FU20038 

Hs.79350 RYK receptor-like tyrosine kinase 

Hs.28935 transdudn-like enhancer of split 1, homolog of Drosophita E(sp1) 

Hs.54481 low density lipoprotein receptor^related protein 8, apolipoproteln e receptor 

Hs.150402 AcbVIn A receptor, type I (ACVR1) (ALK-2) 

Hs.188751 ESTs, Weakly simflar to MAPB JHUMAN MICROTUBULE-ASSOCIATED PROTEIN IB 

Hs.291557 ESTs 

Hs.54900 serotogically defined colon cancer antigen 1 

Hs.6592 Homo sapiens, done tMAGE:2961368. mRNA, partial cds 

Hs.28853 CDC7(cendivistoncyde7,S.cer6visiae,homotog)-iike1 

H24334 Hs^6125 ESTs 

AJ236885 Hs.1 12180 ztoc Hnger protein 148 {pHZ-52) 

AA398505 Hs.97360 ESTs 

AA399636 Hs.143629 ESTs 

AA406083 Hs.98003 ESTs 

AA352335 Hs.65641 hypothetical protein FD20073 

AB030034 Hs.1 15175 stenle-elpha motif and leudne zipper containing kinase A2K 

AF086270 Hs.278554 heterochromatin-like protein 1 

H23329 Hs.290880 ESTs. Weakly similar to ALU1.HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 

AW401809 Hs.4779 KIAA1 160 protein 

AF133123 Hs.90d47 general transcrlptton factor IIIG, polypeptide 3 (102kD) 

AA045708 Hs.40545 ESTs 

AI597655 Hs.49265 ESTs 

AW452738 Hs.265327 hypothetical protein DKFZp761l141 

AA902384 H3.73853 bone morphogenefic protein 2 

AW864086 Hs.138617 thyroid honnone receptor Interactor 12 



086864 
AA379203 
T26379 
AI658908 
R54146 
AA406397 
AI699432 
AV656840 
AF070578 
AW978439 
NM«000121 
AF226667 
AA393968 
AL049466 
AA418187 
NM_002959 
NM«001992 
AA291553 
AU076536 
AA430591 
S59184 
BE300266 
D86407 
L02911 
AW974175 

AA402468 
AA159181 
H94227 
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121029 RC^AA398482 
120663 RC MKimil 
102133 U15173 
108246 RC_AA062855 
125226 RC_W78134 
120260 RC_^AA171739 
124906 RC R87647 
109406 RC_.AA226877 
109271 RCJVA195668 
125052 RC_T80174_s 
109101 RC AA167708 
115241 RC_AA278723 
117163 RC^H979D9 
113530 RCjr90313 
120375 RC_M227260 
129435 AA314256 
114864 RC_AA235256 
103988 AA314389 
131006 RCJ\A242763 
106781 RC.AA478474 
106141 RC^AA424558 
116213 RCJ^476738 
135266 AB002326 
135058 RC_AA430152 
119908 RC_W85844 
103695 AA018758 
103978 AA307443 
109485 RC_AA233472 
129574 AA458603 
115347 RC_AA281528 
120765 RCJVA338735 
WARNING ENTRY [H.saplens] 
121059 RCJ\A398628 AA393283 
131887 AA046548 • 
member 1 

112064 RC_R43812 
115606 RC_AA400465 
131750 RC_H94855_s 
102123 U14518 
129847 RC_W46767 
133809 RC^AA235275 
132210 RC.N51499_s 
122356 RC.AA443794 
114958 RCJ11A243708 
103951 AA287840 

134703 RCJW280704 
128727 AA287864 
105743 RCJ\A293300.s BE246502 
domain, (semaphorin)4B 
103744 AA076003 
sequence 
114348 N80402 
114009 RC_W90067 

134704 RCJ\A280849 
128629 AA399187 
104410 H65925 
110200 RC_H21075 
124483 RC_N53976 
101391 M14648 
109657 RC^F04826 
117140 RC_H96813 



AA398482 
AAB27798 
AU076845 
AI423132 
AA78253S 
AK000061 
H75964 
AA199883 
AW137422 
T85104 
AW608930 
AA648278 
N36861 
T90313 
AF028706 
AF151852 
AA135332 
AA314369 
AF064104 
AA330310 
AF031463 
AA292105 
R41179 
AI379720 
AA524470 
AW207152 
NM^016940 
BE619092 
AA026815 
AA356792 
AW961026 



Hs.97641 

Hs.105089 

Hs.155596 

Hs.146343 

Hs.122647 

Hs.101590 

Hs.107815 

Hs.67624 

Hs.86022 

Hs.222779 

Hs.52184 

Hs.193859 

Hs.42344 

Hs.16732 

Hs.111227 

Hs.111449 

Hs.71608 

Hs.42500 

HS22116 

Hs.24181 

Hs.9302 

Hs.326740 

Hs.97393 

Hs.93814 

Hs.58753 

Hs.186600 

Hs.34136 

Hs.28465 

Hs.11463 

Hs.334824 

Hs.96752 



W17064 

AL049390 
AI025829 
^1^A_004349 
NM.001809 
N64025 
AV649326 
NM.007203 
AA443794 
N2D912 
A1^944 
AF117065 
AI223335 



AA079267 

AL050321 
AI248544 
AA837124 
AL096748 
AI807519 
H21075 
AI821780 
NM^002210 
R60900 
H96813 

132937 RCJVA233706J AW952912 
129799 R36410 AW967473 
105077 RCJ\A142919 W55946 
100850 RC_N58561_s AA835472 
131043 RC_AA490925 AF084535 
118417 RC_N66048J AF080229 
129254 RC_AA243695 AA252468 
119149 RC_R58910 BE304701 
133996 AA091367 AA380267 
110223 RC^H23747 H19836 
117626 RC_N36090 AKD01757 
135286 RCJ^424469_S AW023482 
122967 RCJVA478521 AA806187 
131236 AA282640 AF043117 
128568 AA46338D H12912 



Hs.332848 

Hs.22689 

Hs.86320 

Hs.31551 

Hs.1594 

Hs.296178 

HS76359 

Hs.42322 

Hs.98390 

Hs.42369 

Hs.50115 

Hs.88764 

Hs.50651 

Hs.9598 



Hs.301532 

Hs.103000 

Hs.88780 

Hs.102708 

Hs.104520 

Hs.31802 

Hs.179864 

Hs.295726 

Hs.26814 

Hs.42241 

Hs.300383 

Hs.239114 

Hs.234863 

Hs.297939 

Hs.22464 

Hs.1098 

Hs.65732 

Hs.78277 

HS31697 

H5.261348 

Hs.97849 

Hs^89101 

Hs.24594 

Hs.274691 



EST 
ESTs 

BCL2/adenovIrus E1B 19ki>interacting protein 2 
ESTs 

N-myristoyllransferase 2 

hypothetical protein - 

ESTs 

ESTs 

ESTs 

ESTs, Moderately similar to similar to NEDD-4 [H^apiens] 

hypothetical protein FLJ20618 

ESTs 

ESTs 

ESTs 

Ztefemily member 3 (odd-pair^ DrosophOa homotog. heterotaxy 1) 

061-94 protein 

ESTs 

ADP-ribosylation fector-Uke 5 

CDC14 (cell division cyde 14, S. cerevisiae) homotog B 

ESTs 

phosductn-lilce 

hypothetical protein M6C10947 

KIAA0328 protein 

hypothetical protein 

ESTs 

ESTs 

chromosome 21 open reading frame 6 

Homo sapiens cDNA: FU21869 fis. done HEP02442 

UMP-CMP kinase 

hypothetical protein FU14825 

ESTs. Weakly similar to ALU8 JHUMAN ALU SUBFAMILY SX SEQUENCE CONTAMINATION 

gb:zt74e03.r1 Soares Jestis_NHT Homo sapiens cDNA done 5*. mRNA sequence 
SWI/SNF related, matrix assodated, actin dependent regulator of chromatin, subfamily e, 

Homo sapiens mRNA; cDNA DKFZp58601318 (from done DKFZp58601318) 
ESTs 

core-binding factor, mnt domain, alpha subunil2; transtocated to, 1; cydin D-related 
centromere protein A (17kD) 
hypothetical protein FLJ22637 



A kinase (PRKA) anchor protein 2 

ESTs 

ESTs 

Homo sapiens mRNA; cDNA DKFZp761J1112 (from done DKFZp761J1112) 
maie^pedito lethal^ (DrosophiiaHike 1 
Janus kinase 1 (a protein tyrosine Idnase) 

sema domain, immunoglobulin domain (Ig). transmembrane domain (TM) and short cytoplasmic 

gb3Ti97e10.s1 Stratagene colon HT29 (937221) Homo sapiens cDNA done mRNA 

CRP2 binding protein 
KIAA0831 protein 
ESTs 

DKF2P434A043 protein 

Homo sapiens cDNA FU13694 fis. clone PLACE20001 15 

ESTs, Highly similar to A59266 unconventional myosin-15 [H.sapiensl 

ESTs 

integrin, alpha V (vitronectin receptor, alpha polypeptide, antigen CD51) 

ESTs 

ESTs 

hypothetical protein MGC3032 

mannosidase, atpha, class 1A. member 2 

Homo sapiens cDNA FU12082 fis. done HEMBB1002492 

cathepsin B 

epilepsy, progressive myodonus type 2, Lafora disease (taforfn) 

gbiHuman endogenous retrovirus K done 10.1 polymerase mRNA, partial cds 

DKFZp434J1813 protein 

ESTs 

DKFZP434F2021 protein 
ESTs 

hypoflietlcal protein FLJ10895 
ESTs 

glucose regulated protein, 56kD 

ubh)u!tinatIon factor E4B (homologous to yeast UFD2) 

adenylate kinase 3 
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112888 RCT03872 AW195317 Hs.107716 hypothetical protein FU22344 

115192 RcIaA261920 AA741024 Hs.88378 ESTs 

118688 RC N71484 AK000708 Hs.169764 hypothetical protdn FU20701 

122264 RCIm436837 AA436837 gb2v57g07^1 Soaies.testisJ«IHT Homo sapiens cDNA clone 3', mRNA sequence 

128981 AA135452 AA927177 Hs.86041 CGG triplet repeat binding protein 1 

131042 RC R42457 A1826288 Hs.171637 hypothetical protein MGC2628 

103704 AA028171 AA028171 Hs.151258 hy(X>theticat protein FU21062 

121341 AA233107 AF035528 Hs.153863 IVIAD (nrwthers against decapentaplegia Drosophila) homolog 6 

106593 RC AA456826 AW296451 Hs.24605 ESTs 

115195 RC AA262156 AW968619 Hs.155849 ESTs 

115425 RcIaA2B4071 AA811895 Hs.180680 ESTs, WealUy similar to 154374 gene NF2protKn[H.sapiensl 

117258 RC.N21299 AF086041 Hs.42975 ESTs 

120209 RC.Z40892 F02951 gb:HSC1HB082 nonnalized mfant brain CDNA Homo sapiens cDNA clone c-1hb08 3*. mRNA 

L16991 Hs.79006 deoxythymidylate kinase (thymldylate kinase) 

AW959755 Hs.28B895 Homo sapiens cDNA FU12977 fe, done NT2RP2006261 

AA059459 Hs.62592 ESTs 

N26707 Hs.154304 Homo sapiens chromosome 19. BAG 282485 (Crr-B-344H19) 

N91453 Hs.102987 ESTs 

R22039 Hs.23217 ESTs 

AA2B0095 Hs.886B9 ESTs 

AA282292 Hs^79841 hypothetical protein FU 10335 

N41549 Hs.285410 ESTs 

AI269096 Hs.135578 chltobiase. di-N-acetyl- 

AL137275 Hs.20137 hypothetical protein DKFZp434P0116 

NIVL0a7331 Hs.1 10457 Woimirschhom syndrome candWale 1 

NI\/L002267 Hs.3886 karyopherin alpha 3 (importin alpha 4) 

R36095 Hs.268695 ESTs 

AA463550 Hs.337532 ESTs, Weakly similar to A47582 B-ceB growth factor precursor [Hsapiens] 

AL1 37939 Hs.40096 ESTs 

BE246400 Hs.285176 acetyl-Coenzyme A transporter 

R24024 Hs.1 58101 Homo sapiens cDNA FLJ14673 lis. done NT2RP2003714. moderately similar to ZINC FINGER 

AI091731 H5.87293 hypothetical protein FLJ20045 

R85350 Hs.101368 ESTs 

AA355925 H5.36232 K1AA01 86 gene product 

N80181 Hs^1498 ESTs 

BE149656 Hs^621 Homo sapiens cDNAFUl 1963 fis, done HEMBB1001051 

NM.015361 Hs^68053 KIAA0029 protein 

AF071076 Hs.112255 nudeoporin 98kD 

AA075000 gb:2m83c07.s1 Slratagene ovarian cancer (937219) Homo sapiens cDNA done 3', mRNA 

H56995 Hs.37372 Homo sapiens DNA binding peptide mRNA, partial cds 

AA173223 Hs.289044 Homo sapiens cDNA FLJ12048 fis. ctone HEMBB1001990 

AI041793 Hs.42502 ESTs 

AA075998 gb:zm89b09.r1 Stratagene ovarian cancer (937219) Homo sapiens cDNA done 5* similar to 



sequence 
134082 L16991 
104774 RCJ\A026066 
115625 RCJVA401630 
104469 N28707 
107401 W20054 
111686 RC_R21510 
•115300 RCJ^80026 
115378 RC AA282292 
132224 RC_H97819 
113791 M95767 
129144 AA004987 
104448 L44574 
132084 RCT26981_s 
111831 RC_R36083 
114765 RC_AA252163 
115029 RCJ^A252219 
100457 H81492 
104536 R24011 
PROTEIN 91 
116167 RCJ\A461562 
103889 AA235771 
131978 RC.H48459.S 
118843 RC.N80181 
120837 RC.W93092 
133647 D21852 
129521 U41815 
103746 AA081876 
sequence 

132019 RCJ^134965J 
132310 RC_AA284107 
117367 RC_N24954 
103743 AA075998 



gb:M15887 ACYL-COA-BINDING PROTEIN (HUMAN);. mRNA sequence 



103761 AA0B5138 AA765163 
BINDING PROTEIN (HUMAN);. mRNA sequence 



1J59060 
RC„N72879 
AA045930 
AA096412 
RCJ\A599244 
AA293634 
H05583 
U79298 
23803 mRNA 
118612 RC^N69466 
AA203138 
RC_N38882 
RC R01245 
RC„T76962 
AA417310 
RCJ^6879 



130237 
128752 
135162 
131386 
129021 
424274 
129913 
131888 



322026 
110892 
111429 
113334 
104091 
105246 



AA913909 

AA504428 

AI187925 

BE219898 

AL044675 

W73933 

NM 001310 

AW294659 

AB037788 

AW024973 

AL035301 

AI038052 

AW974666 

BE465093 

AA226879 



Hs.1 53088 
Hs.10487 
Hs.95667 
Hs.173135 
Hs.173081 
Hs.283738 
Hs.13313 
Hs.34054 

Hs.224961 

Hs.283675 

Hs.97375 

Hs.19162 

Hs.293024 

Hs.106101 



IMAGE:663856 3* similar to contains Alu repetitive element;, mRNA sequence. 



gbuiz79b10.s1 NCLCGAP.GCBI Homo sapiens cDNA done 3' similar to gb:M34539 FK506* 

TATA box binding protein (TBP)-assoclated factor. RNA polymerase I. A, 48kD 
Homo sapiens, done II\^GE:3954132, mRNA, partial cds 
F-box protein 30 

duat-spedfidty tyrosin&<Y)>phosphorylation regulated kinase 2 

KIAA0530 protein 

casein kinase 1, alpha 1 

cAMP responsive element binding protein-iike 2 

Homo sapiens cDNA: FIJ22488 fis. done HRC10948. highly similar to HSU79298 Human done 

cleavage and polyadenylation specific factor 2. lOOkD subunit 
NPD009 protein 

H^piens gene from PAC 106H8 

ESTs. Weakly similar to 154374 gene NF2 protein [H^apiens] 

ESTs 

hypothetical protein FLJ22557 

gb3r19c09.s1 Stratagene NT2 neuronal precursor 937230 Homo sapiens cDNA done 



113300 RC_T67448 T67448 Hs.13101 

117147 RCJH97225_S AW901347 Hs.38592 

121349 RCJVA405205 AA405205 Hs.97960 

100294 D49396 AA331881 Hs.75454 

133999 M28213 AA535244 Hs.78305 

133259 AA278548 BE379646 Hs.6904 

129423 AA371418 AA204686 Hs.234149 

131098 RCJ^459668 U66669 Hs.236642 

135272 AA399391 A1828337 Hs.97591 

129155 AA046865 AI952677 Hs.108972 



ESTs 

hypofhetk^al protein FU23342 

ESTs, Weakly similar to T51 146 ring-box protein 1 [H.sapiens] 
peroxiredoxin 3 

RAB2. member RAS oncogene family 

Homo sapiens mRNA full length insert cDNA done EUR0IMA6E 2004403 
hypothetical protein FU20647 
3-hydroxyisobutyiyl-Coenzyme A hydrolase 
ESTs 

Homo sapiens mRNA; cDNA DKFZp434P228 (from done DKFZp434P228) 
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311291 AA056319 
120750 RC_AA310499 
101002 J04058 
133012 AA099241 
103879 AA228148^s 
131281 RC_AA443212 
115109 RCJ\A256383 
118502 RC^N67317 
134100 L07540 
131869 AA484944 
115398 RC AA282985 
103860 AA203742 
135089 N75611^s 
129938 U79300 
107508 W90095 
103685 AA005190 
125170 AA203147 
129179 RCJ\A504125.s AW9e9025 
116262 AA477046 AI936442 
123009 RCJ\A479949 
131004 D29833 
103317 X83441 
132814 RC.C15251_f 
103992 U77718 
109258 X59710 
110754 RC_N20814 
132727 AA136382_s 
100341 D63506 
134664 AA256106 
103826 Ml 65564 
111678 RC R20628 
101341 L76159 
115455 RC^AA285068 
111192 RCJVA477748 
129385 RCJ\A235604 
125050 RC„T79951 
122105 RC_AA432278 
121324 RC_AA404229 
120938 RC_AA386260 
115001 RCJ^251376 
sequence. 

124799 RC_R45088 
sequence. 

122724 RC.AA457395 
117791 RC.N48325 
121895 RCJVA427396 



AA782601 

AI191410 

AV655843 

AA847843 

BE543269 

AA251716 

AJ249977 

AL1 57488 

AA460085 

AW968547 

AA810854 

AW976877 

AI918035 

AW003668 

N74925 

AA158008 

AL020996 



AA535244 
D29833 
X83441 
D60730 
BE018142 
AL044818 
AW302200 
N27495 
AF032922 
AA256106 
AW162998 
R38487 
NM_004477 
AA876002 
AW021968 
AA172106 
AW970209 
AW241685 
AA404229 
AA386260 
AA251376 

R45088 

AA457395 
N48325 
AA427396 



Hs.319817 ESTs 

Hs.96693 ESTs, Moderately similar to 2109260A B cell growth factor [H.sapiens] 

Hs.169919 electron-transfer-flavoprotein. alpha polypeptide (glutaric aciduria II) 

Hs.6271 1 Homo sapiens, clone IMAGE-.3351295, mRNA 

Hs.50252 mitochondrial ritx)somal protein L32 

Hs.25227 ESTs 

Hs.88049 protein kinase, AMP-activated. gamma 3 non-catalytic subunit 

Hs.50150 Homo sapiens mRNA; cDNA DKFZp564B182 (from done DKFZp564B182) 

Hs.171075 replication factor C (activator 1) 5 (36.5kD) 

Hs.33540 ESTs. Weakly slmflarto dJ309K20.4 [H.sapiens] 

Hs.89081 ESTs 

Hs.38057 ESTs 

Hs.301 198 roundabout (axon guidance receptor, DrosophOa) homolog 1 

Hs.1 35587 Human clone 23629 mRNA sequence 

Hs.38761 Homo sapiens cDNA: FU21564 fjs. ctone COL08452 

Hs.292444 ESTs 

Hs.8518 selenoprotein N 

Hs.109154 ESTs 

Hs.59838 hypothetical protein FU10808 

Hs.78305 RAB2, member RAS oncogene feimiiy 

Hs.2207 salivaiy proline-rich protein 

Hs.166091 ligase iV. DMA. ATP-dependent 

Hs.57471 ESTs 

Hs.300954 Huntingtin interacting protein K 

Hs.84928 nuclear transcriptton factor Y. beta 

Hs.6336 KIAA0672 gene product 

Hs.5565 hypothetical protein FLJ22626 

Hs.8813 syntaxin binding protein 3 

Hs.87507 ESTs 

Hs.24684 KIAA1376 protein 

Hs.169927 ESTs 

Hs.203772 FSHD region gene 1 

Hs.120551 ton-like receptor 10 

Hs.1 09438 Homo sapiens clone 24775 mRNA sequence 

Hs.1 10950 Rag C protein 

Hs.111805 ESTs 

Hs.98699 ESTs 

Hs.97842 EST 

Hs.104632 EST 

gb:zs10a06.s1 NCLCGAP_GCB1 Homo sapiens cDNA clone IMAGE:684754 3', mRNA 

gb:yg38g04.s1 Soares Infant brain 1NIB Homo sapiens cDNA clone IMAGE:34896 3\ mRNA 

Hs.99457 ESTs 

Hs.93956 EST 

gb:zw33a02.s1 Soares ovary tumor NbHOT Homo sapiens cDNA done IMAGE:771050 7 



similar to contains Alu repetitive elementicontains MER12.t2 MER12 repetitive element mRNA sequence. 



108244 RCJ\A062839 
3*. mRNA sequence. 
117852 RC.N4940B 
109298 RCJ\A205432 
122432 RCJ^447400 
124627 RC_N74625 



AA062839 gbsmOScOS^I Stratagene oomeal stroma (937222) Homo sapiens cDNA done IMAGE:513232 

AW877787 Hs.136102 K1AA0853 protein 

R77854 . Hs.250693 Krueppel-related zinc finger protein 

AA447400 Hs.187684 ESTs, WeaWy similar to B34087 hypothetical protein [H.sapiensJ 

N74625 gb:2a55c03.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA done IMAGE:296452 3* 



similar to gb:M14338 VITAMIN K-DEPENDENT PROTEIN S PRECURSOR (HUMAN);contains 0FR.b3 OFR repetitive element;, mRNA sequence. 



115141 RCJVA258071 AA465131 Hs.64001 
128636 U49065 U49065 Hs.102865 

115373 RCJ^282197 AA664862 Hs.181022 
114651 RCJ^101400 AA101400 Hs.189960 
132796 RCJ\A180487 NM^006283 Hs.173159 
103749 RC_N35583 AL135301 Hs.8768 
107328 T83444 AW959891 Hs.76591 

116349 RCJ^281563 AF121176 Hs.12797 
111490 RC^R06862 R06862 
Similar to contains LI repetitive element ;. mRNA sequence. 
103763 AA085354 AA085291 
contains Alu repetitive element;, mRNA sequence 



Homo sapiens done 25218 mRNA sequence 
interleukin 1 receptor-like 2 
CGl-07 protein 
ESTs 

transfomning. addic coiled-ooil containing protein 1 
hypothetical protein FLJ10849 
KIAA0887 protein 

DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 16 

gb:yf11e09.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA done IMAGE:126568 3' 
i, 

gb:2n01g06.s1 Stratagene colon HT29 (937221) Homo sapiens cDNA done 3* similar to 



118791 RC_N75520 
116644 RC^F03032 
116823 RC^H56485 
[H.saplens] 

108940 RC_AA148603 



N75520 
F03032 
AW204742 

AA148603 



tAJIAGE:567198 3*. mRNA sequence. 



112218 RG_R50057 

116557 RG_D2057^.I 

133649 U25849 

131745 RC.C20746 



R50057 
D20572 
U25849 



Hs.261003 ESTs, Moderately simHar to B34087 hypothetical protein [H.sapiens] 

Hs.290278 ESTs. Weakly similar to B34087 hypothetical protein [H.sapiens] 

Hs.143542 ESTs, Highly similar to CS/LHUIVIAN COCKAYNE SYNDROME WD-REPEAT PROTEIN CSA 

gb2o09e04.s1 Stratagene neuroepithelium NT2RAMI 937234 Homo sapiens cDNA done 

Hs.272251 Homo sapiens mRNA; cDNA DKFZp586M1418 (from ck3ne DKFZp586M1416) 

Hs.90171 EST 

Hs.75393 add phosphatase 1 , soluble 

Hs.31447 ESTs. Moderately slmHarto A46010 X-linked retinopathy protein |H^iens] 
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116801 RC_H43879 H43879 
sequence. 

115006 RC_AA251548 
123424 RC AA59850O 
120831 RcZaA347919 

103691 AA018298 
121555 RC.AA412491 
111193 RCJM67946 
132061 RCJ\A058946 
134575 RCJUM94568J AA194568 
115050 RCJ\A2S2794 AA252794 
420208 U31799 
133735 AC002045j(pt1 
128546 Z21305 
111946 RC_R40697 
124879 RC_R73588 
115683 AA410345 

103692 AA018418 



AA251548 

H29882 

AA347919 

AA018298 

AF025771 

N67946 

AB020700 



BE276055 
R66740 
NWL003478 
R40697 
R73588 
AF255910 
AW137912 



Hs.186809 



(CACNA1F) gene, complete cds; HSP27 pseudogene, 
103767 AA089688 BE244667 Hs.296155 
125266 W90022 W90022 
PRECURSOR [H.sapiens] 
135235 AA435512 AW298244 
134497 RCJVA404494 BE258532 
426754 RCJVA278529J NIVL014264 
412177 RC_AA342828.s Z23091 
132000 RC_AA044M4 AW247017 
124738 RCJ\A044644 T07568 
324000 RC_AA196729^I AA604749 
106696 RC_AA196729_i AW073202 
132000 RC^AA025658 AW247017 
129577 RC_AA025858 N75346 
107091 RCJ\A233519 AI949109 
130296 RC_N52271 D31139 
102855 RC_N68399 N[VL003528 
113689 RCJVA098874 AB037850 
100939 RCJ^79667.s L04288 
130430 RC_H22556 W27893 
108734 RC.lsM5979_s BE298690 
Intersecfin 2 long isofonn (rrSN2) mRNA 
135148 RCJ^31288.s AA306478 
134221 RCJ\A609862 BE280456 
105376 RC.N35583 AW994032 
124541 U77718 AP1 12222 

134546 AA203147 AIJ020996 
134000 RC_W93092 AW175787 
125656 RC_W93092 AW516428 
100939 RC_N58561^s L04288 
125656 RCJW93092 AW516428 
101779 RC^W69385_s BE543412 
332489 RC_R22947 R23053 
133000 RC_N38959J AL042444 
125905 RC^N38959J AI678638 
129000 RC^H73050_s AA744902 
100920 RC_H73050.s X54534 



gb7o6gh09.s1 Scares breast 3NbHBst Homo sapiens cDNA clone IMAGE:183233 3\ mRNA 

Hs.87886 EST 

Hs.162614 ESTs 

Hs.96889 EST 

Hs.103332 ESTs 

Hs.50123 zinc finger protein 189 

Hs.1 17569 ESTs 

Hs.3830 KIAA0893 protein 

Hs.85936 EST 

Hs.88009 ESTs 

Hs.95972 silver (mouse homolog) liice 

Hs.110613 KIAAQ220 protein 

Hs.101299 cull}n5 

HsJ6666 C9orf10 protein 

Hs.101533 ESTs 

Hs.54650 junctional adhesion molecule 2 

1^3.227583 Homo sapiens chromosome X map Xpl 1.23 L-type calcium channel aIpha-1 subunit 
complete sequence; and JM1 protein. JM2 protein, and Hb2E genes, complete cds 
CGI-100 protein 

ESTs. Hfeihly similar to LCT2-HUMAN I^UKOCYTE CELL-DERIVED CHEMOTAXIN 2 

Hs.293507 ESTs 

Hs.251871 CTP synthase 

Hs.172052 serine/threonine kinase 18 

Hs.73734 glycoprotein V (platelet) 

Hs.36976 melanoma antigen. faniOy A. 3 

Hs.137158 ESTs 

Hs.190213 ESTs 

Hs.334825 Homo sapiens cDNA FU14752 fis. clone NT2RP3003071 

Hs.36978 melanoma antigen, family A. 3 

Hs.82906 CDC20 (cell division cyde 20» S. cerevisiae. homolog) 

Hs.246885 hypothetical protein FLJ20783 

lHs.154103 LIM protein (similar to rat protein Idnase C-binding enigma) 

Hs.2178 H2B histone family, member Q 

Hs.16621 DKFZP434i116 protein 

Hs.297939 cathepsinB 

Hs.150580 putetive translation initiation facbr 

Hs.288173 Homo sapiens cDNA: FLJ21747 fis. done COLF5160, highly similar to AF182198 Homo sapiens 

Hs.95327 CD3D antigen, delta polypeptide (TiT3 complex) 

Hs.80248 RNA*blndlng protein gene with multiple splidng 

Hs.8768 hypothetical protein FU10849 

Hs.44499 pinin. desmosome associated protein 

Hs.8518 selenoprotein N 

Hs.334841 selenium binding protein 1 

Hs.78687 neutral sphingomyelinase (N-SMase) activation assodated factor 

Hs.297939 cathepsin B 

Hs.78687 neutral sphingomyelinase (N-Si\tese) activation assodated factor 

Hs^50505 retinoic add receptor, alpha 

NA HuOl Chip Redos 

Hs.62402 p21/Cdo42/Rac1<activated kinase 1 (yeast Ste20-related) 

Hs.6456 chaperonin containing TCP1 . subunit 2 (bete) 

Hs,107767 hypothetical protein PR01489 

Hs.278994 Rhesus blood group. CcEe antigens 



107 



BMSIXXrO: <WO_ 



_02079492A2_L> 



wo 02/079492 



PCT/US02/04915 



TABLE 1 A 

Table 1A shows the accession numbers for those pkeys lacking unigenelD's for Tables 1. The pkeys in Table 7 lacking unlgenelD's are represented wiffiin 
Tables l-GA. For each probeset we have fisted the gene cluster number fronrt which the oligonucleotides were designed. Gene dusters were compiled 
5 using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence similarly using Clustering and 

Alignment Tools (E^oubleTwist Oakland Califbmia). The Genbank accession numbers for sequences oomprismg each cluster are listed in the "Accession* 
column. 

1 0 Pkey: U nique Eos probeset Identifier number 

CAT number Gerie cluster number 
Accession: Genbank accession numbers 



IS Pkey CAT Number Accession 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



108469 116761J AA079467 AA128547 AA128291 AA079587 AA079600 
124106 125446J H 1 2245 AA094769R1 4576 
108501 13684.-12 AA083256 

108562 36375J AA100796 AF020589AA07462g AA07S946 AA100649AA085347 AA126309AA079311 AA079323 AA085274 

125008 1802095.1 T91251 T64891 T85665 

125020 116017 J T69981T69924AA078476 

125056 1B14993_1 T86284 T81933 

116661 1532859_1 R61504 F04247 

125104 413347_1 T95590 AA703278 H62764 

124575 1666649 1 N68168 N69188 N90450 

125263 1547.2 AA098878 W88942 

116845 393481_1 AA649530 AA659316 H54973 

1 18417 37186.1 AF080229 AF080231 AF080230 AF080232 AF080233 AF080234 BE550633 AI636743 AW614951 BE467547 AI680833 

AI633818 N29988 U87592 U87593 U87590 U87591 S46404 U87587 AA463992 AW206802 AI970376 AI583718 AI672574 
N25695 AW665466 AI818326 AA126128 AI480345 AW013827 AA248638 AI214968 AA204735 AA207155 AA206262 AA204833 
AW003247 AW496808 A1080480 A1631703 A1651023 AIB67418 AWB18140 AA502500 A)206199 A1571282 AI352545 BE501030 
AI652535 BE465762 AA206331 AW451866 AA471088 AA206342 AA204834 AA206100 AW021661 AA332922 N6604d 
AA7033g6 H92278 AW139734 H92683 U87589 U87595 H69001 U87594 BE466420 AI624817 BE46661 1 AI206344 AA574397 
AA348354 AI493192 

1 18584 532052_1 AW136928 AI685655 BE218584 BE465078 N68963 AA975338 BE147199 N76377 

103743 112194 1 AA075998AA075999AA070986AA070896AA1 29207 AA078942AA070783AA078941 

103744 114161.1 AA079267AA076003 
103746 113452 J AA075000 AA081876 

103761 114208_1 AA765163 AW298222 AA126126AA08513B AA076068 

103763 48290.6 AA085291 AA085354 

120209 1531817J F02951 240892 F04711 

120284 15B963_1 AA179696 AA1 82626 AA1 82603 

112540 1605263.1 R69751 R70467 H69771 H80879 H80878 

111904 1719336.1 Z41572 R39330 

121059 273450.1 AA393283 AA398628 

121094 275729.1 AA402505 AA398900 

114108 1182096.1 AW60252& BE073859 Z38412 

130091 23961.-3 W88999 

122264 295527.1 AA436837 AA442594 

108280 110682.1 AA065089 AA085108 

129961 1706092-1 R23053 R79884 R76271 

130529 158447.1 AA178953 AA192740 

AA069818 AA069971 AA069923 AA089908 
AA021473 
AA609839 
D45652 
AA128980 
T68875 
T85352 
T85373 



108309 111495.1 
107832 genbanK_AA021473 
123731 genbanK„AA609839 
116571 genbanK^D45652 
132225 genbank_AA128980 
125017 genbank_T68875 

125063 genbank.T85352 

125064 genbank.T85373 
100964 entfe?_J00212J00212 

125118 149288.1 R10606T97620AA576309 
102269 entrez^U30245U30245 
125150 NOT_FOUND.enlre^W38240 W38240 
116801 genbanK_H43879 H43879 
118111 genbank.N55493 N55493 
118129 genbanK.N57493 N57493 
118329 genbank.N63520 N63520 
118475 genbanK_N66845 N66845 
111490 genbanK.R06862 R06862 
111514 genbanK.R07998 R07998 
104534 R22303.at R22303 
120340 genbaniU\A206828 AA208828 
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120376 genbankJ\A227469 
104787 genbanleAA027317 
120409 genbankJVA235050 
120745 genbankj^302809 
120809 genbankj^346495 
120839 genbanKJ^A348913 
113702 genbanleT97307 
115001 genbanlLAA251376 
122562 genbanK.AA452156 
122635 genban)LAA454085 
108244 genbanIU\A062839 
108277 genbankJ\A064859 
122723 genbankuAA457360 
124028 genbank.H)4112 
108403 genbankJUV075374 
122860 genbanlU\A464414 
108427 genbanKJVA076382 
108439 genbanLAA078986 
131353 231290.1 
AA969360 

108533 genbanleAA084415 
117031 genbanleH88353 
124254 genbanleH69899 
101447 entre^M21305 
101458 enlre2LM22092 
124577 genbank_N68300 

108940 genbanK.AA148603 

108941 genbanKJ^148650 
124627 genbank^N74625 
124720 144582J 
124793 genbariK-R44519 
124799 genbanK.R45088 
117683 genbanK.N40180 

genbanleN46452 
genbankjr50116 
genbanleN98488 
95573.2 



117732 
124991 
119023 
119239 
119558 
119654 
105246 
121350 
121558 



AA227469 

AA027317 

AA235050 

AA302809 

AA346495 

AA348913 

T97307 

AA251376 

AA452156 

AA454085 

AA062839 

AA064859 

AA457380 

F04112 

AA075374 

AA464414 

AA076382 

AA078986 ^ 
AW411259 H23555 AW015049 AI684275 AW015886 AW068953 AW014085 A1027260 R52686 AA918278 AI129462 

N34d69 AI948416 AA534205 AA702483 AA705292 

AA084415 

H88353 

H69899 

M21305 
M22092 



N68300 
AA148603 
AA148650 
N74625 

R05283 R11056 

R44519 

R45088 

N40180 

N46452 

T50116 

N98488 

T11483T11472 



NOT.FOUND.entrB2LW38194 W38194 
genbank^W57759 W57759 



genbanKJ\A226879 AA226879 

genbanK-AA405237 AA405237 

genbanK_AA412497 AA412497 

105985 genbanlU\A406610 AA406610 
100071 entre^8102A28102 

114648 genbanKJ\A101056 AA101056 

121895 genbanKJ\A427396 AA427396 
100327 entreZ-D55640D55640 
123315 714071_1 AA496369 AA496646 
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TABLE 2: 



Pkey: Unique Eos probeset identifier number 

Accession: Accession number used for previous patent filings 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 



10 



15 



20 



25 



30 



35 



40 
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55 



60 



65 



70 



75 



Pkey 


Accession ExAccn 


UnlgeneED 


100420 


100420 


086983 


Hs.1 18893 


10U4CS4 


100484 


NM-005402 




100991 


100991 


J03836 


nS.oZ0o5 


4M ICO 


101168 


NM_005308 


Lio o<i<iccn 
r1S.2115o9 


101261 


101261 


D30857 


nS.o2353 


101447 


101447 


M21305 




101543 


101543 


M31166 


Hs.2050 


101560 


101660 


AW958272 


nS.34732o 


101714 


101714 


M68874 


1 L nA A COT 

Hs.211587 • 


101838 


101838 


BE243845 


Hs.75511 


102012 


102012 


BE259035 


MS. 118400 


102164 


102164 


NM_000107 


Hs.77602 


102283 


102283 


AW161552 


Hs.83381 


102564 


102564 


U59423 


Hs.79067 


102759 


102759 


NrvL005100 


Hs.788 


102804 


102804 


NI\/L002318 


Hs.83354 


102898 


102898 


NM_002205 


Hs.149609 


103036 


103036 


M13509 


Hs.83169 


103095 


103095 


NM-005424 


Hs.78824 


103166 


103166 


AA1 59248 


Hs.1 80909 


103280 


103280 


U84722 


Hs.76206 


103850 


103850 


AA1 87101 


Hs.213194 


104592 


104592 


AW630488 


Hs.25338 


104766 


104786 


AA027167 


Hs.10031 


104865 


104865 


T79340 


Hs.22575 


104952 


104952 


AW076098 


Hs.345588 


105178 


105178 


AA313825 


Hs.21941 


105330 


105330 


AW338625 


H5.22120 


105729 


105729 


H46612 


Hs.293815 


105977 


105977 


AK001972 


Hs.30822 


106031 


106031 


X64116 


Hs.171844 


106155 


106155 


AA425414 


Hs.33287 


106423 


106423 


AB020722 


Hs.16714 


107174 


107174 


BE1 22762 


Hs.25338 


107295 


107295 


AA1 86629 


HS.B0120 


108756 


108756 


AA127221 


Hs.1 17037 


108888 


108888 


AA1 35606 


Hs.1 89384 


109166 


109166 


AA219691 


Hs.73625 


109768 


109768 


F06838 


Hs.1 4763 


110906 


110906 


AA035211 


Hs.1 7404 


111006 


111008 


BE387014 


Hs.1 66146 


111133 


111133 


AW580939 


Hs.97199 


113073 


113073 


N39342 


Hs.1 03042 


113923 


113923 


AW953484 


Hs.3849 


115061 


115061 


AI751438 


Hs.41271 


115145 


115145 


AA740907 


Hs,88297 


115947 


115947 


R47479 


nS.y*>/D 1 


116339 


116339 


AK000290 


Hs.44033 


116589 


116589 


AI557212 


Hs.17132 


117023 


117023 


AW070211 


Hs.102415 


117553 


117563 


AF055634 


Hs.44553 


11B475 


118475 


N66845 




119073 


119073 


BE245360 


Hs.279477 


119174 


119174 


R71234 




119416 


119416 


T97186 




121335 


121335 


AA404418 




123160 


123160 


AA488687 


Hs.284235 


123523 


123523 


AA608588 




123964 


123964 


013961 




124315 


124315 


NMJ005402 


Hs.288757 


124669 


124669 


AI571594 


Hs.102943 


124875 


124875 


At887664 


Hs.285814 


125103 


125103 


AA570056 


Hs.122730 


125565 


125565 


R20840 





Melanoma associated gene 
v-ral simian leukemia viral oncogene horn 
serine (or cysteine) proteinase inhibito 
G protein-coupled receptor kinase 5 
protein C receptor, endothelial (EPCR) 
gb:Human alpha satellite and satellite 3 
pentaxin-retated gene, rapidly induced b 
intercellutar adhesion molecule 2 
phospholtpase A2. group IVA (cytosoOc, 
connective tissue growth factor 
singed (Drosoph)la)-l]ke (sea urchin fas 
damage-specific DNA binding protein 2 (4 
guanine nucleotide binding protein 11 
MAD (mothers against decapentaplegic. Dr 
A kinase (PRKA) anchor protein (gravin) 
lysyl oxldase-like 2 
integrin. alpha 5 (fibronectin receptor, 
matrix metalioproteinase 1 (interstitial 
tyrosine kinase with immunoglobulin and 
peroxiredoxin 1 

cadherin 5. type 2, VE-cadherin (vascula 
hypothetical protein MGC10895 
protease, serine, 23 
KIAA0955 protein 

B-cell CLL/lymphoma 6, member B (zinc fl 
desmoplakin (DPI, DPIl) 
AD036 protein 
ESTs 

Homo sapiens HSPC265 mRNA. partial cds 
hypothetical protein FU11110 
Homo sapiens cDNA: FtJ22296 fis. clone H 
nuclear factor l/B 

Rho guanine exchange factor (GEF) 15 
ESTs 

UDP-N-acetyl-alpha-D-galactosamine:polyp 
ESTs 

gb:zl10a05^1 Soares_pregnanLuterus_NbH 
RAB6 interacting, kinesin-rlike (rabkines 
ESTs 
ESTs 

Homer, neuronal immediate early gene, 3 
complement component Clq receptor 
mlcrotubule^ssociated protein 16 
hypothetical protein FLJ22041 similar to 
Homo sapiens mRNA full length insert cDN 
ESTs 

KIAA1691 protein 

dipeptidyl peptidase 8 

ESTs, Moderately similar to 154374 gene 

Homo sapiens mRNA: cDNA DKFZp586N0121 (f 

unc5 (Celegans homolog) c 

gb2a46c1 1 .si Scares fetal iiver spleen 

ESTs 

gb:yi54c08.s1 Soares placenta Nb2HP Homo 
gb:ye50h09^1 Soares fetal liver spleen 
gb:zw37e02.s1 SoarBS_totaLfetus.Nb2HF8. 
ESTs, Weakly similar to 138022 hypotheti 
gb:ae54e06.s1 Stratagene lung carcinoma 
gb:C13961 Clontech human aorta polyA^-nlR 
v-ral simian leukemia viral oncogene horn 
hypothetical protein M6C12916 
sprouty (Drosophila) homolog 4 
ESTs, Moderately similar to KIAA1215 pro 
gb:yg05c08.r1 Soares infant brain 1NIB H 
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126511 


126511 


T92143 


Hs.57956 


EGF-TM7>latrophiiin-retated protein 


126649 


126649 


AA001860 


Hs.279531 


ESTs 


4496Q2 449602 


AA001860 


Hs.279531 


ESTs 


127402 


127402 


AA358869 


Hs^27949 


SEC13 (S. oerevisiae)-lil<e 1 


128992 


128992 


H04150 


Hs.107708 


ESTs 


129188 


129188 


NM_001078 


Hs.109225 


vascular oeD adhesion molecule 1 


129371 


129371 


X06828 


Hs.1 10802 


von Willebrand factor 


129765 


129765 


M86933 


Hs.1238 


amelogenin (Y chromosome) 


129884 


129884 


AF055581 


Hs.13131 


lysosomal 


130639 




AI557212 


Hs.17132 


ESTs, Moderately similar to 154374 gene 


130828 


130828 


AW631469 


Hs^03213 


ESTs 




13in80 


NM.001955 


Hs.2271 


endothelin 1 




1311fl2 


A1624144 


Hs.23912 


ESTs 




lOlOt o 


AA040311 


Hs.28959 


ESTs 


l«9 1 f «a) 




AA443966 


Hs.31595 


ESTs 


lO lOOl 


131fl81 


AW361018 


Hs.3383 


upstream regulatoiy element binding prot 




139nR3 


BE386490 


Hs^79663 


Pirin 






N^A_003542 


Hs.46423 


H4 histone family, member G 






AB011084 


Hs.48924 


KIAA0512 gene product; ALEX2 


132676 


132676 


N92589 


Hs^61038 


ESTs. Weakly similar to 138022 hypotheti 


170710 




Nl^004600 


Hs.554 


Sjogren syndrome antigen A2 (60kD, n1}on 


132760 


132760 


AA1 25985 


Hs.56145 


thymosin, beta, identified in neuroblast 






AF234532 


Hs.6163d 


myosin X 






AI186431 


Hs.296638 


prostate differentiation factor 


loo ID 1 


1331S1 


AW021103 


H5.6631 


hypothetical protein FU20373 




looofin 

IJO&DU 


AA403045 


Hs.6906 


Homo sapiens cDNA: FU23197 fis, done R 




I'W^QI 


BE619053 


Hs.170001 


eukaryotic translation initiation factor 




I'^'^'i'ift 


All 29903 


Hs.74669 


veside-associated membrane protein 5 (m 


lOOO 




NM_003003 


Hs.75232 


SEC14 (S. cerevisiaeHike 1 


irifiQi 




M85289 


Hs^11573 


heparan sulfate proteoglycan 2 (perlecan 


tooa 10 


1 oOv 1 o 


AU076964 


Hs.7753 


caiumenin 






L34657 


HsJ8146 


platelet/endothelial ceil adhesion molec 






A1379954 


Hs.79025 


KIAA0096 protein 






AW580939 


Hs.97199 


complement component Clq receptor 


11 RATH 


1 IDHf U 


AI272141 


Hs.83484 


SRY (sex determining region Y)-box 4 






AW968058 


Hs.923ai 


nudbc (nucleoside diphosphate linked moi 






W55956 


Hs.94030 


Homo sapiens mRNA; cDNA DKFZp5d6E1624 (f 


innii^ 


lUU 1 It 


X02308 


H5.82962 


fhymidylate synthetase 


inni^i 




AII076465 


Hs.278441 


KIAA0015 gene product 


innonA 
lUUZUo 


innonft 


NM 002933 


Hs.76224 


ribonudease, RNase A family, 1 (pancrea 


iQU4(k> 




AW2d1587 


Hs.82733 


nidogen 2 


1UU4dd 




AVV888941 


Hs.75789 


N^yc downstream regulated 


lUUDlO 


innRiA 

lUUu lO 


AI752163 


Hs.l 14599 


collagen, type Vlll, alpha 1 


lUUDOo 




U56725 


Hs.1 80414 


heat shock 70kD protein 2 


luU/ lo 


l\i\Jf 10 


BE295928 


Hs.75424 


inhU)itor of DNA binding 1, dominant neg 


4AAOOD 


1 nnftOA 


AL048753 


Hs.303649 


small indudble cytokine A2 (monocyte ch 




inOQ91 


J03836 


Hs.82085 


serine (or cysteine) proteinase Inhibito 


inmn 

lUl 11U 


101110 


AI439011 


Hs.86386 


myeloid ceH leukemia sequence 1 (BCL2-r 


ini i(\R 


1011<^R 


AA340987 


Hs.75693 


protylcarboxypeptidase (angiotenslnase C 




101164 


NM.001674 


Hs.460 


activating transcription f^dor3 




101^17 
lUl J If 


L42176 


Hs.8302 


four and a half LIM domains 2 
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caidtonin receptor-like 
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BE410405 


Hs.76288 


calpain 2, (m/ll) large subunit 
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XI 2784 


Hs 119129 


collagen, type iV, alpha 1 


101543 


101543 


M31166 


Hs,2050 


pentaxin-reialed gene, rapidly induced b 


101560 


101560 


AW958272 


Hs.347326 


intercellular adhesion molecule 2 


101592 


101592 


AF054853 


Hs.91299 


guanine nudeotlde binding protein (G pr 


101634 


101634 


AV650262 


Hs.75765 


GR02 oncogene 


101662 


101682 


AF043045 


Hs.81008 


filamin B. beta (actin-binding piDtein-2 


101720 


101720 


M69043 


Hs.81328 


nudear ^ctor of kappa Gght polypeptld 


101744 


101744 


AI879352 


Hs.1 18625 


hexokinase 1 


101837 


101837 


M92843 


H5.343S86 


zinc finger protein homotogous to Zfp-36 


101840 


101840 


AA236291 


H5.183583 


serine (or cysteine) proteinase Inhibito 


101864 


101864 


BE392568 


Hs.75777 


transgelin 


101966 101966 


X96438 


Hs76095 


immediate early response 3 


102013 


102013 


BE616267 


Hs.178452 


catenin (cadheiin-assodated protein), a 


102059 


102059 


AI752666 


Hs.76669 


nicotinaniide N-methyltransferase 


102283 


102283 


AW161552 


Hs.83381 


guanine nucleotide binding protein 11 


102378 


102378 


AU076887 


Hs.28491 


spermidkie/spennlne Nl-ace^transferase 


102460 


102460 


U48959 


Hs.211582 


myosin, light polypeptide kinase 


102499 


102499 


BE243877 


Hs.76941 


ATPase, Na+/K+ transporting, beta 3 poly 


102560 


102560 


R97457 


Hs.63984 


cadherin 13, H-cadheiin (hearQ 


102589 


102589 


AU076728 


H5.8867 


cysteine-rk:h, angibgente inducer, 61 


102645 


102645 


ALl 19566 


Hs.6721 


Ijnsosomal 


102693 


102693 


AA532780 


Hs.183684 


eukaryotic translatbn initiation factor 


102759 


102759 


NM_005100 


Hs.788 


A kinase (PRKA) anchor protem (gravin) 
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102882 102882 A1767736 Hs.290070 

102915 102915 X07820. Hs.2258 

102960 102960 AI904738 Hs.76053 

103020 103020 X53416 Hs.195464 

5 103036 103036 M13509 Hs.83169 

103080 103080 AU077231 Hs.82932 

103138 103138 X65965 

103195 103195 AA351647 Hs.2642 

103371 103371 X91247 Hs.13046 

10 103471 103471 Y00815 Hs.75216 

104447 104447 AW204145 Hs.l56044 

104783 104783 AA533513 Hs,93659 

104865 104865 T79340 Hs.22575 

104894 104894 AF065214 Hs.18858 

15 105113 105113 AB037816 Hs.8982 

105195 105196 W84893 Hs.9305 

105263 105263 AW388633 Hs.6682 

105330 105330 AW338625 Hs.22120 

105492 105492 AI805717 Hs.289112 

20 105594 105594 AB024334 Hs.25001 

105732 105732 AW504170 Hs.274344 

105882 105882 W46802 Hs.81988 

106031 106031 X64116 Hs.171644 

106222 106222 AA356392 Hs.21321 

25 106263 106263 W21493 Hs28329 

106366 106366 AA1B6715 Hs^3842g 

106634 106634 W25491 Hs.288g09 

106793 106793 H94997 Hs.16450 

106842 106842 AF1 24251 Hs.26054 

30 106890 106890 AA489245 Hs.88500 

106974 106974 A1817130 Hs.9195 

' 107051 107061 BE147611 Hs.6354 

107216 107216 D51069 Hs.211579 

107444 107444 W28391 Hs.343258 

35 108507 108507 AI554545 Hs.68301 

108931 108931 AA147186 

109195 109195 AF047033 Hs.132904 

109456 109456 AW956580 Hs.42699 

110411 110411 AW001579 Hs.9645 

40 110908 110908 AA035211 Hs.17404 

111091 111091 AA300067 Hs.33032 

111378 111378 AW160993 Hs.326292 

111769 111769 AW629414 Hs.24230 

112951 112951 AA307634 Hs.6650 

45 113195 113195 H83265 Hs.8881 

113542 113542 H43374 Hs.7890 
113847 113847 NM^005032 Hs.4114 

113947 113947 W84768 

115061 115061 AI751438 Hs.41271 
50 ' 115870 115870 NM.005985 Hs.48029 
116228 116228 AI767947 Hs.50841 
116314 116314 AI799104 Hs.178705 
117023 117023 AW070211 Hs.102415 
117156 117156 W73853 
55 117280 117280 M18217 Hs.172129 
119866 119866 AA496205 Hs.193700 
121314 121314 W07343 Hs.182538 
121822 121822 AI743860 
122331 122331 AL133437 Hs.110771 
60 123160 123160 AA488687 Hs.284235 
124059 124059 BE387335 Hs.283713 
124358 124358 AW070211 Hs.102415 
124726 124726 NM_003654 Hs.104576 
125167 125167 AL137540 Hs,102541 
65 125307 125307 AW580945 Hs.330466 
107985 107985 T40064 Hs.71968 
125598 125598 T40064 Hs.71968 
413731 413731 BE243845 Hs.75511 
116024 116024 AA088767 Hs.83883 
70 418000 418000 AA932794 Hs.83147 
126399 126399 AA088767 Hs.83883 
127566 127566 AI051390 Hs.1 16731 
128453 128453 X02761 Hs.287820 
128515 128515 BE395085 Hs.10086 
75 128623 128623 BE076608 Hs.l05509 
128669 128669 W28493 Hs.180414 



gelsolin (amyloidosis, Finnish type) 
matrix metalloproteinase 10(stromelysin 
DEAD/H (Asp-GIu-Ala-Asp/His) box polypep 
niamln A, alpha (actin-binding protein* 
matrix metalloproteinase 1 (interstiOal 
cyclin D1 (PRAD1: parathyroid adenomatos 
gb:H.sapiens SOD-2 gene for manganese su 
eukaryotic translation elongation factor 
thioredoxin reductase 1 
protein tyrosine phosphatase, receptor t 
ESTs 

protein disulfide isomerase related prot 

B-cell CLL/lymphoma 6. member B (zinc fi 

phospholipase A2, group IVC (cytosofic. 

Homo sapiens, done 1MAGE:3506202, mRNA. 

angiotensin receptor-lilte 1 

solute carrier family 7. (cationk; amino 

ESTs 

C6I-43 protein 

tyrosine 3-monoQxygenaseAiyptDphan 5-nio 
hypothetical protein MGC12942 
disabled (Drosophila) homolog 2 (mitogen 
Homo sapiens cDNA: FU22296 fis. done H 
Homo sapiens done FLB9213 PR02474 mRNA. 
hypothetical protein FIJ14005 
RIKEN cDNA 9130422N19 gene 
hypothetical protein FU22471 
ESTs 

novel SH2-containing protein 3 
mitogen-activated protein kinase 8 inter 
Homo sapiens cDNA FIJ13698 fis, clone PL 
stroma) ceD derived factor receptor 1 
melanoma cell adhesion molecule 
proliferation^assodated 2G4. 38kD 
ESTs 

gbUEo38d01.5l Stratagene endothelial eel 
solute carrier MIy 4, sodium bicarbon 
ESTs 

Homo sapiens mRT^ for K1AA1741 protein. 
ESTs 

hypothetical protein DKF2p434N185 
hypothetical gene DKFZp434A1 1 14 
ESTs 

vacuolar protein sorting 45B (yeast homo 
ESTs, Weakly similar to S41044 chromosom 
Homo sapiens mRNA for KIAA1671 protein, 
ptastin3(Tisofom)) 

gb:zh53d03^1 SoaresJetaUiver.spleen. 
Honx> sapiens mRI^ full length insert cDN 
snaQ 1 (drosophila homok)g), zinc finge 
ESTs 

Homo sapiens cDNA FUl 1333 fis. done PL 
Homo sapiens mRNA; cDNA DKF2p586N0121 (f 
ESTs 

Homo sapiens cDNA: FLJ21409 fis. done C 

Homo sapiens mRNA; cDNA DKF2p586l0324 (f 

phosphoRpld scramblase 4 

metallothionein IE (functional) 

Homo sapiens cDNA: FU21904 fis. clone H 

ESTs, Weakly simflar to 138022 hypothetl 

ESTs, Weakly similar to S64054 hypotheti 

Homo sapiens mRNA; cDNA DKF2p586N0121 (f 

carbohydrate (keratan sulfate Gal-6) sui 

netrin 4 

ESTs 

Homo sapiens mRNA; cDNA DKFZp564F053 (fr 
Homo sapiens mRNA; cDNA DKFZp564F053 (fr 
connective tissue growth factor 
transmembrane, prostate androgen induced 
guanine nucleotide binding protein>like 
transmembrane, prostate androgen induced 
ESTs 

fibronecfini 

type I transmembrane protein Fn14 
CTL2gene 

heat shock 70kD protein 8 
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128914 128914 AW867491 Hs.107125 plasmalemma vesicle associated protein 

129188 129188 NM-.001078 Hs.109225 vascular cell adhesion molecule 1 

129265 129265 AA530892 Hs.171695 dual specificity phosphatase 1 

129468 129468 AW410538 Hs.111779 secreted protein, acidic, cystelne-rich 

5 101838 101838 BE243845 Hs75511 connective tissue growth factor 

129619 129619 AA209534 Hs.284243 tetraspan NET-6 proteh 

129762 129762 AA453694 Hs.12372 tripartite motif protein TRIM2 

130018 130018 AA353093 metaltothionein 1L 

130178 130178 U20982 H5.1516 insuMike growth factor-binding prote 

10 130431 130431 AW505214 Hs.155560 calnexin 

130553 130553 AFa62649 Hs.252587 pituitary tumor-transforming 1 

130839 130639 AI5S7212 Hs.17132 ESTs, Moderately similar to 154374 gene 

130686 130686 BE546267 Hs.337986 Homo sapiens cONAFU10934fB, done OV 

130818 130818 AW190920 Hs.19928 hypothetical proteb)SP329 

15 130899 130899 A1077288 Hs.296323 serum^htcocortlcold regulated kinase 

131080 131080 NM»001955 Hs.2271 endotheiin 1 

131091 131091 AJ271216 Hs.22880 dipeptidylpeptldase III 

131182 131182 AI824144 Hs.23912 ESTs 

131319 131319 N^A_003155 Hs.25590 slanniocaldn 1 

20 131328 131328 AW939251 Hs.25647 v-fos FBJ murine osteosarcoma viral onco 

131328 131328 AW939251 Hs.25647 v-fos FBJ murine osteosarcoma viral onoo 

131555 131555 T47364 Hs.278613 interferon, alpha-lndudble protein 27 

131573 131573 AA040311 Hs.28959 ESTs 

131756 131756 AA443966 Hs,31595 ESTs 

25 131909 131909 NM-016558 Hs.274411 SCAN domain-containing 1 

132046 132046 AI359214 Hs.179260 chromosome 14 open reading frame 4 

132151 132151 BE379499 Hs.173705 Homo sapiens cDNA: FU22050 fis, don© H 

132187 132187 AA235709 Hs.4193 DKFZP58601624 protein 

132314 132314 AF1 12222 Hs.323806 pinin, desmosome assodated protein 

30 132398 132398 AA876616 Hs. 16979 ESTs. Weakly similar to A43932 mudn 2 p 

132490 132490 NM.001290 Hs.4980 UM domain binding 2 

132546 132546 M24283 Hs.168383 intercellular adhesion molecule 1 (CD54) 

132716 132716 BE379595 Hs.283738 casein kinase 1. alpha 1 

132883 132883 AA373314 Hs.5897 Homo sapiens mRNA; cDNA DKFZp586P1 622 (f 

35 132989 132989 AA480074 Hs,331328 hypotheUcal protein FU1321 3 

133071 133071 BE384932 Hs,64313 ESTs. Weakly similar to AF257182 1 6-pro 

133099 133099 W16518 Hs.279518 amytoid beta (A4) precursor-fike protein 

133149 133149 AA370045 Hs.6607 AXINl up-regulated 

133200 133200 AB037715 Hs.183639 hypottiettoal protein FU10210 

40 133260 133260 AA403045 Hs.6906 Homo sapiens cDNA:FIJ23ig7fls, done R 
133349 133349 AW6312S5 Hs.8110 L^ydroxyacylCoenzyme A dehydrogenase 
133398 133398 NM.000499 Hs.72912 cytochrome P4S0,sutyfamilyt (aromatic c 
133454 133454 BE547647 Hs.177781 hypothetk»l protein MGC561 8 
133491 133491 BE619053 Hs.1700O1 eukaryolk; translation initiation factor 
45 133517 133517 NM.000165 Hs.74471 gapjundlon protein, alpha 1,43kD (con 
133538 133538 NM«003257 Hs.74614 fight junction protein 1 (zona ocdudens 
133584 133584 D90209 Hs.181243 activating transcription fador 4 (tax-r 
133617 133617 BE244334 Hs.75249 ADP-ribosylatton fador-Iike 6 Interactl 
133671 133671 AW503116 Hs.301819 zinc finger protein 146 
50 133681 133681 AI352558 tyrosine 3-monooxygenase/tryptophan 5-mo 

133730 133730 BE242779 Hs.179526 upregulatedby U5-dihydroxyvitaminD-3 
133802 133802 AW239400 Hs.76297 G protein-coupled receptor kinase 6 
133838 133838 BE222494 Hs.180919 inhibitor of DMA binding 2, dominant neg 
133889 133889 U48959 Hs.211582 myosin, fight polypeptide kinase 
55 133975 133975 C18356 Hs.295944 tissue fador pathway inhibitor 2 
134039 134039 NM_002290 Hs.78672 laminin, alpha 4 
134081 134081 AL034349 Hs.79005 protein tyrosine phosphatase, receptor t 
134203 134203 AA161219 Hs.799 diphtheria toxin receptor (heparin-bindl 
134299 134299 AW580939 Hs.97199 complement component CI q receptor 
60 134339 134339 R70429 Hs.81988 disabled (Drosophlla) homolog 2 (mitogen 
134381 134381 AI557280 Hs.184270 capping protein (actin filament) musde 
134416 134416 X68264 Hs.211579 melanoma celt adheston molecute 
134558 134558 NIVL001773 Hs.85289 CD34 antigen 
134983 134983 D28235 Hs.196384 prostaglandin-endoperoxide synthase 2 (p 
65 135052 135052 AL136653 Hs.93675 decidual protein Induced by progesterone 

135069 135069 AA876372 Hs.93961 Homo sapiens mRNA; cDNA DKFZp667D095 (ffr 
135073 135073 W55956 Hs.94030 Homo sapiens mRNA; cDNADKFZp586E1 624 (f 
135196 135198 C03577 Hs.9615 myosin regulatory Ifght chain 2, smooth 
134404 134404 AB000450 Hs.82771 vacdnia related kinase 2 
70 100082 100082 AA130080 Hs.4295 proteasome(prosome.macropain)26Ssubu 
130150 130150 BE094848 Hs.15113 homogentlsate 1,2-dioxygenase (homogentl 
130839 130839 AB011169 Hs^0141 similar to S.cerevlsiaeSSM4 
100113 100113 NM.001269 Hs.84746 chromosome condensation 1 
100129 100129 AA469369 Hs.5831 tissue inhibitor of metaDoproteinase 1 
75 100169 100169 AL037228 Hs.82043 
100190 100190 M91401 Hs.178658 



01 23 gene produd 

RAD23 (S. cerovlsiae) homolog B 
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131319 


131319 


NwLOOOlOO 


Hs.2oo90 


stanniocatefn 1 


132316 


132316 


UZooOl 


ns.44ooo 


1^1 A A 4 Cil 4 MMtoiM 

KiAA1d41 protein 


134365 


134365 


AA568906 


Hs.82240 


oyiiioAiii wn 


102298 102298 


AA382169 


Hs!54483 


N-myc (and STAT) interactor 


302344 302344 


BE303044 


Hs.192023 


eukaryotlc translation initiation fedor 


102367 


102367 


U39656 


Hs.1 18825 


mitogen-activated protein kinase kinase 


102394 


102394 


NM_003816 


Hs.2442 


a distntegrin and metaltoproteinase doma 


129521 


129521 


AF071076 


Hs.1 12255 


nudeoporin 98kD 


102251 


102251 


NM«004398 


Hs.41706 


DEAD/H (Asp-Glu-Ala-Asp/His) box polypep 


133746 


133746 


' AW410035 


Hs.75862 


MAD (mothers against decapentaptegic. Dr 
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132828 132828 AB014615 Hs.57710 

132828 132828 AB014615 Hs.57710 

130441 130441 U63630 Hs.155637 

129350 129350 U50535 Hs.110630 

5 130457 130457 AB014595 Hs.155976 

102560 102560 R97457 Hs.63984 

134305 134305 U61397 Hs.81424 

132736 132736 AW081883 Hs.211578 

102663 102663 NM.002270 H5.16B075 

10 102735 102735 AF111106 Hs.3382 

101175 101175 U82671 Hs.369d0 

132164 132164 AI752235 Hs.41270 

102826 102826 NM-007274 Hs.8679 

102846 102846 BE264974 Hs.6566 

15 134161 134161 AA634543 Hs.79440 

302383 302363 AW1B3799 Hs.198365 

125701 125701 T72104 Hs.93194 

134656 134656 AI750878 Hs.87409 

102968 102968 AU076611 Hs.154672 

20 134037 134037 AI808780 Hs.227730 

103023 103023 AW500470 Hs.117950 

130282 130282 BE245380 Hs.153952 

128568 128568 H12912 Hs.274691 

103093 103093 S79876 Hs.44926 

25 129053 129063 X63094 Hs.283B22 

133227 133227 AW977263 Hs.68257 

103184 103184 U43143 Hs.74049 

103208 103208 AW411340 Hs.31314 

131486 131486 FC6972 Hs.27372 

30 103334 103334 NIVL001260 Hs.25283 

135094 135094 NlvL003304 Hs.250687 

103352 103352 H09366 H3.78B53 

132173 132173 X89426 Hs.41716 

131584 131584 AA598509 H3.29117 

35 103378 103378 AL119690 Hs.153618 

103410 103410 AA158294 Hs.295362 

103438 103438 AW175781 Hs.152720 

103452 103452 NM_006936 Hs.85119 

135185 135185 AW404908 Hs.96038 

40 134662 134662 NIVL007048 Hs.284283 

103500 103500 AW408009 H$.22580 

132084 132084 NA4_002267 Hs.3886 

133152 133152 Z11695 Hs,324473 

103612 103612 BE336654 Hs.70937 

45 103692 103692 AW137912 Hs.227583 

129796 129796 BE218319 Hs.5807 

132683 132683 BE264633 Hs.143638 

103723 103723 BE274312 Hs.214783 

133260 133260 AA403045 Hs.6906 

50 103766 103766 AI920783 Hs.191435 

132051 132051 AA393968 Hs.180145 

135289 135289 AW372569 Hs.9788 

103794 103794 AF244135 Hs.30670 

134319 134319 BE304999 Hs.285754 

55 119159 119159 AF142419 Hs.15020 

103850 103850 AA187101 Hs.213194 

322026 322026 AW024973 Hs.283675 

103861 103861 AA206236 Hs.4944 

447735 447735 AA775268 Hs.6127 

60 131236 131236 AF043117 Hs.24594 

129013 129013 AA371156 Hs.107942 

103988 103988 AA314389 Hs.342849 

425284 425284 AF155568 Hs.348043 

133281 133261 AK001601 Hs.69594 

65 108154 108154 NM.005754 Hs.220689 

135073 135073 W55956 Hs.94030 

129593 129593 A1338247 Hs.98314 

132064 132064 AA121098 Hs.3838 

131427 131427 AF151879 Hs.26706 
70 104282 104282 C14448 Hs.332338 
130443 130443 D25216 Hs.155650 
132837 132837 AA370362 Hs.57958 
104334 104334 D82614 Hs.78771 
134731 134731 D89377 Hs.89404 
75 131670 131670 H03514 Hs.15589 
104402 104402 H56731 Hs.132956 



fibroblast growth factor 8 (androgen-ind 
fibroblast growth fedor 8 (androgen-ind 
protein kinase, DNA-adivated, catalytic 
Human BRCA2 regfon. mRNA sequence CG006 
culfin 4B 

cadherin 13. H-cadherin (heart) 
ubiquitln-like 1 (sentrin) 
Homo sapiens cDNA: FU23037 fis, done L 
karyopherin (Innportin) beta 2 
protein phosphatase 4, regulatoiy subuni 
melanoma antigen, family A. 2 
procollagen-lysine, 2-oxogtutarate 5-dio 
cytosolic acyl coenzyme A thioester hydr 
thyroid hormone receptor interactor 13 
IGF-II mRNA-binding protein 3 
2,3-btsphospho9tycerate mutase 
apolipoprotein A-l 
Ihrombospondin 1 

methylene tetrahydrofolatB dehydrogenase 

integrin. alpha 6 

multifunctional polypeptide similar to S 
5' nucleotidase (CD73) 
adenylate kinase 3 

dipeptidylpeptldase IV (CD26. adenosine 
Rhesus bk)od group. D antigen 
general transcription factor IIP, polype 
fms-reiated tyrosine kinase 4 
retinoblastoma-binding protein 7 
BIVIX non-receptor tyrosine kinase 
cydin-dependent kinase 8 
transient receptor potential channel 1 
uracii-DNA glycosytase 
endothelial cell-specific molecule 1 
purine-rk:h element binding protein A 
HCGVIIM protein 

DRI-assodated protein 1 (negative cofac 

M-phase phosphoprotein 6 

SMr3 (suppressor of mif two 3. yeasQ ho 

Rio (DrosophllaHlke. expressed In many 

butyrophllin. subfamily 3, member A1 

alkylglyoerone phosphate synthase 

kaiyopherin alpha 3 (importin alpha 4) 

mitogen-adivated protein kinase 1 

H3 histone family, member A 

Homo sapiens chromosome X map Xpl 1 .23 L- 

GTPase Rab14 

WD repeat domain 4 

Homo sapiens cDI^ FLJ14041 fs, done HE 
Homo sapiens cDNA: FLJ23197 fis. done R 
ESTs 

HSPG030 protein 

hypothetK:ai protein MGC10924 similar to 
hepatocellular carcinoma-assodated anti 
fumarate hydratase 

homolog of mouse quaking QKI (KH domain 

hypothetical protein MGC10895 

NPD009 protein 

hypothetical protein FU12783 

Homo sapiens cDUA: FLJ23020 fis. ctone L 

ubiquitination fador E4B (hbmotogous to 

DKFZP564M112 protein 

ADP-ribosylatton fador-fike 5 

NSI-assodated protein 1 

high-moblDty group 2QA 

Ras-GTPase4u:tlvatlng protein SH3-domaln 

Homo sapiens mRNA; cDNA DKFZp586E1624 (f 

Homo sapiens mRNA; cDNA DKFZp586L0120 (f 

serum-indudble kinase 

CGM21 protein 

EST 

KIAA0014 gene produd 

EGF-TM7-latrophilln-reteted protein 

phosphogiycerate kinase 1 

msh (DrosophQa) homeo box homok)g 2 

ESTs 

ESTs 
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25 
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40 



45 



50 



55 



60 



65 



70 



75 



129077 


190077 


W7A794 

IN f *f 1 C.H 


ns. iU04/ y 


PQTo 
CO 1 S 








nS.y iiiyD 


integnn, alpha 8 


4*>AAQQ. 


1 04430 


AWz4d^/J 


HS.04131 


tnreonyl-lRNA synthetase 


iU44oo 


104488 


N56191 


Hs.l 06511 


protocadherin 17 


129214 


129214 


AL044335 


ns.1 09526 


zinc finger protein 198 


104530 


104530 


AKUUjo/O 


nS.lz4o7 


nypotneiical protem rU10ol4 


104544 


104544 


AlUy llM 


nS.^Z23o2 


coTs, Weakly similar to p40 [H.sapiensj 


104567 


104567 


AA040620 


Hs.5672 


hypothetical protein AF140225 


129575 


129575 


r0o2o4 


nS.27o4Zo 


progestin induced protein 


104599 


104599 


AW8 15036 


Hs.151251 


ESTs 


104667 


104667 


AI239923 


Hs.63931 


ESTs 


104764 -104764 




nS.2ro5o5 


CblS 


104787 


104787 


AA027317 




9b:ze97a1 1.s1 Scares_fetaLneait.NbHH19W 


104804 


104804 


Alo5o702 


Hs.31o03 


ESTs, Weakly similar to N-WASP [H.sapien 


130828 


130828 


AWo314d9 


Hs.203213 


ESTs 


104943 


104943 


Ar072873 


o A AAA nA a 

Hs.1 14218 


frizzled (Drasophila) homolog 6 


105024 


105024 


AA1 26311 


Hs.9879 


ESTs 


105038 105038 


AVV503733 


ns.9414 


A AAAt%ti ■ - - * - * 

KiAA14B8 protein 


105096 


105096 


AL04250D 


Hs.21599 


KruppeMike factor 7 (ubkiuitous) 


105169 


105169 


BE245294 


Hs.1 80789 


SI 64 protein 


130401 


130401 


BE398283 


Hs.1 73987 


eukaryotic translatbn initiation factor 


130114 


130114 


AA233393 


nS.14992 


Yiypotneucal protein FU1 1 151 


105337 


105337 


AI4do7o9 


LJm 0A?>I 07 

nS.347187 


myotubutarin related protein 1 


105376 


105376 


AW994032 


Hs.8768 


hypothetical protein FLJ10849 


131962 


131962 


AK000046 


Hs.343877 


hypothetical protein FLJ20039 


128658 


128658 


BE397354 


1 1^ tr%A nm 

Hs.324830 


diptheria toxin resistance protein requi 


105508 


105508 


AAI 73942 


Hs.326416 


Homo sapiens mRNA; cDNA DKFZp564H1916 (f 


135172 


135172 


A Dnooncc 
Ad02o9oq 


|_|_ 4 nA A A 

nS.12144 


KIAA1033 protein 


132542 


132542 


AL1 37751 


nS.2o3671 


Homo sapiens mRNA; cDNA uKrZp43410812 (f 


105659 


105659 


A A OOOAil A 

AA283044 


Hs.25525 


hypothetical protein FU11323 


105674 


105674 


AI609530 


Hs.279789 


histone deacefylase 3 


105722 


105722 


AI922821 


Hs.32433 


ESTs 


115951 


115951 


BE546245 


Hs.301048 


sec13-)ike protein 


105985 


105985 


AA406610 




gb:2v15b10.s1 Soares_NhHMPu_S1 Homosapi 


131216 


131216 


A)8 15486 


Hs.243901 


Homo sapiens cONA F1J20738 fis, done HE 


113689 


113689 


AB037850 


Hs.16621 


DKFZP434I1 16 protein 


130839 


130839 


AB01 1 169 


Hs.20141 


sImDar to S. cerevlsiae SSM4 


130777 


130777 


AW1 35049 


Hs.26285 


Homo sapiens cDNA FU 10643 lis, done NT 


106196 


106196 


AA525993 


Hs.1 73699 


ESTs, Weakly sln^ar to ALUl^HUMAN ALU S 


133200 


133200 


AB037715 


1 lino Am 

Hs. 183639 


hypothetical protein FUl 0210 


108326 


106328 


AL079559 


Hs^8020 


KIAA0766 gene product 


108423 


106423 


AB020722 


Hs.16714 


Rho guanine exchange fador (GEF) 15 


439608 439608 


AW864696 


Hs.301732 


hypothetical protein MGC5306 


106503 


106503 


A nnAA#%A A 

AB033042 


Hs.29679 


* ^ * J # A AM. ^ ^ Mt 

cofactor required for Spl transcrlptiona 


106543 


106543 


AA676939 


Hs*69285 


neuropifin 1 


106589 106589 


AK000933 


Hs.28661 


Homo sapiens cDNA FU10071 fis. done HE 


106596 


106596 


AA452379 




AAJ^ J.t. * vt M.^ All >«* 1 II tA A A A 1 A 

ESTs. Moderately stmHar to ALU7.HUMAN A 


106636 


106636 


AW958037 


Hs.288 


ribosomal protein L4 


131353 


131353 


AW754182 




gb:RC2-CT0321-131 199-01 1-c01 CT0321 Homo 


131710 


131710 


NMj0153d8 


ns.30985 


pannexin 1 


131775 


131775 


ABO 14548 


ns.31921 


KIAA0648 protein 


106773 


106773 


A A A^Q^nn 
AA47oi09 


HS.1 00033 


ESTs 


106817 


106817 


Ud121d 


nS.1oD72 


ESTs 


106848 


106848 


A A il AnnA A 

AA449014 


Hs.l 21 025 


chromosome 11 open reading frame 5 


418699 


418699 


Dc5o9o39 


MS. 173030 


ESTs, Weakly similar to ALUo_HUMAN ALU S 


130638 


130638 


AlAlrtOHOTC 

AWUzlzro 


U» 47404 

nS.1/ 121 


tSTs 


107059 


107059 


Bcd1441U 


nS.2ou44 


RAD51 (S. cerev'isiae) homolog (E coli Re 


107115 


107115 


BE379623 


LJiM. 0*»AA.^ 

Hs.27693 


peptldylprolyt isomerase (cydophiiinH 


107156 


107156 


AAA OTAjIO 

AAI 37043 


Hs.9663 


programmed cell death 6-interacting prot 


130621 


130621 


AW513087 


Hs.l 6803 


LUC7 (S. cerevlsiae)-like 


132626 


132626 


AW504732 


nS.Z1275 


hypothetical protein FLJ1101 1 


131610 


131610 


AA357879 


Hs.29423 


scavenger receptor with C-type lectin 


107295 


107295 


AA1 86629 


Hs.80120 


UDP-N-acetyl-alpha-D-galadosamine:poiyp 


107315 


107315 


AA3 16241 


Hs.90691 


nudeophosmin/nudeoplasmin 3 


107328 


107328 


AW959891 


Hs.76591 


KIAA0887 protein 


134715 


134715 


U48263 


Hs.69040 


prepronociceptin 


129938 


129938 


AW003668 


Hs.1 35587 


Human done 23629 mRNA sequence 


130074 


130074 


AL03o59d 


HS.250745 


__1 #r^AIA% III #r%AI A -l*-^_A ^\ ^AAI- 

polymerase (RNA) III (DMA directed) (62k 


132036 


132036 




ns.o/ r UD 


nypotneucai protein DKr£p4o4b222u 


113857 


113857 


AW243158 


Hs.5297 


DKFZP564A2416 protein 


130419 130419 


AF037448 


Hs.155489 


NSI-assodated protein 1 


132616 


132616 


BE262677 


Hs.283558 


hypothetical protein PR01855 


132358 132358 


NM_003542 


Hs.46423 


H4 histone family, member G 


125827 


125827 


N1VL003403 


Hs.974g8 


YY1 transcriptkm (actor 


107609 


107609 


R75654 


Hs.164797 


hypothetica) protein FU13693 


107714 


107714 


AA015761 . 


Hs.60642 


ESTs 
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107832 


107832 


AA021473 




gb:ze66c11.s1 Scares retina N2b4HR Homo 




124337 


N23541 


Hs 281561 


Homo sapiens cDNA: FLJ23582 fis. clone L 


129577 


129577 


N75346 


Hs.305121 


CDC20 (ceil division cyde 20. S. cerevi 


132000 


132000 


AVV247017 


Hs^697& 


melanoma antigen, feimily A, 3 


107935 


107935 


AA029428 


Hs.61555 


ESTs 


131461 


131461 


AA992841 


Hs.27263 


KIAA1458 protein 


108029 


108029 


AA040740 


Hs.62007 


ESTs 


108084 


108084 


AA058944 


Hs.1 16602 


Homo sapiens, done IMAGE:4154008, mRIMA, 


IflftIRR 

lUO lUO 


108168 


AI453137 


Hs 63176 


ESTs 


lUO 109 


108180 
lUO 109 


tvn^t QUO 1 


Hs 63335 


ESTs. Moderately similar to A46010 X-lin 




108903 


AW847814 


Hs 289005 


Homo sapiens dDNA: FU21532 fis, done C 


i 011917 


108917 
111041/ 


AA058888 


Hs 62588 


ESTs 


10A977 


10R977 


AA08485Q 




gb:zm50f03^1 Stratagene fibroblast (937 


lOft^OQ 


10R30Q' 


AAQfiQ818 




gb:zm67e03.r1 Stratagene neuroepithelium 


lUOOtU 


10A340 


AAflfiQ890 


Hs 180909 


DfiiDxiredoxin 1 


4AQA97 


inRA97 


AAfi7fi3a9 




gb:2m91g08.s1 Stratagene ovarian cancer 


1 110*109 




AA078986 




gb:2m92h01.s1 Stratagene ovarian cancer 


lOA^RQ 
lUOtO? 


10R4RQ 
IU0*IQ9 


AA07Q487 




gb:znt97ro8.s1 Stratagene colon HT29 (937 


lORRAI 
lUOOvl 


108(;01 


AA083256 




gb:zn06g12.s1 Stratagene hNT neuron (937 


4nR(;A9 


108569 


AA1QQ79B 




gb:zm26c06.s1 Stratagene pancreas (93720 






AI907537 


Hs 76698 


stress-associated endopl&srnic reticulum 




130385 


AW067800 


Hs.1 55223 


stanniocaldn 2 


108807 


108807 


A1652236 


Hs.49376 


hypothetical protein FU20644 


108833 


108833 


AF18B527 

r«i ivww4w' 


Hs.61661 


ESTs, Weakly similar to AF1 74605 1 F-box 


lUOCfD 




Al 117459 


Hs 44155 


DKFZP586G1517 protein 




131474 


I 4R353 


Hs.2726 


high-mobility group (nonhistone chromoso 




108CM1 


A Al 48850 




gb:zo09e06.s1 Stratagene neuroepithelium 






A\A/QQ5R10 


Hq 339436 


EST 


10 1 100 


131183 


Alfi11807 


Hs 285107 


hypothetical protein FLI13397 




1 no 099 


AA1 57991 

r\M lOf il*7 1 


Hs.21479 


ubfnuclein 1 




10QOR8 


AA1fi4?<}3 


Hs 72545 


ESTs 




19Q091 


AL044675 


Hs.173081 


KIAA0530 protein 


10Q148 




AA1765B9 


Hs.1 42078 


EST 


1 31 080 


131080 


NM 001955 


Hs.2271 


endotheCn 1 


•moo 00 


100999 


AA1Q9833 
n/\ 1 9^0J0 


Hs 333512 


similar to rat mvomeaalln 


10Qd81 
1 U940 1 


100481 


AA878923 


Hs 289069 


hypothetical protebi FLJ21016 


109516 


109516 


A1471639 


Hs.71913 


ESTs 


109556 


109556 


AI925294 


Hs.87385 


ESTs 


109578 


109578 


FQ2208 


Hs.27214. 


ESTs 


109625 


109625 


H294g0 


Hs.22697 


ESTs 


109648 


109648 


H17800 


Hs.7154 


ESTs 


109699 


109699 


H18013 


Hs:i67483 


ESTs 


109933 


109933 


R52417 


Hs.20945 


Homo sapiens clone 24993 mRNA sequence 


110039 


110039 


H1 1938 


H5.21907 


histone ace^transferase 
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TABLE 2A 

Table 2A shows the accession numbers for those pkeys lacking unigenelD*s for Table 2. The pkeys in Table 7 lacking unigenelD's are represented within 
Tables 1-6A. For each probeset we have listed the gene duster number from which the oligonudeotides were designed. Gene dusters were compfled 
using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the "Accession" 
column. 



Pkey: Unk)ue Eos probeset Mentifier number 

CAT number Gene duster number 
Accession: Genbank accession numbers 



Pkey 



CAT Number Accession 



108469 
108501 
108562 
101300 



125565 
132983 



121335 
130018 
121822 



108309 
107832 
123523 
123964 
118475 
104787 
106595 
113947 
108277 



117156 145392 J 



1704098J 
11922.1 



133681 13893J 



1 16761.1 AA079487 AA128547 AA128291 AA079587 AA079600 

13684.-12 AA083256 

36375,1 AA100798 AF020589 AA074629 AA075946 AA100849 AA085347 AA126309 AA079311 AA079323 AA085274 

_1 BE53551 1 M62098 AA305787 AW891766 AA348998 AA338869 AA344013 AW956561 AW389343 AW403607 L40391 

AW408435 AA121738 AI568978 H13317 R20373 AW948724 AW948744 AA335023 AA436722 AA448590 C21404 
AW884390 AA345454 AA303292 AA174174 BE092290 T90514 AA035104 R76028 AA126924 AA741086 AW022056 
AW1 18940 AA121666 A)832409 AA683475 AI140901 AI623576 AW51 9064 AW474125 Ai953923 AI735349 AW1 50109 
AI436154 AW1 18130 AW270782 AI804073 N27434 AA876543 AA93781 5 AI051 1 66 AA505378 AI041975 AI335355 
AI089540 AA662243 AI127912 AI925604 AI250880 AI366874 At564386 A1815196 AI683526 AI435885 AI160934 H79030 
AI801493 AA448691 AI673767 AI076042 AI804327 AA813438 AA680002 A1274492 T16177 AI287337 AI935050 
AA907805 AA911493 AI589411 A1371358 AW576236 AI078866 AW516168 AA346372 AI560185 AA471009 R75857 
AA296025 AA5231 55 AA853168 AI6g6593 AI658482 AI566601 AW072797 AA128047 AA03S502 AW243274 AA992517 
R43760 

W73853 AA9281 12 W77887 AW889237 AA148524 AI749182 AI754442 AI338392 AI253102 AI079403 Ai370541 AI697341 
H97538 AW188021 AI927669 W72716 AI051402 AI188071 AI335900 N21488 AW770478 W92522 AI691028 AI913512 
AI144448 W73819 AA604358 N28900 W95221 A1868132 H98465 AA148793 
R20840R20839 

M30269 Nl^002508 X82245 AI078760 AW957003 D78945 M27445 AA650439 AL048816 AV660256 AV660347 
AA333052 BE295257 T60999 AA383049 AW369677 Z26985 AW175704 AA343326 AW747957 AI818389 W17308 
W17302 H15591 AA371284 AA370412 W94966 BE384365 T28498 R80714 R16959 H21723 AWB35154 D56097 D56381 
W21232 AAig0565 AW379755 AW067895 

AI352558 Z82248 X78138 NIW.003405 AU077248 AA223125 S80794 D78577 AI124697 AW403970 BE614089 BE296713 
BE621334 L20422 X80536 D54224 D54950 X57345 N29226 AA127798 AA340253 F08031 AA192540 H67636 AA321827 
AW950283 AA084159 BE538808 AW401377 AA256774 C03366 W46595 W47608 AA305009 H69431 H69456 AL120082 
H11706AA303717 AA361357 H22042 H78020AW999584 AA134368AA322911 AA322961 H60980 N85248 N31547 
H79624 T1 1718 W85826 AW894663 AW894624 BE167441 BE170015 AA304626 AW602163 AW998929 AA156681 
AA151067 BE002724 AA608688 H82692 BE155392 AW383636 BE155394 AA487004 AW383504 AI342365 R82553 
W16498 BE155344 AI143938 R69901 AA322873 AW340648 R25364 AA357935 AI559406 AA033522 AA374252 
AW835019 AI922133 AI697089 N99662 AW189078 AI199076 AW151598 W59944 AA662875 W94022 AA299055 
AI039008 AI829449 AA583503 AI635674 AW131665 AI473820 AW2731 18 AW900930 AA908944 AI688035 AW170272 
AI082545 AW468176 AI608761 A1082748 AI911682 AI248943 A1831016 AA192465 AI218477 AA938406 AA385288 
AI809817 AA905196 AI191245 AI470204 AI188296 AI421367 AI125315 AI087141 AA629032 AA740589 AI554181 
AA150830 A1248541 AI077943 AA775958 AA864930 AI261476 AI123121 AI310394 AA862331 AA872478 BE537084 
AI205606 AA720684 AI872093 AW150042 AL120538 AA219627 AA988608 C21397 AI359337 H25337 AI089749 
AA605146 AI359620 AA150478 A1359738 AW383642 AW995424 AI766457 R56892 AI089a39 W61343 N69107 W46459 
AA565955 N20527 AI279782 W46596 AA776573 H23204 A(866231 AI083995 N2 1 530 AA 126874 D82630 W65437 
AI086917 AW382095 AI086877 H69844 AW340217 W85827 L0B439 AA262704 AA505380 W47413 W94135 AA223241 
AW089153 AA084101 BE538000 AA096126 T28031 AA491574 R84813 AA774536 AW383522 AA155615 AW383529 
AA491520 AW028427 AA171496 AI469689 AW664539 AI81 1102 AI811116 BE464590 BE350791 H78021 T15405 H21979 
AA219489 H13301 AA505883 AI864305 AI423963 AW084401 F04963 R69858 H67097 AI917740 A1655561 H69B64 
AA033631 AW383484 Ai886261 H25293 AA513281 AW271187 H1 1617 N79982 AI174338 AI904207 AI904208 BE614558 
W94127 W65436 AI272249 AA700018 AI579932 AI085941 AW152629 
AA404418AI217248 

AA353093 AW957317 AW872498 AI560785 AI289110 AW135512 X97261 T68873 

AI743860 N49543 AW027759 BE349467 A1656284 BE463975 R35022 AA370031 AW955302 AL042109 N53092 AI611424 
AL079362 AI969290 AI928016 BE394912 BE504220 BE467505 AI611611 AI61 1 407 AI61 1452 W56437 Ai284566 
AI583349 AW183058 AI308085 AI074952 AA437315 AA628161 AW301728 AI150224 AA400137 AA437279 AI223355 
AA639462 AI261373 AI432414 AI984994 AI539335 AA401550 AA358757 AI609976 AA442357 AA359393 AA437046 
AA370301 AA429328 AW272055 AI580502 AI832944 AI038530 AA425107 AI014986 AI148349 AW237721 AW779756 
AW137877 AI125293 AA400404 R285S4 
AA069818 AA069971 AA069923 AA069908 
genbanK.AA021473 AA021473 
genbank_AA608588 AA608588 
genbanleC13961 C13961 
genbanleN66845 N66845 
genbanK.AA027317 AA027317 

304084 J AI583948 AA578212 AW303715 AA653450 AA456981 AI400385 W88533 AI224133 AW272145 AA088686 R94698 

genbanl^W84768W84768 

genbanKJ\A064859 AA064859 



279548_1 
18986 1 
244391J 



111495_1 



118 



wo 02/079492 



PCT/US02/04915 



108427 genbanMA076382 AA076382 

108439 genbanK_M078986 AA078986 

1 31 353 231 290 1 AW411259 H23555 AW015049 A1684275 AW015886 AW068953 AW014085 AI027260 R52686 AA918278 AI129462 

AA969360 N34869 AI948416 AA534205 AA702483 AA705292 

101447 entre?_M21305 M21305 

108931 genbanK_AA147186 AA147186 

108941 genbanK_AA148650 AA14B650 

103138 entre2^X55965 X65965 

1 19174 genbanl^R71 234 R71234 

119416 genbanK.T97186 T97186 

105985 genbanKJ\A40861O AA406610 

100327 entre^D55640 D55640 
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Pkey: Unique Eos probeset identifier number 

Accession: Accession number used for previous patent filings 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigeneiD: Unfgene number 

Unigene Title: Unigene gene title 



Pkey Accession ExAccn UniGene UnigeneTttle 



100405 
100420 
100481 
100484 
100718 
100991 
101097 
101168 
101194 
101261 
101345 
101447 
101485 
101543 
101550 
101560 
101674 
101714 
101741 
101838 
101857 
102012 
102024 
102164 
102241 
102283 
102303 
102564 
102663 
102759 
102778 
102804 
102887 
102898 
102915 
103036 
103037 
103095 
103158 
103166 
103185 
103280 
103554 
103850 
104465 
104592 
104764 
104786 
104850 
104865 
104894 
104952 
104974 
105178 
105263 
105330 
105376 
105729 
105826 
105977 
106008 
106031 
105124 



D86425 
D86983 

HG1098-HT1098 
HG1103-HT1103 
HG3342-HT3519 
J03764 
L06797 
L15388 
L20971 
L35545 
L76380 
M21305 
M24736 
M31166 
M31551 
M32334 
M61916 
M68874 
M74719 
M92934 
M94856 
U03057 
U03877 
U18300 
U27109 
U31384 
U33053 
U59423 
U70322 
U81607 
U83463 
U89942 
X04729 
X06256 
X07820 
X54925 
X54936 
X60957 
X67235 
X67951 
X69910 
X79981 
Z18951 
AA187101 
N24990 
R81003 
AA025351 
AA027168 
AA040465 
AA045136 
AAO54087 
AA071089 
AA085918 
AA1874gO 
AA227926 
AA234743 
AA236559 
AA292694 
AA398243 
AA406363 
AA411465 
AA412284 
AA423987 



AW291587 Hs.82733 
D86983 Hs.1 18893 
X70377 Hs.121489 
N^A„005402Hs.288757 
BE295928 Hs.75424 
J03836 Hs.82085 
BE245301 Hs.89414 
NWL005308HS.211569 
L20971 HS.18B 
D30857 Hs.82353 
NM^005795Hs.152175 
M21305 

AA2g6520 Hs.89546 
M31166 Hs.2050 
Y00630 Hs.75716 
AW958272 Hs.347326 
NM^002291Hs.82124 
M68874 Hs.211587 
NM_003199Hs.326198 
BE243845 Hs.75511 
BE550723 Hs.153179 
6E259035 Hs.118400 
AA301867 Hs.76224 
NIVL000107HS.77602 
NM_007351Hs.268107 
AW161552 Ks.83381 
U33053 Hs.2499 
U59423 Hs.79067 
NM_002270Hs.168075 
NM 005100HS.788 
AF000652 Hs.8180 
NIVL002318HS.83354 
J03836 Hs.82085 
NM_002205Hs.149609 
X07820 Hs.2258 
IVI13509 Hs.83169 
BE018302 Hs.2894 
NN/L005424HS.78824 
BE242587 Hs.l 18651 
AA159248 Hs.18O909 
NiVL006825Hs.74368 
U84722 HSJ6206 
A1878826 Hs.74034 
AA187101 Hs.213194 
Z44203 Hs.26418 
AW630488 Hs.25338 
AI039243 Hs^78585 
AA027167 Hs.10031 
AL133035 Hs.8728 
T79340 Hs.22575 
AF085214 Hs.18858 
AW076098 Hs.345588 
Y12059 Hs.278675 
AA313825 Hs.21941 
AW388633 Hs.6682 
AW338625 Hs.22120 
AW994032 Hs.8768 
H46612 Hs.293815 
AA478756 Hs.194477 
AK001972 Hs.30822 
AB033888 Hs.8619 
X64116 Hs.171844 
H93356 Hs.7567 



nidogen 2 

Metanoma associated gene 
cystatin D 

v-ral simian leukemia viral oncogene horn 
Inhibitor of DNA binding 1 , dominant neg 
serine (or cysteine) proteinase inhtbito 
chemokine (C-X-C motif), receptor 4 (f us 
G protein-coupled receptor kinase 5 
phosphodiesterase 4B. cAMP-specific (dun 
protein C receptor, endothelial (EPCR) 
calcitonin receptor-like 
gb:Human alpha satellite and satellite 3 
setedin E (endothelial adhesion motecul 
pentaxin-related gene, rapidly induced b 
serine (or cysteine) proteinase inhibito 
intercellular adhesion molecule 2 
laminin, beta 1 

phospholipase A2, group IVA (cytosolk^ 
transcription factor 4 
connective tissue growth factor 
fatty acid binding protein 5 (psoriasis- 
singed (Drosophia)-(ike (sea uichln fas 
EGF-contalning fibutin-like extracettula 
damage-specific DNA binding protein 2 (4 
muttimerin 

guanine nucleotide binding protein 11 

protebi kinase Olike 1 

MAD (mothers against decapentaplegk;, Dr 

karyopherin (importin) beta 2 

A kinase (PRKA) anchor protein (gravin) 

syndecan binding protein (syntenin) 

lysyt oxklase-iike 2 

serine (or cysteine) proteinase inhibito 
integrin. alpha 5 (fibronectin receptor, 
noatrix metalloproteinase 10 (stromelysin 
matrix metalloproteinase 1 (interstitial 
placental growth factor, vascular endoth 
tyrosine kinase with immunoglobulin and 
hematopoletically expressed homeobox 
peroxlredoxin 1 

transmembrane protein (63kD), endoplasmi 
cadherin 5. type 2, VE-cadherin (vascula 
caveolin 1, caveotae protein, 22kD 
hypothetical protein MGC10895 
ESTs 

protease, serine, 23 
ESTs 

KIAA0955 protein * 
hypothetteal protein DKFZp434G171 
B-ceD CLUIymphoma 6, member B (zinc fi 
phospholipase A2, group IVC (cytosoBc, 
desmoplakInpPI.DPIf) 
biomodomaln-oontaining 4 
AD036 protein 

solute carrier family 7, (cationic amino 
ESTs 

hypothetical protein FU 10849 

Homo sapiens HSPC285 mRNA. partial cds 

E3 ubiquitin ligase SMURF2 

hypothetical protein FU11110 

SRY (sex detennlning region Y)-box 18 

Homo sapiens cDNA: FLJ22296 fis, done H 

Homo sapiens cDNA: FU21962 fis, ctone H 
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106155 AA425309 
106302 AA435896 
106423 AA448238 
106793 AA478778 
107174 AA621714 
107216 D51069 
107295 T34527 
107385 U97519 
108756 AA127221 
108846 AA132983 
108888 AA135606 
109001 AA156125 
109166 AA179845 
109456 AA232645 
109768 F10399 
110107 H16772 
110906 N39584 
110984 N52006 
111006 N53375 
111018 N54067 
111133 N64436 
111760 R26892 
113073 T33637 
113195 T57112 
113923 W80763 
114521 AA046808 
115081 AA253217 
115098 AA255991 
115145 AA258138 
115819 AA426573 
115947 AA443793 
116314 AA490588 
116339 AA496257 
116430 AA609717 
116589 D59570 
116733 F13787 
117023 H88157 
117186 H98988 
117563 N34287 
117997 N52090 
118475 N66845 
118581 N68905 
119073 R32894 
119155 R61715 
119174 R71234 
119221 R98105 
119416 T97186 
119866 W80814 
121335 AA404418 
121381 AA405747 
123160 AA488687 
123473 AA599143 
123523 AA608588 
123533 AA608751 
123964 C13951 
'124006 D60302 
124315 H94892 
124659 N93521 
124669 N95477 
124847 R60044 
124875 R70506 
125091 T91518 
125103 T95333 
125355 R45630 
125565 R20839 
125590. R23858 
126511 AI024874 
126563 W26247 
126649 AA856990 
126872 AA136653 
127402 AA358869 
127651 AI123976 
127759 AI369384 
128062 AA379500 
128992 R49893 
129046 AA195678 



AA425414 Hs.33287 
AA398859 Hs.18397 
AB020722 Hs.16714 
H94997 Hs.16450 
BE122762 Hs.25338 
D51069 Hs.211579 
AA186629 Hs.80120 
NM-005397HS.16426 
AA127221 Hs.117037 
AL117452 Hs.44155 
AA135606 Hs.189384 
AI056548 H5.72116 
AA219691 HS73625 
AW956580 Hs.42699 
F06838 Hs.14763 
AW151660 Hs.31444 
AA035211 Hs.17404 
AW613287 Hs.80120 
BE387014 Hs.166146 
A)287912 Hs.3628 
AW580939 Hs.97199 
BE551929 Hs.268754 
N39342 Hs.103042 
H83265 Hs.8881 
AW953484 Hs.3e49 
AW139036 Hs.108957 
AI751438 Hs.41271 
AI583069 Hs.175319 
AA740907 Hs.88297 
AA486620 Hs.41135 
R47479 Hs.94761 
AI799104 Hs.178705 
AK000290 Hs.44033 
AK001531 Hs.66048 
A1557212 Hs.17132 
AL157424 Hs.61289 
AW070211 Hs.102415 
H98988 Hs.42612 
AF055634 Hs.44553 
'N52090 Hs.47420 
N66845 
N66905 

BE245360 Hs.279477 
R61715 Hs.310598 
R71234 

C14322 H8.250700 
T97186 

AA496205 Hs.193700 
AA404418 

AW088642 Hs.97984 

AA4886d7 Hs.284235 

AA599143 

AA608588 

AA608751 

CI 3961 

AI147155 Hs.270016 
NIVL005402HS.288757 
AI680737 Hs.289068 
AI571594 Hs.102943 
W07701 Hs.304177 
AI887664 Hs.285814 
T91518 

AA570056 Hs.l22730 
R60547 Hs.170098 
R20840 

R23858 Hs.143375 
T92143 Hs.57958 
AA516391 Hs.181368 
AA001860 Hs^9531 
AW450979 

AA356869 Hs.227949 
AA382523 Hs.105689 
At369384 Hs.292441 
AA379621 Hs.105547 
H04150 Hs.107708 
AB029290 Hs.108258 



nuclear factor l/B 

hypothetical protein FU23221 

Rho guanine exchange factor (GEF) 15 

ESTs 

ESTs 

metanoma cell adhesion molecule 

UDP-N-3cetyl-a!pha-D-ga)actosambie:polyp 

podocatyxin-like 

ESTs 

DKFZP58661517 protein 

gb:zl10a05.s1 Soaresj3iegnanLuterus_NbH 

hypothetical proton FU20992 similar to 

RAB6 interacting, kinesin-lilce (rabkines 

ESTs 

ESTs 

ESTs 

ESTs 

UDP-N-acetyt-atpha-D-galactosamine:polyp 
Homer, neuronal Immediate early gene, 3 
mitogen-activated protein Idnase kinase 
complement component Clq receptor . 
Homo sapiens cONA FLJ11949 fis. done HE 
mlcrotubule^ssodated protein IB 
ESTs, Weakly similar to S41044 chromosom 
hypothetical protein FU22041 similar to 
40S ribosomal protein S27 isofomi 
Homo sapiens mRNA full length insert cDN 
ESTs 
ESTs 

endomucin-2 
KIAA1691 protein 

Homo sapiens cONA FU11333 fis, clone PL 

dipeptidyl peptidase 8 

hypothetical protein FU10669 

ESTs. Moderately similar to 154374 gene 

synaptojanin 2 

Homo sapiens mRNA; cDNA bKFZp586N0121 (f 
ESTs. Weakly similar to ALU1 JHUMAN ALU S 
unc5 (Celegans homdog) c 
EST 

gb:za46c11^1 Soares fetal liver spleen 
gb:za69b09^1 SoaresJetaLbJng.NbHL19W 
ESTs 

ESTs. Moderately similar to ALU1_HUMAN A 
gb7i54c08.s1 Soares placenta Nb2HP Homo 
tryptase beta 1 

gb:ye50h09.s1 Soares fetal liver spleen 
Homo sapiens mRNA; cDNA DKFZp586l0324 (f 
gb:2w37e02^1 Soares.totaLfetus_Nb2HF8_ 
hypothetical protein FU22252 similar to 
ESTs, Weakly similar to 138022 hypotheti 
gb:ae52d04.s1 Stratagene lung carcinoma 
gb:ae54e06.sl Stratagene lung cardnoma 
gb:ae56h07^1 Stratagene tung carcinoma 
gb:C13951 Clontech human aorta polyA+ mR 
ESTs 

v-ral simian leukemia viral oncogene horn 

Homo sapiens cDNA FLJ11918 fis, done HE 

hypothetical protein MGC12916 

Homo sapiens done FLB8503 PR02286 mRNA. 

sprouty (Orosophiia) homotog 4 

gb7e20f05.s1 Stratagene lung (937210) H 

ESTs. Moderately similar to K1AA1 215 pro 

KIAA0372 gene product 

gb:yg05c08.r1 Soares infant biain 1N1B H 

Homo sapiens, done 1MAGE:3840937. mRNA, 

EGF-TM7-latrophBiiHelated protein 

U5 snRNP-specinc protein (220 kD), orth 

ESTs 

gb:Ul-H-BI3-ala^12-0-UU1 NCI.CGAP.Su 
SEC13 (S. cerGvlsiae)-nke 1 
MSTP031 protein 
ESTs 

neural proliferation, differentiation an 
ESTs 

adin binding protein; macrophin (mkvof 
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129188 
129314 
129371 
129468 
129765 
129805 
129864 
130495 
130639 
130657 
130828 
130972 
131080 
131137 
131182 
131486 
131573 
131647 
131756 
131859 
131881 
132050 
132083 
132164 
132358 
132413 
132456 
132490 
132676 
132687 
132718 
132736 
132760 
132933 
132968 
132994 
133061 
133147 
133161 
133200 
133260 
133363 
133491 
133517 
133550 
133607 
133614 
133627 
133691 
133696 
133913 
133975 
133985 
134039 
134088 
134161 
134299 
134416 
134453 
134656 
134989 
135051 
135073 
135349 
100114 
100130 
100143 
100168 
100208 
100224 
100405 
100420 
100455 
100529 
100618 
100619 



M30257 
AA028131 
M10321 
J03040 



AA012933 

AA266710 

AA243278 

D59711 

T94452 

AA053400 

AA370302 

J05008 

U85193 

AA255153 

X83107 

AA046593 

AA410480 

D45304 

M90657 

AA010163 

AA136353 

Y07867 

U84573 

X60486 

AA1 32969 

AA1 14250 

F13782 

AA283035 

AB002301 

AA056731 

U68019 

H99198 

AA598702 

N77151 

AA505133 

AB000584 

D12763 

AA253193 

AA432248 

AA083572 

AA479713 

L40395 

X52947 

W80846 

M34539 

D67029 

U095B7 

M&5289 

D10522 

W84712 

D29992 

L34657 

S78569 

D43636 

U97188 

AA487558 

M28882 

X70683 

X14787 

AA236324 

C15324 

AA452000 

D83174 

D00598 

D11428 

D13640 

D14874 

D26129 

D28476 

DB6425 

D86983 

D87953 

HG1862-HT1897 
HG2614-HT2710 
H62639-HT2735 



NWL001078HS.109225 
BE622768 Hs.290356 
X06828 H5.1 10802 
AW410538 Hs.111779 
M86933 Hs.1238 
AA012848 Hs.12570 
AF055581 Hs.13131 
AW250380 Hs.109059 
AI557212 Hs.17132 
AW337575 Hs.201591 
AW631469 Hs.203213 
D81866 Hs^1739 
NM_001955Hs.2271 
W27392 Hs.33287 
AI824144 Hs,23912 
F06972 Hs.27372 
AA040311 Hs^8959 
AA359615 Hs.30089 
AA443966 Hs.31595 
AW960564 
AW361018 Hs.3383 
AI267615 Hs.38022 
BE386490 Hs.279663 
A1752235 Hs.41270 
NM_003542Hs.46423 
AW361383 Hs.260116 
AB011084 Hs.48924 
Nf4^001290Hs.4980 
N925&9 Hs.261038 
AB002301 Hs.54985 
NIVL004600HS.554 
AW081883 Hs.211578 
AA125985 Hs.56145 
BE263252 Hs.6101 
AF2a4532 Hs.61638 
AA1 12748 Hs.279905 
AI186431 Hs.296638 
AA026533 Hs.66 
AW021103 Hs.6631 
AB037715 Hs.183639 
AA403045 Hs.6906 
AI866286 H5.71962 
BE619053 Hs,170001 
NM_000165Hs.74471 
AI129903 Hs.74669 
BE273749 

NIVL003003HS.75232 
NM_002047Hs.75280 
M85289 Hs.211573 
AI878921 Hs.75607 
AU076964 Hs.7753 
C18356 Hs.295944 
L34657 Hs.78145 
NM^002290Hs.78672 
AI379954 Hs.79025 
AA634543 Hs.79440 
AW580939 Hs.97199 
X68264 Hs.211579 
AI272141 Hs.83484 
A1750878 Hs.87409 
AW968056 Hs.92381 
AI272141 Hs.63484 
W55958 Hs.94030 
AA1 14212 Hs.9930 
X02308 Hs.82962 
NM.000304HS.103724 
AU076465 Hs.278441 
H73444 Hs.394 
NWL002933HS.78224 
AL121516 Hs.138617 
AW291587 Hs.82733 
D86983 Hs.1 18893 
AW888941 Hs.75789 
BE313693 Hs.334330 
AI752163 Hs.1 14599 
N24433 Hs.241567 



vascular cell adhesion moiecule 1 
mesoderm development candidate 1 
von WiUebrand factor 
secreted protein, acidic, cystelne-rich 
amelogenin (Y ctiromosome) 
tutHJiin-specific ctiapenone d 



mitochondrial ribosomal protein L12 
ESTs» Moderately similar to 154374 gene 
ESTs 
ESTs 

Homo sapiens mRNA; cDNA DKFZp586t1 518 (f 
endotheiln 1 
nuclear factor l/B 
ESTs 

BMX non-receptor tyrosine kinase 

ESTs 

ESTs 

ESTs 

transmembrane 4 superfamily member 1 
upstream regulatory element binding prot 
ESTs 
Pirin 

procollagen-lysine. 2-oxoglutarate 5-dio 
H4 histone family, member G 
metaltoprotease 1 (pitrilysin family) 
KIAA0512 gene product; ALEX2 
LIM domain binding 2 
ESTs, Weakly similar to 138022 hypotheti 
KIAA0303 protein 

Sjogren syndrome antigen A2 (60kD, ribon 
Homo sapiens cDNA: FU23037 fjs. done L 
thymosin, beta, identified In neuroblast 
hypothetical protein MGC3178 
myosin X 

cloneHQ0310PRO0310p1 
prostate differentiation factor 
interleukin 1 receptor-like 1 
hypothetical prot^ FU20373 
hypothetk»l protein FU10210 
Homo sapiens cDMA: FU23197 fis, done R 
ESTs, Weakly similar to B36298 proline-r 
eukaryotictranslatbn initiation fador 
gap junction protein, alpha 1, 43kD (con 
veslcte-assoclated membrane protein 5 (m 
FK506-binding protein 1A (12kD) 
SEC14 (S. cerevlsiae)-like 1 
glycyl-tRNA synthetase 
heparan sulfate proteoglycan 2 (pertecan 
myristoylated alanlne-rfch protein kinas 
calumenin 

tissue factor pathway inhibitor 2 
platetet/endotheOal cell adhesion molec 
laminln. alpha 4 
K1AA0096 protein 
IGF-II mRNA-binding protein 3 
complement component C1q receptor 
melanoma ceil adhesion molecule 
SRY (sex determining region Y)-box 4 
thrombospondtn 1 

nudix (nudeoside diphosphate linked mol 

SRY'(sex detennining region Y)-bQX 4 

Homo sapiens mRNA; cDNA DKFZp586E1624 (f 

serine (or cysteme) proteinase inhibilo 

thymidylate synthetase 

peripheral myePai protein 22 

KlAAOOISgeneprodud 

adrenomeduiiin 

ribonudease. RNase A family. 1 (pancrea 
thyrokf honnone receptor interactor 12 
nldogen 2 

Melanoma assodated gene 

N-myc downstream regulated 

calmoduHn 2 (phosphorylase kinase, delt 

collagen, type Vlll, alpha 1 

RNA landing motif, single stranded inter 
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100658 

100676 

100718 

100752 

100828 

100850 

100991 

101097 

101110 

101142 

101156 

101168 

101184 

101192 

101317 

101336 

101345 

101400 

101475 

101485 

101496 

101505 

101543 

101557 

101560 

101587 

101592 

101633 

101634 

101667 

101682 

101714 

101720 

101741 

101744 

101793 

101837 

101838 

101840 

101857 

101864 

101931 

101966 

102012 

102013 

102024 

102059 

102121 

102283 

102300 

102378 

102395 

102460 

102491 

102499 

102523 

102560 

102564 

102588 

102600 

102645 

102687 

102693 

102709 

102759 

102804 

102882 

102907 

102915 

102927 

102960 

103011 

103020 

103029 

103036 

103056 



HG2855-HT2995 
HG3044-HT3742 
HG3342-HT3519 
H63543^T3739 
HG4069-HT4339 
H6417-HT417 
J03764 
L06797 
L08246 
L12711 
L13977 
L15388 
L19871 
L20859 
L42176 
L49169 
L76380 
M15S90 
M23254 
M24736 
M26576 
M27396 
M31156 
M31994 
M32334 
M35878 
M36429 
M57730 
M57731 
M60858 
M62994 
M68874 
M69a43 
M74719 
M75126 
M84349 
M92843 
M92934 
M93056 
M94856 
M95787 
S76965 
S81914 
U03057 
U03100 
U03877 
U08021 
U14391 
U31384 
U32944 
U40369 
U41767 
U48959 
U51010 
U51478 
U53445 
U59289 
U59423 
U62015 
U63825 
U67963 
U73379 
U73824 
U77604 
U81607 
U89942 
X04412 



X07820 
X12876 
X15729 
X52541 
X53416 
X54489 
X54925 
X57206 



U56725 Hs.180414 
X02761 Hs.287820 
BE295928 Hs.75424 
T81309 

AL048753 Hs.303649 
AA838472 Hs.297939 
J03836 Hs.82085 
BE245301 Hs.89414 
AM39011 Hs.86386 
L12711 Hs.89643 
AA340987 Hs75693 
NM^005308Hs.211569 
NIVL001674HS.460 
BE247295 HsJ8452 
L42176 Hs.8302 
NIVL006732HS.75678 
NM_005795Hs.152175 
M15990 Hs.194148 
BE410405 Hs.76288 
AA296520 Hs.89546 
X12784 Hs.1 19129 
AA307680 Hs.75692 
M31166 Hs.2050 
BE293116 Hs.76392 
AW958272 Hs.347326 
AI752416 Hs.77326 
AF064853 Hs,91299 
NM_004428Hs.1624 
AV650262 Hs.75765 
NIVL005381 
AF043045 Hs.81008 
M68874 Hs.211587 
M69043 Hs.81328 
NM.003199HS.326198 
At879352 Hs.1 18625 
W01076 Hs.278573 
M92843 HS.3435&6 
BE243845 H8.75511 
AA236291 Hs.183583 
BE550723 Hs.153179 
BE392588 Hs.75777 
NWL006823HS.75209 
X96438 Hs.76095 
BE259035 Hs.11840O 
BE616287 Hs.178452 
AA301867 Hs.76224 
Ar752666 Hs.76669 
NM>004998Hs.82251 
AW161552 Hs.83381 
AI929721 Hs.5120 
AU076887 Hs.28491 
AU077005 Hs.92208 
U48959 Hs.211582 
U51010 

BE243877 Hs.76941 
U53445 Hs.15432 
R97457 Hs.63984 
U59423 Hs.79067 
AU076728 Hs.8867 
AI984144 Hs.66713 
AL1 19566 Hs.6721 
NM.007019HS.930Q2 
AA532780 Hs.183684 
AA122237 Hs.81674 
NM.005100HSJ88 
NM.002318HS.63354 
A}767736 Hs.290070 
BE409661 Hs.202833 
X07820 Hs.2258 
BE512730 Hs.65114 
AI904738 Hs.76053 
AJ243425 Hs.326035 
X53415 Hs.195464 
AW800726 Hs.789 
M13509 Hs.83169 
Y18024 Hs.78877 



heat shock 70kD protein 2 
fibronectin 1 

inhibitor of DNA binding 1, dominant neg 
insulin-lil(e growth fector 2 (sonnatomedi 
small inducible cytokine A2 (monocyte ch 
cathepsin 8 

serine (or cysteine) proteinase Inhibito 

chemoklne (C-X-C motiO. receptor 4 (fus 

myeloid cell leukemia sequence 1 (6CL2-r 

transketotase (Wernicke-Korsakoff syndio 

protylcarboxypeptidase (angbtensinase C 

G protein-coupled receptor kinase 5 

activating transcription factor 3 

solute carrier family 20 (phosphate tran 

four and a half LiM dornains 2 

FBJ murine osteosarcoma viral oncogene h 

calcitonin receptor-Tike 

v-yes-1 YamaguchI sarcoma viral oncogene 

caipain 2, (m/ii) large subunit 

seiectin E (endothelial adhesion molecul 

collagen, type \V, alpha 1 

asparagine synthetase 

pentaxin-reialed gene, rapkliy induced b 

aldehyde dehydrogenase 1 family, member 

intercellular adhesion molecule 2 

tnsu!in-iike growth factor blndir^ prote 

guanine nudeotkie binding protein (G pr 

ephrin-AI 

GR02 oncogene 

nucleolin 

fllamtn B, beta (actin-bindlng protein-2 
phosphoiipase A2, group IVA (cytosolic» 
nuclear factor of kappa l^ht polypeptid 
transcription factor4 
hexokbiasel 

CD59 antigen p18-20 (antigen kJenSfled 
zinc linger protein hombtogous to Zfp-36 
connective bssue growth factor 
serine (or cysteine) proteins inhibito 
fotty acid binding protein 5 (psoriasls- 
transgelln 

protein kinase (cAMP-dependent, catalyti 
immediate eariy response 3 
singed (Dmsophila)-tike (sea urchin fas 
catenln (cadherin^ssociated protein), a 
EGF-containing fibulln-llke extracellula 
nicotinamide N-methyitransfeFase 
myosin IE 

guanine nucleotide binding protein 11 
dynein, cytoplasmic, light polypeptide 
spennkiine/spennine Nl-acetyltransferase 
a disintegrin and metallopmteinase dome 
myosin, iight polypeptide kinase 
gb:Human nicotinamide N-methyitransferas 
ATPase, Na->/K+ transporting, beta 3 poly 
downreguiated in ovarian cancer 1 
cadherin 13, H<cadherin (heart) 
MAD (mothers against decapentaplegic, Or 
cysteine-rich. angk)genic inducer, 61 
hepatitis delta ant^en-lnterading prot 
lysosomal 

ubkiuitin carrier protein E2-C 
eukaryotic translation initiation factor 
mtoosomal gtutathbne S-transferase 2 
A kinase (PRKA) anchor protein (gravin) 
lysyl Qxidase-like 2 
gelsoDn (amytoklosis. Finnish type) 
heme oxygenase (decydlng) 1 
matrix metalloproteinase 10 (stromelysin 
keratin 18 

DEAD/H (Asp-Glu-Ala^p/His) box polypep 
eariy growth response 1 
filamin A, alpha (actin-binding protein- 
GR01 oncogene (melanoma growti) stimulafi 
matrix metalloproteinase 1 (interstitial 
inositol 1,4.54risphosphatB 3-kinase B 
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103080 X59798 
103095 X60957 
103138 X65965 
103176 X69111 
103195 X70940 
103347 X87838 
103371 X91247 
103432 X97748 
103471 Y00815 
103987 AA303711 
104447 L44538 
104764 AA025351 
104783 AA027050 
104798 AA029462 
104865 AA045136 
104877 AA047437 
104894 AA054087 
104952 AA071089 
105113 AA156450 
105178 AA1874gO 
105196 AA195031 
105215 AA205724 
10S263 AA227926 
105271 AA227986 
105330 AA234743 
105461 AA253216 

105492 AA256210 

105493 AA256268 
105594 AA279397 
105727 AA292379 
105732 AA292717 
105767 AA346551 
105882 AA400292 
105936 AA404338 
106031 AA412284 
106124 AA423987 
106222 AA428594 
106241 AA430108 

106263 M431462 

106264 M431470 
106366 AA443756 
105454 AA449479 
105634 AA459916 
105724 AA465226 
106793 AA478778 
106799 AA479037 
106842 AA462597 
106868 AA487561 
106890 AA489245 
106961 AA504110 
106974 AA520989 
107030 AA5g9434 
107061 AA608649 
107086 AA609519 
107216 D51069 
107385 U97519 
107444 W28391 
107985 AA035638 
108507 AA083514 
108695 AA121315 
108931 AA147186 
109001 AA156125 
109195 AA188932 
109390 AA219653 
109456 AA232645 
109737 F100r8 
110411 H48032 
110660 H82117 
110906 N39584 
111018 N54067 
111091 N59858 
111356 N90933 
111378 N93764 
111741 R26124 
111769 R27957 
112318 R55470 



AU077231 Hs.82932 
NiyL005424Hs.78824 
X65965 

AL021154 HsJ6884 
AA351647 Hs.2642 
AU077309 Hs.171271 
X91247 Hs.13046 
X97748 

Y00815 Hs.75216 
AL120051 Hs.144700 
AW204145 Hs.156044 
A1039243 Hs.278585 
AA533513 Hs.93659 
AW952619 Hs.17235 
T79340 Hs.22575 
AI138635 Hs.22968 
AF065214 Hs.18858 
AW076098 Hs.345588 
AB037816 Hs.8982 
AA313825 Hs^ig41 
W84893 Hs.9305 
AA205759 Hs.10119 
AW388633 H5.6682 
AA807881 Hs.25329 
AW338625 Hs^2120 
BE539071 Hs.69388 
A1805717 Hs.289112 
AL047586 Hs.10283 
AB024334 Hs.25001 
AL135159 Hs.20340 
AW504170 Hs.274344 
AW370946 Hs.23457 
W46802 Hs.81988 
AI678765 Hs.21812 
X64116 Hs.171844 
H93366 Hs.7567 
AA356392 HsJ21321 
BE019681 Hs.60ig 
W21493 Hs.28329 
AL046859 Hs.3407 
AA186715 Hs.336429 
NM_014038Hs.5216 
W25491 Hs.288909 
N48670 Hs.28631 
H94997 Hs.16450 
BE313412 Hs.7961 
AF124251 Hs.26054 
BE185536 Hs.301183 
AA489245 Ks.88500 
AW243614 Hs.18063 
AI817130 Hs.9195 
AL117424 Hs^35 
BE147611 Hs.6354 
NM_012331Hs.26458 
D51069 Hs.211579 
NM_005397Hs.16426 
W28391 Hs.343258 
T40064 Hs.71968 
A1554545 Hs.68301 
AB029000 Hs.70823 
AA147186 

AI056548 Hs.72116 
AF047033 Hs.132904 
AW007485 Hs.87125 
AW956580 Hs.42699 
AA055415 Hs.13233 
AW001579 Hs.9645 
AA782114 Hs.28043 
AA035211 Hs.17404 
AI287912 Hs.3628 
AA300067 Hs.33032 
BE301871 Hs.4867 
AW160993 Hs,326292 
AB020653 Hs.24024 
AW629414 Hs.24230 
AW083384 Hs.11067 



cyclin D1 (PRAD1: parathyroid adenomatos 

tyrosine kinase with immunogtobulin and 

gb:H.sapien5 800-2 gene for manganese su 

inhibitor of DNA binding 3, dominant neg 

eukaryotic translation elongation factor 

catenin (cadherin-assodated protein), b 

thioredoxin reductase 1 

gb:H.sapiens PTX3 gene promoter region. 

protein tyrosine phosphatase, receptor t 

ephrin-61 

ESTs 

ESTs 

protein disuifide isomerase related prot 

Homo sapiens clone TCCCIA00176 mRNA sequ 

B-cell ClITIymphoma 6, member B (zinc fi 

Homo sapiens clone IMAGE:451939, mRNA se 

phosphoiipase A2, group iVC (cytosolic. 

desmoplaldn (DPI, DPIl) 

Homo sapiens, clone IMAGE:3506202, mRNA. 

AD036 proton 

angiotensin leceptor-iitce 1 

hypothetical protein FU14957 

solute carrier family 7, (cationic amino 

ESTs 

ESTs 

hypothetical protein FLJ20505 

CGU3 protein 

RNA binding motif protein 88 

tyrosine 34nonooxygenase/tryptDphan 5-mo 

KIAA1002 protein 

hypothetical protein MGC12942 

ESTs 

disabled (DrosophDa) homotog 2 (mitogen 
ESTs 

Homo sapiens cDNA: FU22296 fis, clone H 

Homo sapiens cDNA: FLJ21962 fis, done H 

Homo sapiens clone FLB9213 PR02474 mRNA. 

Homo sapiens cONA: FIJ21288 fis. done C 

hypothetical protein FU14005 

protein kinase (cAMP-dependent catalyU 

RIKEN cDNA 9130422N19 gene 

HSPC028 protein 

hypothetica! protein FU22471 

Homo sapiens cDNA: FU22141 fis. done H 

ESTs 

Homo sapiens done 250 1 2 mRNA sequence 
novel SH2-containing protein 3 
molecule possessing anlcyrin repeats indu 
mitogen-activated protein kinase 8 inter 
Homo sapiens cDNA FU10768 fis, done NT 
Homo sapiens cDNA FU13698 fis, done PL 
chlorkte intracellular channel 4 
stromal cell derived factor receptor 1 
methtoi^ sulfoxide reductase A 
melanoma cell adhesion molecuie 
podocalyxin-like 

proiiferatlon-^ssodated 2G4. 38kD 

Homo sapiens mRNA; cDNA DKFZp564F053 (fr 

ESTs 

KIAA1077 protein 

gb:zo38d01.s1 Stratagene endothelial eel 
hypothetical protein FLJ20992 similar to 
solute carrier family 4, sodium bicarbon 
EH^omain containing 3 
ESTs 

ESTs. Moderately similar to A47582 B-cel 
Homo sapiens mRNA for K1AA1741 protein. 
ESTs 
ESTs 

n^ogen-activated protein kinase kinase 
hypothetical protein DKFZp434N185 
mannosyl (alpha-1,3-}-glycoprDtein beta- 
hypothetical gene DKFZp434A1 1 14 
KIAA0846 protein 
ESTs 

ESTs, Highly similar b T46395 hypotheti 
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112951 


T16550 


AA307634 


Hs.6650 


113057 


T26674 


AW194301 Hs.339283 


113195 T57112 


H83265 


Hs.8881 


113490 T88700 


BE178110 


Hs.1 73374 


113542 T90527 


H43374 


Hs.7890 


113803 W42789 


AW880709 H&283683 


113847 


W60002 


NM_005032Hs.4114 


113910 W78175 


AA1 13262 


Hs.17901 


113947 


W84768 


W84768 




114047 


W94427 


AL035858 


Hs.3807 


115081 


AA253217 


AI751438 


Hs.41271 


115819 


AA426573 


AA486620 


Hs.41135 


115870 AA432a74 


NIVL005985HS.46029 


115984 AA446622 


AA9o75oo 


Hs.74313 


116228 AA478771 


A I7A7Q>I7 


11 _ CAD 4 4 


116264 


AA482594 




nS.272239 


116314 


AA490588 


AI799104 


Hs. 178705 


116589 


D59570 


AI557212 


Hs,17132 


117023 


H88157 


AW070211 


Hs.102415 


117112 


H94648 


AW969999 


Hs.293658 


117156 


H97538 


W73853 




117176 


H98670 


H45100 


Hs.49753 


117280 


N22107 


M18217 


Hs.172129 


119559 


W38197 


W38197 




119866 


W80814 


AA495205 


Hs.193700 


120655 


AA287347 


AA305599 


Hs^38205 


121314 


AA402799 


W07343 


Hs.182538 


121335 AA404418 


AA404418 




121822 AA425107 


AI743860 




121835 


AA425435 


AB033030 


Hs,300870 


122331 


AA442872 


AL133437 


Hs.1 10771 


122577 


AA452860 


AA829725 


Hs.334437 


123160 


AA468687 


AA488687 


Hs.284235 


123486 


AA599674 


BE019072 


Hs.334802 


124059 


F13673 


BE387335 


Hs.283713 


124339 


H99093 


H99093 


Hs.343411 


124358 


N22495 


AW070211 


Hs.102415 


124364 


N23031 


AF265555 


Hs.250646 


124726 


R15740 


NI^003654Hs.104576 


124763 


R39610 


BE410405 


Hs.76288 


125167 


W45560 


AL137540 


HS.1Q2541 


125304 


Z39833 


AL359S73 


Hs.124940 


125307 


Z40583 


AW580945 Hs.330466 


125329 


AA825437 


AA825437 


Hs.58875 


125598 


R66613 


T40064 


Hs.71968 


125609 AA868063 


AA868063 


Hs.104576 


418245 AA128075 


AA088767 


Hs.83883 


127435 


N66570 


X69086 


Hs.286161 


127566 


AI051390 


AI051390 


Hs.1 16731 


127619 


AA627122 


AA627122 


Hs.163787 


128453 


X02761 


X02761 


Hs.287820 


128495 AF010193 


NM.005904HS.100602 


128515 AA149044 


BE395085 


Hs.10086 


128580 


U82108 


U82108 


Hs.101813 


128623 


D78676 


BE076608 


Hs.105509 


128642 


L35240 


Z28913 


Hs.102948 


128669 


AA598737 


W28493 


Hs.180414 


128903 


R69417 


AW150717 Hs.345728 


128914 


AA232837 


AW867491 Hs.107125 


129087 


N72695 


AI348027 


Hs.108557 


129188 


M30257 


NM_001078Hs.109225 


129226 


M96843 


BE222494 


Hs.180919 


129265 


X68277 


AA530892 


Hs.171695 


129345 AA292440 


R22497 


Hs.1 10571 


129468 


J03040 


AW410538 Hs.1 11779 


129488 AA228107 


AW966728 Hs.54642 


129498 


AA449789 


AA449789 


H3.75511 


129557 


W01367 


AL045404 


Hs.46366 


129819 


AA610116 


AA209534 


Hs.284243 


129827 


AA258308 


T40064 


Hs.71968 


129762 AA460273 


AA453694 


Hs.12372 


129864 


AA286710 


AF055581 


Hs.13131 


130018 T68873 


AA353093 




130147 


D63476 


D63476 


Hs.172813 


130178 


M62403 


U20982 


Hs.1516 


130282 


X55740 


BE245380 


Hs.153952 



vacuolar protein sorting 45B (yeast homo 
Human DNA sequence from clone RP1-187J11 
ESTs, Weakly similar to S41044 diromosom 
Homo sapiens cDNA RJ10500 fis, clone NT . 
Homo sapiens mRNA for KIAA1671 protein, 
chromosome 8 open reading frame 4 
plastin 3 (T Isoform) 

Homo sapiens, done II^GE:3937015, mRNA, 
gb:zh53d03.s1 Soares_fetaLUver.spleen_ 
FXYD domain-containing ion transport reg 
Homo sapiens mRNA fuD length insert cDN 
endomuCfn-2 

snail 1 (drosophila homolog), zinc finge 

KIAA1265 protein 

ESTs 

lysosomal 

Homo sapiens cDNA FU11333 fis. done PL 
ESTs, Moderately similar to 154374 gene 
Homo sapiens mRNA; cDNA DKFZp586N0121 (f 
ESTs 
ESTs 

uveal autoantlgen vi/ith coiled coii domai 
Homo sapiens cDNA: FLJ21409 fis. done C 
Empirically selected from AFFX single pr 
Homo sapiens mRNA; cDNA DKFZp586i0324 (f 
hypothetical protein PRO2013 
phospholipid scramblase 4 
gb:z«v37e02.s1 Soares.tDtaLfetus_Nb2HF8_ 
metallothlonein IE (functional) 
KIAA1204 protein 

Homo sapiens cDNA: FLJ21904 fis. done H 
hypothetical protein MGC4248 
ESTs. Weakly similar to 138022 hypolhell 
Homo sapiens cDNA FLJ14680 fts, clone NT 
ESTs. Weakly similar to S64054 hypotheli 
DEAD/H (Asp-Glu-Ala-Asp/His) box polypep 
Homo sapiens mRNA; cDNA DKFZp586N0121 (f 
baculoviral lAP repeat-containing 6 
carbohydrate (keratan suifote Gal-6) sul 
calpam 2. (m/ll) large subunit 
netrfn4 

GTP-binding protein 

ESTs 

ESTs 

Homo sapiens mRNA: dDNA DKFZp564F053 (fr 
carbohydrate (keratan sulfite Gal-6) sul 
transmembrane, prostate androgen induced 
Homo sapiens cDI^ FU13613 lis. ck)ne PL 
ESTs 
ESTs 

fibronedin 1 

MAD (mothers against decapentaplegic Or 
type I transmembrane protein Fn14 
solute canler family 9 (sodkim/hydrogen 
CTL2gene 

enigma (LIM domain protein) 
heat shock 70kD protein 8 
STAT induced STAT inhibitor 3 
plasmalemma veslde assodated protein 
hypothetical protein PP1057 
vascular cell adhesion mdecule 1 
inhibitor of DNA binding 2. dominant neg 
dual spedfldty phosphatase 1 
growth anest and DNA-damage-indudble. 
secreted protein, addic, cysteine-rich 
methionine adenosyltransferase II, beta 
connective tissue growth factor 
KtAA0948 protein 
tetraspan NET-6 protein 

Homo sapiens mRNA; cDNA DKFZp564F053 (fr 
tripartite motif protein TRll^ 
lysosomal 
metellothionein 1L 

PAK-interacb'ng exchange factor beta 
insunn-like growth factor-bbiding prote 
5'nudeotktese(CD73) 
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130431 L10284 AW505214 Hs.155560 

130495 AA243278 AW250380 Hs.109O59 

130553 AA430032 AF062649 Hs.252587 

130638 H16402 AW021276 Hs,17121 

5 130639 D59711 AI557212 Hs.17132 

130657 T94452 AW337575 Hs.201591 

130686 AA431571 BE548267 Hs.337986 

130776 R79356 AF167706 Hs.192B0 

130818 AA280375 AW190920 Hs.19928 

10 130840 Z49269 BE048821 Hs.20144 

130899 Z4174D AI077288 Hs.296323 

131002 AA121543 AL050295 Hs^39 

131080 J05008 NM.001955Hs.2271 

131084 AA101878 NWL017413Hs.3030ft4 

15 131091 T35341 AJ271216 Hs.22880 

131107 N87590 66620886 HsJ5354 

131182 AA256153 Ar824144 Hs.23912 

131207 W74533 AF104266 Hs.24212 

131319 U25997 NIVL003155Hs.25590 

20 131328 V01512 AW939251 Hs.25647 

131328 V01512 AW939251 Hs.25647 

131328 VQ1512 AW939251 Hs.25647 

131328 V01512 AW939251 Hs.25647 

131509 X56681 X56681 Hs.2780 

25 131555 AA161292 T47364 Hs.278613 

131564 AA491465 T9350D Hs.28792 

131573 AA046593 AA040311 Hs.28959 

131692 D50914 BE559681 Hs.30736 

131756 D45304 AA443966 Hs.31595 

30 131859 M90657 AW960564 

131909 W69127 NlVL016558Hs.274411 

131915 AA316186 AI161383 Hs.34549 

132046 AA384503 AI359214 Hs.179260 

132050 AA136353 A1267615 HS.3B022 

35 132151 AA044755 BE379499 Hs.173705 

132164 U84573 AI752235 Hs.41270 

132187 AA058911 AA235709 Hs.4193 

132303 AA620962 BE177330 Hs.325093 

132314 AA285290 AF112222 Hs.323806 

40 132358 X60486 NryL003542Hs.46423 

132398 R31641 AA876616 Hs.16979 

132421 AA489ig0 AW163483 H$.48320 

132490 F13782 NIVL001290Hs.4980 

132520 AA257993 AA257992 Hs.50651 

45 132546 M24283 M24283 Hs.1683B3 

132610 AA443114 AA160511 Hs.5326 

132716 T35289 BE379595 Hs.283738 

132840 N23817 BE218319 Hs.5807 

132883 AA047151 AA373314 HS.5B97 

50 132968 N77151 AF234532 Hs.61638 

132989 AA480074 AA480074 Hs.331328 

132999 Y00787 Y00787 Hs.624 

133071 T99789 BE384932 Hs.64313 

133076 W84341 AW946276 Hs.6441 

55 133099 L09209 W16518 Hs.279518 

133147 D12763 AA026533 Hs.66 

133149 T16484 AA370045 Hs.6607 

133161 AA253193 AW021103 Hs.6631 

133200 AA432248 AB037715 Hs.183639 

60 133220 X82200 NM.006074Hs.31 8501 

133260 AA083572 AA403045 Hs.6906 

133295 L00352 AI147861 Hs.213289 

133349 N75791 AW631255 Hs.8110 

133391 X57579 AW103364 Hs.727 

65 133398 X02612 NM.000499Hs72912 

133436 H44631 BE294068 Hs.737 

133454 AA090257 BE547647 Hs.177781 

133478 X83703 X83703 Hs.31432 

133491 L40395 66619053 Hs.170001 

70 133510 AA227913 AW880841 Hs.96908 

133517 X52947 NM.000165HS.74471 

133526 M11313 AU077051 Hs74561 

133538 L14837 NM^003257Hs74614 

133562 M60721 M60721 Hs.74870 

75 133584 D90209 D90209 Hs.181243 

133590 T67986 T70956 Hs.75108 



calnexin 

mitochondrial ribosomal protein L12 
pituitary tumor-transformfng 1 
ESTs 

ESTs, Moderately similar to 154374 gene 
ESTs 

Homo sapiens cONA FU10934 fis. done OV 

cysteine-rich motor neuron 1 

hypo(he6cal protefn SP329 

small inducible cytokine sub^y A (Cy 

serum/gluoocortkx)bi regulated kinase 

KIAA0758 protein 

endofheKn 1 

apelln; peptide Bgand for APJ receptor 

dipeptidylpeptidase III 

GCN1 (general control of amino^cid synt 

ESTs 

latrophilin 
stanniocaldn 1 

v4os FBJ murine osteosarcoma viral onco 
v-fos FBJ murine osteosarcoma viral onco 
v-fos FBJ murine osteosarcoma viral onco 
v-fos FBJ murine osteosarcoma viral onco 
jun D proto-oncogene 
interferon, alpha-indudble protein 27 
Homo sapiens cDNA FU11041 fis, done PL 
ESTs 

K1AA0124 protein 
ESTs 

transmembrane 4 superfamtly member 1 

SCAN domain-containing 1 

ESTs, Highly similar to S94541 1 clone 4 

ciiromosome 14 open reading frame 4 

ESTs 

Homo sapiens cDNA: FIJ22050 fis, done H 

procollagen-lysine, 2-oxoglutarate 5-dio 

DKFZP58601624 protein 

Homo sapiens cDNA: FLJ21210 fis, clone C 

pinin. desmosome assodated protein 

H4 histone bmily, member G 

ESTs, Weakly similar to A43932 mudn 2 p 

double r^g-rmger protein. Dorfin 

LIM domain binding 2 

Janus kinase 1 (a protein ^rosine kinas 

interceDutar adhesion molecule 1 (CD54) 

amino add system N transporter 2; porcu 

casein kinase 1, alpha 1 

GTPase Rab14 

Homo sapiens mRNA; cDNA DKFZp586P1622 (f 

myosin X 

hypothetical protein FU13213 
interieukin 8 

ESTs. Weakly similar to AF2571 82 1 G-pro 

Homo sapiens mRNA; cDNA DKF2p586J021 (fr 

amyioid beta (A4) precursor-fike protein 

interleukin 1 receptor-like 1 

AXIN1 up-regulated 

hypothetical protein FLJ20373 

hypothetical protein FLJ10210 

Homo sapiens mRNA full length Insert cDN 

Homo sapiens cONA: FLJ23197 fis, clone R 

tow density lipoprotein receptor (famili 

L-3-hydroxyacyl-Coenzyme A dehydrogenase 

inhibin. beta A (adivin A, activin AB a 

cytochrome P450, subfamily I (aromatic c 

immediate early protein 

hypottietnal protein MGC5618 

cardiac ankyrin repeat protein 

eukaiyotic translatkm initiation feidor 

p53-lnduoed protein 

gap Junction protein, alpha 1. 43kD (con 

alpha-2-macroglobuBn 

tight junction protein 1 (zona oodudens 

H2.0 (Drosophila>Iike homeo box 1 

activating transcription factor 4 (tax-r 

clusterin (complement lysis inhibitor, 8 
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133617 AA148318 BE244334 Hs.75249 

133651 U97105 AI301740 Hs.173381 

133671 T25747 AW503116 Hs.301819 

133678 K02574 AW247252 

5 133681 D78577 AI352558 

133722 X53331 AW969976 Hs.279009 

133730 S73591 BE242779 Hs.l79526 

133750 X95735 BE410769 Hs.75873 

133802 L16862 AW239400 Hs.76297 

10 133825 U44975 BE616902 Hs.285313 

133838 M97796 BE222494 Hs.180919 

133859 U86782 U86782 Hs.178761 

133889 AA099391 U48959 Hs.211582 

133960 M19267 M19267 Hs.77899 

15 133975 D29992 C18356 Hs.295944 

133977 L19314 AI125639 Hs.250666 

134039 S78569 NM_002290Hs.78672 

134075 U28811 NM_012201Hs.78979 

134081 L77886 AL034349 Hs.79005 

20 134164 C14407 AW245540 Hs.79516 

134203 M60278 AA161219 Hs.799 

134238 R81509 AA102179 Hs.160726 

134299 AA487558 AW580939 Hs.97199 

134332 D86962 D86962 Hs.81875 

25 134339 AA478971 R70429 Hs.81988 

134343 D50683 D50683 Hs.82028 

134381 U56637 AI557280 Hs.184270 

134403 M61199 AA334551 

134416 hAZBSSl X68264 Hs.211579 

30 134493 X15183 IVI30827 Hs.28908d 

134558 S53911 NIUL001773H5.85289 

134817 U20734 AU076592 H5.ig8951 

134983 D28235 D28235 HS.19S384 

134989 AA236324 AWg680S8 Hs.92381 

35 135052 AA148923 AL136653 Hs.93675 

135062 AA174183 AK000967 Hs.93872 

135069 AA466311 AA876372 Hs.93961 

135071 L08069 W27190 Hs.94 

135073 AA452000 W55956 Hs.94030 

40 135170 AA282140 T53169 Hs.9587 

135196 J02854 C03577 Hs,9615 

135348 AA442054 U80983 Hs^68177 

134404 AB000450 AB000450 Hs.82771 
439561 AB002380 AF180681 Hs.6582 

45 100082 AB003103 AA130080 Hs.4295 

132817 AB004884 N27852 Hs.57553 

130150 AF000573 BE094848 H3.15113 

100104 AF008937 AF008937 

447973 AF009301 AB011169 Hs.20141 

50 332613 AF009368 AF029674 Hs.173422 

100113 D00591 NM_001269Hs.84746 

133980 D00760 AA294921 Hs,348024 

100129 D11139 AA469369 Hs.5831 

100154 D14657 H60720 Hs.81892 

55 100169 D14878 AL037228 Hs.82043 

129718 D17716 NM.002410HS.121502 

100190 D21090 M91401 K5.176658 

134742 D26135 N(il.001346H5.89462 

100211 D2652B 026528 Hs.123058 

60 100238 D30742 L24959 Hs^48 

130283 031762 NM.012288HS.1 53954 

134237 031765 031765 Hs.170114 

100246 031888 N^L015156H$.78398 

100256 038128 025418 Hs.393 

65 100262 038500 D38500 Hs.278468 

134329 D38551 N92036 Hs.81848 

100281 042087 AF091035 Hs.184627 

100294 049396 AA331881 Hs.75454 

100327 D55640 D55640 

70 100335 063391 AW247529 Hs.6793 

134495 063477 063477 Hs.84087 

100338 063483 D86864 Hs.57735 

135152 D64015 M96954 Hs.182741 

-134269 079990 NM-014737H8.80905 

75 100372 079997 NM_014791Hs.184339 

134304 D80010 BE613486 Hs.81412 



AOP-ribosylation factor-Dke 6 interacti 

dihydropyrimidinase-iike 2 

zinc finger protein 146 

nucleoside phosphorylase 

tyrosine 3-monooxygenase/tfyptophan 5-mo 

matrix Gla protein 

upregulated by 1,25-dihydR)xyvitamin 0-3 
zyxin 

G protein-coupted receptor kinase 6 

core promoter element binding protein 

inhibitor of DNA binding 2, dominant neg 

26S proteasome<associated padi homolog 

myosin, light polypeptide kinase 

tropomyosin 1 (alpha) 

tissue factor pathway inhibitor 2 

haiiy (Drosophi[a)-homolog 

laminin, alpha 4 

Gotgi apparatus protein 1 

protein tyrosine phosphatase, receptor t 

brain abundant, membrane attached signal 

diphtheria toxin receptor (heparin-bindi 

Homo sapiens cONA FLJ11680 fis. clone HE 

complement component Clq receptor 

growth factor receptor-bound protein 10 

disabled (Drosophlla) homolog 2 (mitogen 

transfonning growth factor, beta rscepto 

capping protein (actin filament) muscle 

spenn speciric antigen 2 

melanoma cell adhesion molecule 

heat shock 90kO protein 1 » alpha 

C034 antigen 

JunBprolo-oncogene 

prostaglandin-endoperoxkle synthase 2 (p 

nudix (nucleoside diphosphate linked moi 

decidual protein Induced by progesterone 

KIAA1682 protein 

Homo sapiens mRNA; cONA OKFZp667D095 (fr 
OnaJ (Hsp40) homolog. subfamily A. membe 
Homo sapiens mRNA: cONA OKFZp586E1624 (f 
Homo sapiens cDNA: FLJ22290 fis, ctone H 
myosin regulatory light chain 2. smooth 
phospholipase C. gamma 1 (fonmerfy sub^ 
vaccinia related kinase 2 
Rho guanine exchange factor (GEF) 12 
proteasome (prosome. macropain) 26S subu 
tousled-like kinase 2 

homogentisate 1.2-dk)xygenase (homogenti 
syntaxin 16 

similar to S. cerevisiae SSM4 

KIAA1605 protein 

chromosome condensation 1 

v-ral simian leukemia viral oncogene horn 

tissue inhibitor of metalloproteinase 1 

KIAA0101 gene product 

0123 gene product 

mannosyl (alpha-1,&')^lycoproteln beta- 
RAD23 (S. cerevisiae) homotog B 
diacytglycerol kinase, gamma (90kO) 
DEAD/H (Asp^tu-Ala-AspftHis) box polypep 
catdum/calmodulln-dependent protein kin 
TRAM-like protein 
KtAA0061 protein 
KIAA0071 protein 

prostaglandin 12 (prostacyd^) receptor 
postmeiotic segregation increased 2-like 
RA021 (S. pombe) homolog 
KIAA01 18 protein 
peroxiredoxin 3 

gb:Human monocyte PABL (pseudoautosomal 

platelet-activating tactorace^lhydrola 

KIAA0143 protein 

acetyl LDL receptor; SREC 

TIA1 cytotoxic granule-associated RNA-bl 

Ras association (RalGOS/AF-6) domain fam 

KtAA0175 gene product 

lipin 1 
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100394 
100405 
100418 
133154 
134347 
444099 
100438 
134593 
100481 
100552 
100591 
100652 
100662 



100905 

100945 

100950 

100964 

135407 

130149 

131877 

101016 

134786 

134100 

134078 

101132 

134849 

332736 

101152 

135397 

432642 

101168 

421155 

101226 

415138 

134739 

130155 

440538 

409916 

101294 

101300 

101310 

130344 

101381 

101381 

415678 

133780 

101396 

101447 

101458 

101470 

134604 

101478 

133519 

131185 

134116 

133999 

130174 

129963 

132983 

133900 

101543 

101545 

101620 

134691 

133595 

101700 

101714 

134246 

101760 

415022 

415022 

415022 

415022 

415022 

101791 



D84276 
D86425 
D86978 
D87012 
D87075 
D87432 
D87448 
D87845 

HG1098-HT1098 

HG2167-HT2237 

HG241541T2511 

HG28254fr2949 

HG2887-HT3031 

HG4660+rr5073 

HG4704-HT5146 

HG884-HT884 

HG919-HT919 

J00212 

J04029 

J04031 

J04088 

J04543 

L06139 

L07540 

L08895 

L11239 

L11353 

L13773 

L13800 

L14922 

L15189 

L15388 

L16895 

L27476 

L27624 

L32976 

L33404 

L35263 

137347 

L40371 

L40391 

L41607 

L77566 

M13928 

M13928 

M14016 

M14219 

M15796 

M21305 

M22092 

M22898 

M22995 

M23379 

M24400 

M25753 

M27691 

M28213 

M29550 

M29971 

M30269 

M31158 

M31166 

M31210 

M55420 

M59979 

M62810 

M64710 

M68874 

M74524 

M80254 . 

M81780 

M81780 

M81780 

M81780 

M81780 

M83822 



D84284 Hs.66052 
AW291587 Hs.82733 
D86978 Hs.84790 
D87012 Hs.1 94685 
AF164142 Hs.82042 
D87432 Hs.10315 
AA013051 Hs.91417 
NIVL000437HS.234392 
X70377 Hs.121489 
M019521 Hs.301946 
NM.004091HS.231444 
BE613606 Hs.142653 
AI368680 Hs.816 
AL039123 Hs.103042 
L12260 Hs.172816 
AF002225 Hs.180686 
AF128542 Hs.166846 
J00212 

J04029 Hs.99936 
AW067805 Hs.1 72655 
J04088 Hs.156346 
J04543 Hs.78637 
T29618 Hs.89640 
AA460085 Hs.171075 
L08895 Hs.78995 
L11239 Hs.36993 
BE409525 Hs.902 
Z83689 Hs.1 14765 
Aig84625 Hs.9884 
L14922 Hs.166563 
BE297635 Hs,3069 
NM_005308Hs.211569 
H87879 Hs.102267 
AF083892 Hs75608 
C18356 Hs.295944 
NWL002419HS.89449 
AA101043 Hs.151254 
W76332 Hs.79107 
BE313625 Hs.57435 
AF168418 Hs.116784 
BE535511 
L41607 Hs.934 
AW250122 Hs,154879 
AW675039 Hs.1227 
AW675039 Hs.1227 
AW005903 Hs.78601 
AA557660 Hs.76152 
BE267931 Hs.78996 
M21305 
M22092 

NIVL000546HS.1846 
NM^002884Hs.865 
N1VL002890HS.758 
AW583062 Hs.74502 
BE280074 Hs.23960 
R84694 Hs.79194 
AA535244 Hs.78305 
M29551 Hs.151531 
M29971 Hs.1384 
M30269 

M31158 Hs.77439 
M31166 Hs.2050 
6E246154 Hs.154210 
S55271 Hs.247930 
AW382987 Hs.88474 
AA393273 Hs.75133 
D90337 HS547916 
M68874 Hs.211587 
D28459 H$.80612 
M80254 Hs.173125 
X59960 Hs.77813 
X59960 Hs.77813 
X59960 Hs.77813 
X59960 Hs.77813 
X59960 Hs.77813 
M83822 Hs.62354 



CD38 antigen (p45) 

nidogen 2 

KIAA0225 protein 

topoisomerase (DNA) III beta 

solute carrier family 23 (nucleobase tra 

solute carrier family 7 (catlonic amino 

topoisomerase (DNA) II binding protein 

ptatetet-actlvating foctor acetylhydrola 

cystatinD 



Homo sapiens. SimDar to hypothetical pr 
ret finger protein 

SRY (sex detemtining region Y)-box 2 
microtubute-associated protein IB 



ubiquitin protein ligase E3A (human papl 
polymerase pNA directed), epsilon 
Empirically selected from AFFX single pr 
keratin 10 (epidermolytic hyperkeratosis 
methytenetetrahydrofolate dehydrogenase 
topoisomerase {DUA) II alpha (170kD) 
annexin A7 

TEK tyrosine kinase, endothelial (venous 
replication factor C (activator 1) 5 (36 
MADS box transcription enhancer factor 2 
gastrulation brain homeo box 1 
neurofibromin 2 (bilateFal acoustic neur 
myeloid/lymphoid or mixed-lineage leukem 
spindle pole body protein 
replication factor C (activator 1) 1 (14 
heat shock 70kD protein 9B (mortaliiv2) 
G protein-coupled receptor kinase 5 
lysyl oxidase 

tight junction protein 2 (zona ocdudens 
tissue factor pathway inhibitor 2 
mitogen-activated protein kinase kinase 
kalGkrein 7 (chymotiyptic, stratum com 
mitogen-activated protein kinase 14 
solute carrier family 11 (pioton-coupled 
thyroid hormone receptor interactor 4 
transmembrarie trafficking protein 
gluoosaminyl (N<ioe^) transferase 2. 1 
DIGeorge syndrome critk;al regnn gene D 
aminotevuiinate, delta-, dehydratase 
aminolevulinate, delta-, dehydratase 
uroporphyrinogen decarboxylase 
decorin 

proliferating cell nuclear antigen 
gb:Human alpha satellite and satellite 3 
gbrHuman neural cell adhesion nwlecule ( 
tumor protein p53 (U-Fraumeni syndrome) 
RAP1 A, member of RAS oncogene family 
RAS p21 protein activator (GTPase active 
chymotjypsinogen 81 
cydin B1 

cAMP responsive element binding protein 
RAB2. member RAS oncogene famiy 
protein phosphatase 3 (formeriy 2B), cat 
0-6-melhylguanine-DNA methyltransferase 
nidogen (enactin) 

protein kinase. cAti^P-dcpendent regulato 
pentaxin-related gene, rapidly induced b 
endothelial differentiation, sphingolipi 
Epsilon , IgE 

ppostaglandin-endoperoxkie synthase 1 (p 
. transcription factor 6-like 1 (mitochond 
natriuretic peptide precursor C 
phosphotipase A2. group IVA (cytosolic. 
ubiquitin-conjugating enzyme E2A (RAD6 h 
peptidytproiyt isomerase F (cydophilin 
sphir^omyelin phosphodiesterase 1, add 
sphingomyelin phosphodiesterase 1, acid 
sphingomyelin phosphodiesterase 1, add 
sphingomyelin phosphodiesterase 1. add 
sphingomyelin phosphodiesterase 1, add 
cell division cyde 4-{ike 
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101812 


M86934 


BE439894 


Hs.78g91 


101813 


M8733B 


NM.002914HS.139226 


133396 


M96326 


M96326 


Hs.72885 


428161 


M96954 


M96954 


Hs.182741 


129026 


M98&33 


AU20297 


Hs.108043 


101901 


S66793 


H38026 


Hs.308 


134831 


S72370 


AA853479 


Hs.89890 


134039 


S78S69 


NM-002290HS.78672 


442355 


379873 


AA456539 


Hs.8262 


101975 


S63325 


AA079717 


Hs.283664 


101977 


S83364 


AF1 12213 


Hs.184062 


101978 


S83365 


BE561610 


Hs.5809 


101998 


U01212 


U01212 


Hs.248153 


102003 


U01922 


U01922 


Hs.125565 


102007 


U02556 


U02555 


Hs.75307 


102009 


U02680 


BE245149 


Hs.82643 


416658 


U03272 


U03272 


Hs.79432 


132951 


U04209 


AW821182 Hs.61418 


135389 


U05237 


U05237 


Hs.99872 


102048 


U07225 


U07225 


Hs.339 


130145 


U07620 


U34820 


Hs.151051 


303153 


U09759 


U09759 


Hs.246857 


420269 


U09820 


U72937 


Hs.96264 


102095 


U11313 


U11313 


Hs.75760 


102123 


U14518 


NM«001809Hs.1594 


102126 


U 14575 


AW950870 HS78961 


102133 


U15173 


AU076845 


Hs.155596 


102139 


U15932 


NM^004419Hs.2128 


102162 


U18291 


AA450274 


Hs.1592 


102164 


U18300 


NM„000107Hs.77602 


4'27653 


U18383 


AA159001 


Hs.180069 


131817 


U20536 


U20536 


Hs.3280 


102200 


U21551 


AA232362 


Hs,l 57205 


102210 U23028 


BE619413 


Hs2437 


102214 


U23752 


U23752 


HS329&I 


132811 


U25435 


U25435 


HS37419 


131319 


U25997 


N(4.003155HsJ25590 


102256 


U2B251 


U28251 


Hs.53237 


132316 


U28831 


U28831 


Hs.44566 


102269 


U30245 


U30245 




417526 


U32315 


AA568908 


Hs.82240 


102293 


U32439 


AFQ90116 


Hs.79348 


102298 


U32849 


AA382169 


HS.544B3 


102325 


U35139 


At815867 


Hs.50130 


428734 


U36764 


BE303044 


Hs.192023 


102361 


U39400 


AA223616 


Hs.75859 


102367 


U39657 


U39656 


Hs.118825 


102388 


U41344 


AA362907 


HS.7&494 


102394 


U41766 


NWL003816HS.2442 


129829 


U41813 


AF010258 


Hs.127428 


102409 


U43286 


BE300330 


Hs.1 18725 


133746 


U44378 


AW410035 Hs.75862 


102423 


U44754 


Z47M2 


Hs.179312 


132828 


U47011 


AB014615 


Hs.57710 


132828 


U47011 


AB014615 


Hs.57710 


132828 


U47011 


AB014615 


Hs,57710 


132828 


U47011 


AB014615 


Hs.57710 


425322 


U47077 


U63630 


Hs.l 55637 


102450 


U48251 


U48251 


Hs.75871 


129350 


U50535 


U50535 


Hs.1 10630 


102534 


U56833 


U96759 


Hs.198307 


130457 


U58091 


AB014595 


Hs.155976 


135065 U58&37 


AA019401 


Hs.93909 


102560 


U59289 


R97457 


Hs.63984 


102567 


U59863 


U63830 


Hs.146847 


417173 


U67122 


U61397 


Hs^1424 


102638 


U67319 


U67319 


Hs.9216 


132736 U68019 


AW081883 Hs.211578 


133070 U69611 


U92649 


Hs.64311 


102663 U70322 


NM_002270Hs.168075 


134660 U73524 


U73524 


Hs.87465 


102735 


U79267 


AF11110B 


Hs.3382 


102741 


U79291 


AW959829 Hs.B3572 


130564 


U82671 


U82671 


Hs.36980 


130564 U82671 


U82671 


Ms.36980 


132164 


U84573 


A1752235 


Hs.41270 



DNA segment numerous copies, expressed 
replication factor C (activator 1) 2 (40 
azurocidin 1 (cationic antimicrobial pro 
TIA1 cytotoxic granule-associated RNA-t)i 
Friend leukemia virus integration 1 
arresUn 3, retinal (X-arrestin) 
pyruvate carboxylase 
tamlnin, alpha 4 

lysosomal-assodated membrane protebi 2 

aspartate beta-hydroxylase 

putative Rab5-lnteracting protein 

putative transmembrane protein; homolog 

olfactory marker protein 

translocase of Inner mitochondrial membr 

t-complax-associated-tests-expressed 1- 

protein tyrosine kinase 9 

fibriliin 2 (congenital contracturai ara 

microfibriliar-associated protein 1 

fetal Alzheimer antigen 

purinerglc receptor P2Y, G-proteIn coupl 

mitogen-aclivated protein kinase 10 

mitogen-activated protein kinase 9 

alpha thalassemia/mental retardation syn 

sterol canier protein 2 

centromere protein A (17kD) 

protein phosphatase 1, regulatory (inhib 

BCL2/adenovirus E1B 19kD-interactin9 pro 

dual specificity phosphatase 5 

CDC 16 (ceil division cycte 16, S. cemvi 

damage-speciflc DNA binding protein 2 (4 

nuclear respiratory factor 1 

caspase 6, apoptosis-related cysteine pr 

branched chain aminotransferase 1. qrtos 

eukaryotic translatton Initiation factor 

SRY (sex determining region Y>-box 11 

CCCTC-blnding factor (zinc finger protei 

stanntocaldn 1 

ESTs. Highly similar to Z169.HUMAN ZINC 
KIAA1641 protein 

gbiHuman myek>mono(^ specific protein 
syntaxin 3A 

regulator of G-protein signalTing 7 
N-myc (and STAT) interactor 
necdin (mouse) honnotog 
eukaryotk: transtatton initiation factor 
chromosome 11 open reading frame 4 
mitogen-activated protein kinase kinase 
proline arginine-rich end ieudne-rich r 
a disintegrin and metalloproteinase doma 
homeoboxA9 

selenopt>osphato synthetase 2 
MAD (mothers against decapentaplegk:, Dr 
small nuclear RNA activating complex, po 
fibroblast growth factor 8 (androgen-ind 
fibroblast growth factor 8 (androgen>ind 
fibroblast growth factor 8 (androgen-ind 
fibroblast growth factor 8 (androgen-ind 
protein kinase, DNA-activated. catalytic 
protein kinase C binding protein 1 
Human BRCA2 regon. mRNA sequence C6006 
von Hippel-Llndau bindirig protein 1 
cullln4B 

cyclic nucleotide gated channel beta 1 

cadherfn 13. H-cadherin (hearQ 

TRAP famSy member-associated NFKB adiv 

ubiquitin-like 1 (sentrin) 

caspase 7, apoptosls-related cysteine pr 

Homo sapiens cDNA: FIJ23037 fis. done L 

a disintegrin and metalloproteinase doma 

karyopherin (importin) beta 2 

ATP/GTP-binding protein 

protein phosphatase 4, regulatory subuni 

hypothetical protein MGC14433 

melanoma antigen, family A, 2 

melanoma antigen, family A. 2 

procolEagen-fysine. 2*oxQg(U1arate 5-dio 
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102823 

102826 

102831 

102846 

129777 

134161 

134854 

429257 

413985 

419768 

102915 

134656 

413858 

102988 

102971 

134037 

134037 

103023 

103037 

130262 

134542 

128568 

128568 

103093 

413076 

129063 

424460 

411077 

103181 

103184 

103194 

103208 

129698 

131486 

130729 

103334 

132645 

135094 

103352 

103352 

103353 

132173 

103371 

131584 

103376 

103378 

128510 

103410 

133490 

332689 

103438 

103440 

103452 

133536 

420234 

426502 

134662 

132083 

103500 

134389 

132084 

103540 

133152 

103548 

103612 

129092 

103692 

103695 

129796 

434993 

132683 

131887 

103723 

453368 

133260 

103765 



U90914 

U91316 

U91932 

U96131 

U97018 

U97188 

V00503 

X04327 

X06389 

X07496 

X07820 

X14787 

X15525 

X16396 

X16609 

X53586 

X53586 

X53793 

X54936 

X55740 

X57025 

X60673 

X60673 

X60708 

X62048 

X63097 

X63563 

X64037 

X69636 

X69878 

X70B49 

X72B41 

X74987 

X83107 

X84194 

X85753 

X87870 

X89066 

X89398 

X89398 

X89399 

X89426 

X91247 

Xgi648 

X92098 

X92110 

X94703 

X96506 

X97230 

X97230 

X98263 

X98296 

X99584 

Y00264 

Y07566 

Y07759 

Y07827 

Y07867 

Y09443 

Y09858 

Y12394 

Z11559 

Z11695 

Z15005 

Z46261 

AA011243 

AA018418 

AA018758 

AA018804 

AA031993 

AA044217 

M04654a 

AA057447 

AA058376 

AA083572 

M085696 



D85390 Hs.5057 
NM_007274Hs.8679 
M262170 Hs.80917 
BE264974 Hs.6566 
U97018 Hs.12451 
AA634543 Hs.79440 
J03464 Hs.179573 
AW163799 Hs.198365 
AI018666 Hs.75667 
T72104 Hs.93194 
X07820 H5.2258 
AI750878 Hs.87409 
NM_001610Hs.75589 
AU076611 Hs.154672 
X16609 H5.183805 
AI808780 Hs.227730 
AI808780 Hs.227730 
AW500470 Hs.117950 
BE018302 Hs.2894 
BE245380 Hs.153952 
M14156 Hs.85112 



HI 2912 
H12912 
S79876 
U10564 
X63094 



Hs.274691 

Hs.274691 

Hs.44926 

Hs.75188 

Hs.283822 



BE275979 Hs.296014 
AW977263 Hs.68257 
X69636 Hs.334731 
U43143 Hs.74049 
NM_004939Hs.78580 
AW411340 Hs.31314 
BE242144 Hs.12013 
F06972 Hs.27372 
AI963747 Hs.18573 
NM_001260Hs.25283 
AI654712 Hs.54424 
NM_003304Hs.250687 
H09366 Hs.78853 



H09366 



X89426 
X91247 



Hs.78853 
Hs.119274 
Hs.41716 
Hs.13046 



AA598509 Hs.29117 
AL036166 H&323378 
AL119690 Hs.153618 
X94703 

AA158294 Hs.295362 
AF022044 Hs.274601 
AF022044 Hs,274601 
AW175781 Hs.152720 
X98296 Hs.77578 
NI\4«006936Hs.85119 
W25797.comp 
AW404908 Hs.96038 
Y07759 Hs.170157 
NM_007048Hs.284283 
BE386490 Hs.279663 
AW408009 Hs.22580 
Y09858 Hs.82577 
NIVL002267HS.3886 
NI\A.002197Hs.154721 
Z11695 Hs.324473 
Z15005 HS75573 
BE336654 Hs.70937 
D56365 Hs.63525 
.AW137912 Hs.227583 
' AW207152 Hs.186600 
BE216319 Hs.5807 
AA306325 H5.4311 
BE264633 H5.143638 
W17064 Hs.332848 
BE274312 Hs^14783 
W20296 Hs^88178 
AA403045 Hs.6906 
AA085696 Hs.169600 



carboxypeptldase D 

cytosolic acyl coenzyme A Ihioester hydr 
adaptor-related protein complex 3, sIgma 
thyroid hormone receptor Interador 13 
echinodenn micnotubule^associated protei 
I6F>{I mRNA-binding protein 3 
collagen, type I. alpha 2 
2,3-bl5phosphoglyceFate mutase 
synaptophysin 
apoilpoproteinA-l 

matrix metalloproteinase 10 (stromeiysin 

thromt)ospondin 1 

add phosphatase 2. lysosomal 

methylene tetrahydrofolafe dehydrogenase 

ankyrin 1, erythrocytic 

integnn, alpha 6 

integrin. alpha 6 

multifunctional polypeptide similar to S 

placental growth factor^ vascular endoth 

5' nucleotidase (CD73) 

insurm-like growth factor 1 (somatomedi 

adenylate kinase 3 

adenylate kinase 3 

dipeptidyipeptldase IV (CD26. adenosine 

weel (S. pombe) homolog 

Rhesus blood group, D antigen 

polymerase (RNA) II (DNA directed) polyp 

general transcription factor IIP, polype 

Homo sapiens, clone iMAGE:3448306, mRNA. 

fms-related tyrosine kinase 4 

DEAD/H (Asp-Glu-Ala-Asp/His) box potypep 

retinoblastoma-binding protein 7 

ATP'binding cassette, sub-family E (OABP 

BMX non-receptor tyrosine kinase 

acylphosphatase 1, erythrocyte (common) 

cyclin-dependent kinase 8 

hepatocyte nuclear factor 4. alpha 

transient receptor potential channel 1 

uradl-ONA glycosyiase 

uradl-DNA glycosyiase 

RAS p21 protein activator (GTPase acUva 

endothelial oell^ecific molecule 1 

thiored(»dn reductase 1 

pufbie^ element binding protein A 

coated veslde membrane protein 

HC6VI1M prot^ 

RAB28, member RAS oncogene family 

DRI-assodated protein 1 (negative cofac 

killer cell immunogbbulin-like receptor 

knier cell immunoglobulin-like receptor 

M-phase phosphoprotein 6 

ubiquitin spedfic protease 9. X chromos 

SMT3 (suppressor of mif two 3, yeast) ho 

Hs.1 77486 amyloid beta (A4) precursor protein (pro 

Ric (Orosophila)-nke, expressed \s\ many 

myosin VA (heavy polypeptide 12, myoxin) 

butyrophilin, subfamily 3, member A1 

Pirin 

alkylglycerone phosphate synthase 
spindlin-tike 

kaiyopherin alpha 3 (importin alpha 4} 

aconttase 1, soluble 

mitogen-activated protein kinase 1 

centromere protein E (312kD) 

H3 histone family^ member A 

po{y(iC>^inding protein 2 

Homo sapiens chromosome X map Xpl 1 ^ L- 

ESTs 

GTPase Rab14 

SUMO-1 acthrating enzyme subunit 2 
WD repeat domain 4 
SWI/SNF related, matrix associated, acU 
Homo sapiens cDNA FU14041 lis. done HE 
Homo sapiens cDNA FU1198& fis, done HE 
Homo sapiens cDNA: FLI23197 fis. done R 
KIAA0826 protein 
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103766 AA088744 

103767 AA089688 
132051 AA091284 
103773 AA092700 
135289 AA092968 
409859 AA094800 
103794 AA100219 
131471 AA1 14885 
134319 AA129547 
103807 AA133016 
446392 AA149507 
129863 AA151005 
103850 AA187101 
103855 M195179 
103861 AA206236 
130634 AA227621 
447735 AA248283 
103909 AA249611 
458928 AA282640 
415824 AA287199 
129013 AA313990 
129435 AA314256 
103988 AA314389 
104000 AA324364 
425284 AA329211 
128629 AA399187 
133281 AA421079 
104104 AA422029 
332455 AA425230 
132091 AA447052 
135073 AA452000 
131367 AA456687 
129593 AA487015 
133505 C01527 
132064 C01714 
442351 C01811 
131427 C02352 
433892 C02375 
104282 C14446 
134827 D16611 
425330 D25216 
131742 D31352 
456935 D58024 
425218 080897 
104334 D82614 
134593 D87845 
134731 D89377 
445776 H06583 
131670 H40732 
104394 H46617 
104402 H56731 
439130 H75570 
129077 H78886 
104417 H81241 
134927 L36531 
129280 M63154 
134498 M63180 
104460 M91504 
104488 N56191 
131248 N78483 
130017 R14652 
104530 R20459 
104534 R22303 
104544 R33779 
133328 R36553 
104567 R64534 
129575 R70621 
130776 R79356 
104599 R84933 
104660 AA007160 
104667 AA007234 
104718 AA018409 
104764 AA025351 

104786 AA027168 

104787 AA027317 
134079 AA029423 



AI920783 H5.191435 
BE244667 

AA393968 Hs.180145 
AI219323 Hs.101077 
AW372569 Hs.9788 
AW970843 Hs.55662 
AF244135 Hs.30670 
AA164842 Hs.192619 
BE304999 Hs.285754 
AW9S8264 Hs.103832 
AF142419 Hs.15020 
BE379765 Hs.129872 
AA187101 Hs.213194 
W02363 

AA206236 Hs.4944 
AI769067 Hs.127824 
AA775258 Hs.6127 
AA249611 Hs.47438 
AF043117 Hs.24594 
D42039 Hs.78871 
AA371155 Hs.107942 
AF151852 Hs.111449 
AA314389 Hs.342e49 
AI146527 Hs.80475 
AF155568 Hs.348043 
AL096748 Hs.102708 
AK001601 Hs.69594 
AA422029 Hs.143640 
NM-005754H8.220689 
AW954243 
W55956 Hs.94030 
AI750575 Hs.173933 
A1338247 Hs.98314 
AI630124 Hs.324504 
AA121098 Hs.3838 
W52642 Hs.8261 
AF151879 Hs.26706 
AI929357 H5.323966 
C14448 Hs.332336 
BE314037 Hs.8g866 
D25216 Hs.155650 
AA961420 Hs.31433 
AA370362 Hs,57958 
NWL014909Hs,155182 
D82614 Hs.78771 
NM 000437HS.234392 
D89377 Hs.89404 
NM_001310Hs.13313 
H03514 Hs.15589 
AA129551 Hs.172129 
H56731 Hs,132956 
AA306090 Hs.124707 
N74724 Hs.108479 
Ai819448 Hs.320861 
L36531 Hs.91296 
M63154 Hs.1 10014 
AW246273 Hs,84131 
AW955705 Hs.62604 
N56191 Hs.106511 
AI038989 Hs^32633 
AKOOOOge Hs.143198 
AKD01676 Hs.12457 
R22303 

AI091173 Hs.222362 
AW452738 Ks.265327 
AA040620 H5.5672 
F08262 Hs.278428 
AF167706 Hs.19280 
AW815036 Hs.151251 
BE298665 Hs.14846 
AI239923 Hs.63931 
AI143020 Hs.36250 
AI039243 Hs.278585 
AA027167 HSJ0031 
AA027317 

AKD01751 Hs.171835 



ESTs 

C6t-100 protein 
HSPC030prot^ 

ESTs. Weakly similar to T22363 hypottieU 

hypothetical protebi MGC10924 sinular to 

eukaryotic translation.initiatlon factor 

hepatocellular carcinoma-associated anti 

KIAA1600 protein 

fumaratel^dratase 

similar Id yeast Upf3. variant B 

homotog of mouse quaking QKI (KH domain 

sperm associated antigen 9 

hypothetical protein IVIGC10895 

hypothetical protein FU10330 

hypothetical protein FU12783 

ESTs. Weakly similar to T28770 hypotheti 

Homo sapiens cDNA: FU23020 fis, clone L 

SH3 domain binding glutamic acid-rich pr 

ubiquitination factor E4B (homologous to 

mesodenn development candidate 2 

DKF2P564M1 12 protein 

CGI-94 protein 

ADP-ribosylation factor-like 5 

polymerase (RNA) II (DNA directed) polyp 

NSI-assodated protein 1 

DKFZP434A043 protein 

high-mobility group 20A 

ESTs, Weakly similar to hyperpotarizatio 

Ras-GTPase-activating protein SH3-domain 

KIAA0251 protein 

Homo sapiens mRNA; cDNA DKFZpS86E1624 (f 
nuclear factor t/A 

Homo sapiens mRNA; cDNA DKFZp586L0120 (f 
Homo sapiens mRNA; cDNA DKFZp586J0720 (f 
semm-indudble kinase 
hypothetical protein FLJ22393 
CGM21 protein 

Homo safuens done H63 unknown mRNA 
EST 

ooproporphyrinogen oxidase (coproporphyr 

KIAA0014 gene produd 

ESTs 

EGF-TM7-latrophi[in-related protein 
KIAA1036 protein 
phosphoglyce/ate kinase 1 
platelet-activating factor acetylhydrola 
msh prosophila) homeo box homolog 2 
cAMP responsive element binding protein- 
ESTs 

Homo sapiens cDNA: FU21409 fis. done C 

ESTs 

ESTs 

ESTs 

KruppeMike factor 8 
integrin, alpha 8 

gastric intrinsic factor (vitamin B synt 

threonyl-tRNA synthetase 

Homo sapiens, done 1MAGE:4299322. mRNA. 

protocadherin 17 

Bafdet-Btedl syndrome 2 

inhibitor of growth fiamlly, member 3 

hypothetkal protein FU10814 

gb:yh26b09.r1 Soares placenta Nb2HP Homo 

ESTs, Weakly similar to p40 [H.saplens] 

hypothetical protein DKFZp761t141 

^othetteal protein AF140225 

progestin induced protein 

cysteine-rich motor neuron 1 

ESTs 

Homo sapiens mRNA; cDNA DKFZp564D016 (fr 
ESTs 

ESTs. Weakly similar to 138022 hypotheti 
ESTs 

KIAA0955 protein 

gb:2e97d11.s1 Soares_fetaLheafLNbHH19W 
hypothetical protein FU108d9 
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104804 

104865 

130828 

104907 

104943 

105013 

105024 

132592 

105038 

105077 

105096 

129215 

105169 

132796 

427210 

105200 

130114 

105330 

105337 

422040 

105376 

105397 

431679 

131991 

421305 

105489 

105508 

105539 

135172 

131569 

431129 

105643 

105659 

105666 

105674 

105709 

105722 

105765 

115951 

130884 

105962 

105985 

106008 

457322 

134222 

446954 

106141 

447973 

106157 

428314 

446727 

106196 

457714 

133200 

106302 

106328 

450534 

106423 

439608 

106477 

106503 

446999 

106543 

442007 

106589 

106593 

106596 

423064 

106636 

106654 

131353 

106707 

452909 

106717 

453141 

106747 



AA031357 

AA045136 

AA053400 

AA055829 

AA065217 

AA1 16054 

AA1 26311 

AA129390 

AA130273 

AA142919 

M150205 

AA176867 

AA1 80321 

AA1 80487 

AA187634 

AA195399 

AA234717 

AA234743 

AA234957 

AA235604 

AA236559 

AA242868 

AA251776 

AA251909 

AA252672 

AA256157 

AA256680 

AA258873 

AA262727 

AA281451 

AA281545 

AA282069 

AA283044 

AA283930 

AA284755 

AA291268 

AA291927 

AA343514 

AA398109 

AA398109 

AA405737 

AA406610 

AA411465 

AA416886 

AA424013 

AA424148 

AA424558 

AA424g61 

AA425367 

AA425921 

AA426220 

AA427735 

AA430673 

AA432248 

AA435896 

AA436705 

AA446561 

AA448238 

AA449756 

AA450303 

AA452411 

AA454566 

AA454667 

AA456437 

AA456646 

AA456826 

AA456981 

AA458959 

AA459950 

AA460449 

M463910 

AA464603 

AA464606 

AA465093 

AA465692 

M476473 



A1858702 Hs.31803 
T79340 Hs.22575 
AW631469 Hs.203213 
AA055829 Hs.196701 
AF072873 Hs.1 14218 
H63789 Hs.296288 
AA126311 Hs.9879 
AW803564 Hs.28885a 
AW503733 Hs.9414 
W55946 Hs.234863 
ALO42506 Hs.21599 
AB040930 Hs.1 26085 
BE245294 Hs.180789 
NM_006283Hs.173159 
BE396283 Hs.173987 
AA328102 Hs.24641 
AA233393 Hs.14992 
AW338625 Hs.22120 
AI468789 Hs.347187 
AA172106 Hs.1 10950 
AW994032 Hs.8768 
AA814807 Hs.7395 
AK000046 Hs.343877 
AF053306 Hs.36708 
BE397354 Hs.324830 
AA256157 Hs.24115 
AA173942 Hs.326416 
AB040884 Hs.109694 
AB028956 Hs.12144 
AL389951 Hs.271623 
AL137751 Hs.263671 
BE621719 Hs.173802 
AA283044 Hs.25625 
AA426234 Hs.34906 
A1609530 Hs.279789 
AI928962 Hs.26761 
AI922821 Hs.32433 
AA299688 Hs.24183 
BE546245 Hs.301048 
BE546245 Hs.301046 
AW880358 Hs.339808 
AA405610 
AB033888 Hs.8619 
AI815486 Hs.243901 
AW855861 Hs.8025 
A6037850 Hs.16621 
AF031463 Hs.9302 
AB011169 H5.20141 
W37943 Hs.34892 
AW135049 Hs.26285 
AB011095 Hs.i6032 
AA525993 Hs.173699 
AA083764 

AB037715 Hs.183639 
AA398859 Hs.18397 
AL079559 Hs.28020 
A1570189 Hs^5132 
AB020722 Hs.16714 
AW864696 Hs.301732 
R23324 Hs.41693 
AB033042 Hs.29679 
AA1 51520 

AA676939 Hs.69285 
AA301116 Hs.142838 
AK000933 Hs.28661 
AW296451 Hs.24605 
AA452379 
AF265208 Hs.8740 
AW958037 Hs.286 
AW075485 Hs.286049 
AW754182 
AK000566 Hs.98135 
NM.015368HS.30985 
AA600357 Hs.239489 
AB014548 Hs.31921 
NM_007118Hs.171957 



ESTs, Weakly simflar to N-WASP IH^sapien 
B-cell CLL/lymphoma 6. member B (zinc fi 
ESTs 

ESTs, Weakly similar to ALUI.HUMAN ALU S 

frizzled (Drosophila) homolog 6 

ESTs, Weakly similar to KIAA0638 protein 

ESTs 

Homo sapiens cDNA: FLJ22526 fts, done H 
KIAA1488 protein 

Homo sapiens cDNA FU1 2082 fts, done HE 
Knjppel-ltke factor 7 (ubiquitous) 
KIAA1497 protein 
S164 protein 

transfomiing, addic coiled-coil contain 
eukaryotic translation initiation fador 
cytoskeleton associated protein 2 
hypotl^etical protein FLJ1 1 151 
ESTs 

myotubuiarin related protein 1 
Rag C protein 

hypothetical protein FU10849 
hypothetical protein FU23182 
hypothetical protein FLJ20039 
budding uninhibited by benztmldazotes 1 
diptheria toxin resistance protein requl 
Homo sapiens cDUA FLJ14178 fis. done NT 
Homo sapiens mRNA; cDNA DKFZp564H1916 (f 
KIAA1451 protein 
KIAA1033 protein 
nudeoporin 50kD 

Homo sapiens mRNA; cDNA OKFZp43410812 (f 

K1AA0603 gene produd 

hypothetical protein FU11323 

ESTs, Weakly simflar to T17210 hypotheti 

histone deacetylase 3 

DKFZP586L0724 protein 

ESTs 

ESTs 

sec13-like protein 

sec13-iike protein 

hypothetical protein FU10120 

gb:zv15b10.s1 Soares_NhHMPu_S1 Homo sapi 

SRY (sex detennining region Y)-box 18 

Homo sapiens cDNA FIJ20738 fis, done HE 

Homo sapiens done 23767 and 23782 mRNA 

DKFZP434I1 16 protein 

phosdudn-like 

similar to S. oerevisiae SSM4 
KIAA1323 protein 

Homo sapiens cDNA FLJ10643 fis, done NT 
K1AA0523 protein 

ESTs. Weakly similar to ALU1_HUMAN ALU S 

hypothetical protein MGC3178 

hypothetical protein FLJ10210 

hypothetical protein FLJ23221 

KIAA0766 gene produd 

KIAA0470 gene produd 

Rho guanine exchange fador (GEF) 15 

hypothetical protein MGC5306 

DnaJ (Hsp40) homolog, subfamily B. membe 

cofador required for Spl transcriptiona 

hypothetical protein MGC4485 

neuropilin 1 

nudeolar phosphoproteln Nopp34 

Homo sapiens cDNA FLJ10071 fis, clone HE 

ESTs 

ESTs. Moderately similar to ALU7_HUMAN A 
SWl/SNF related, matrix associated, acti 
ribosomal protein L4 
phosphoserine aminotransferase 
gb:RC2-CT0321-131199^11-c01 CT0321 Homo 
hypothetical protein FLJ20559 
pannexin 1 

TIA1 pytotoxIcgFanuIe-assodated RNA-bi 
K1AA0648 protein 

trfpte functional domain (PTPRF interad 
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106773 


AA478109 


AA478109 


Hs.188833 


106781 


AA478474 


AA330310 


Hs^4181 


106817 


AA480889 


D61216 


Hs.18672 


106848 


AA485223 


AB037744 


Hs.34892 


106846 


AA485254 


AA44g014 


H5.121025 


108856 AA486183 


W58353 


Hs.285123 


416699 


AA496936 


BE539639 


Hs.173030 


107001 


AA59858g 


AI926520 


Hs.31016 


442853 


AA598831 


AW021276 Hs.17121 


107054 


AA600150 


A1076459 


Hs.15978 


107059 


AA608545 


BE614410 


Hs.23044 


107080 AA609210 


AL122043 


Hs.19221 


107115 


AA610108 


BE379623 


Hs.27693 


107130 AA620582 


AB033106 


Hs.12913 


lUf lot) 




AA137043 


Hs.9663 


iU/l/4 




BE122762 


Hs.25338 


130621 


AA621718 


AW513087 Hs.16803 


107190 


D19673 


AA836401 


Hs.87860 


132626 


D25755 


AW504732 Hs.21275 


107217 


D51095 


AL080235 


Hs.35861 


332564 


D60272 


AA357879 


Hs.29423 


444655 


T08879 


AF088886 


Hs.11590 


107295 


T34527 


AA188629 


Hs.80120 


107299 


T40327 


BE277457 


Hs.30661 


107315 


T62771 


AA316241 


Hs.90691 


107316 


T63174 


T63174 


Hs.193700 


107328 


T83444 


AW959891 HSJ6591 


107334 


T93641 


T93597 


Hs.1 87429 


456340 


U48263 


U48263 


Hs.89040 


126636 


U49065 


U49065 


Hs.102865 


129938 


U79300 


AW003668 Hs.135587 


107375 


U88573 


BE011845 


Hs^51064 


130074 


U93867 


AL038596 


Hs.250745 


107387 


W01094 


086983 


Hs.1 18893 


132036 


W01568 


AL157433 


Hs.37706 


107426 


W26853 


W26853 


Hs.291003 


135388 


W27985 


W27965 


Hs.99865 


130419 W36260 


AF037448 


Hs.155489 


107469 W47063 


W47083 


Hs.94668 


434203 W79060 


BE262677 


Hs.263558 


107508 


W88550 


AB028981 


Hs.8021 


132358 


X60488 


NM_003M2Hs.46423 


107522 


X78931 


X7B931 


Hs.99971 


456495 


Z14077 


NM_003403Hs.97496 


107582 


AA002147 


AA002147 


Hs.59952 


107609 AA004711 


R756M 


Hs.164797 


107661 


AA010383 


AA010383 


Hs.60389 


107714 


AA015761 


AA015761 


Hs.60642 


107775 


AA018772 


AW008846 


Hs.60857 


107832 


AA021473 


AA021473 




107859 


AA024835 


AW732573 


Hs.47584 


107914 


AA027229 


AA027229 


Hs.61329 


107935 AA029428 


AA029428 


Hs.61555 


410196 


AA035143 


A1936442 


Hs.59838 


131461 


AA035237 


AA992841 


Hs.27263 


108007 


AA039347 


AA039347 


Hs.61916 


108029 


AA040740 


AA040740 


Hs.62007 


108040 


AA041551 


AL1 21031 


Hs.159971 


108084 


AA045513 


AA058944 


Hs.1 16602 


108088 


AA045745 


AA045745 


HS.628B6 


108168 


AA055348 


AI453137 


Hs.63176 


130719 


AA056582 


AA679262 


Hs.14235 


108189 


AA056697 


AW376061 


Hs.83335 


108190 AA056746 


AA05674S 


Hs.63338 


108203 


AA057678 


AW847814 


Hs.289005 


108216 AA058681 


AA524743 


Hs.44883 


108217 AAQ58686 


AA058686 


Hs.62588 


108245 AA062840 


BE410285 


Hs.89545 


108277 AA064859 


AA064859 




108280 AA065069 


AA065069 




108309 AA069923 


AA069818 




108340 M070815 


AA069820 


Hs.180909 


108403 


AA075374 


AA075374 




108427 


AA076382 


AA076382 




108435 


AA078787 


T82427 


Hs.194101 


108439 AA078986 


AA078986 





ESTs 
ESTs 
ESTs 

K1AA1323 protein 

chromosome 11 open reading frame 5 
Homo sapiens mRNAfull length insert cDN 
ESTs. Weakly sMar to ALU8_HUMAN ALU S 
putative DNA tiinding protein 
ESTs 

KtAA1272 protein 

RAD51 (S. oerevislae) homolog (E coli Re 
hypothetical protein DKFZp58661424 
peptidylproiyi Isomerase (cydophBInH 
KIAA1280 protein 

programmed cell death 6-intefBcting prot 
ESTs 

LUC7 (S^cerevisiaeHike 
ESTs 

hypothetical protein FLJ1101 1 
DKFZP586E1621 protein 
ESTs: Weakly similar to macrophage lecti 
cathepsin F 

UDP-N-acetyl-aIpha-0-galactosamine:polyp 

hypothetical protein MGC4606 

nucteophosmin/nudeoplasmin 3 

Homo sapiens mRNA; cDNA DKFZp586l0324 (f 

KIAA0887 protein 

ESTs 

prepronociceptin 
interleukin 1 receptor-tike 2 
Human done 23629 mRNA sequence 
high-mobility group (nonhistone chromoso 
polymerase (RNA) III (DNA directed) (62k 
Melanoma associated gene 
hypothetical protein DKFZp434E2220 
hypothetical protein MGC4707 
epimorphin 

NSI-assodated protein 1 
ESTs 

hypothetical protein PR01855 

KIAA1058 protein 

H4 histone family, member G 

zinc finger protein 272 

YY1 transcription Victor 

EST 

hypothetical protein FU13693 

ESTs 

ESTs 

ESTs 

gbi£66c11.s1 Scares retina N2b4HR Homo 
potassium voltage-gated channel, delayed 
ESTs. Weakly slmilarto T16370 hypothetl 
ESTs 

hypothetical protein F1J10808 

KIAA1458 protein 

EST 

ESTs 

SWI/SNF related, matrix assodated. acti 
Homo sapiens, done IMA6E:41 54008, mRNA. 
ESTs 
ESTs 

hypothetical protein FU20008; iaAA1839 
ESTs, IVtoderateiy similar to A46010 X-Dn 
EST 

Homo sapiens cDNA: FU21532 fis. done C 

ESTs 

ESTs 

proteasome (prosome. macropain) subunit. 
gb2m50fD3^1 Stratagene fibroblast (937 
gb2m12e1ts1 Stratagene pancreas (93720 
gb:zm67e03.r1 Stratagene neuroepithelium 
peroxiredoxin 1 

gb:zm87a0ls1 Stratagene ovarian cancer 
gb:zm91g08.s1 Stratagene ovarian cancer 
Homo sapiens cDNA: FLJ20869 fls, done A 
gb2m92h01.s1 Stratagene ovarian cancer 
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'lOQylcc 


AA079393 


A AA^AnAO 

M07y393 


Hs.3462 


iuo4Dy 


A A A70v(Q7 


A A A7 A A 

M0794o7 




lUoDUU 


A AnQQOAT 

AAUoo/U/ 


M083207 


Hs.68270 


lUoOui 


A AAOOOCC 


M083256 




lUOOoO 




M084415 








M100795 






AAUOOD/0 


AI732404 


Hs.68846 


1 JUOifU 




AI907537 


Hs.76698 




AA101255 


D14041 


Hs.347340 


13Uoo5 


AA126474 


AW067800 Hs.155223 


10O74S 


* A 4 AVAR'S 

AA127017 


M127017 


Hs.71052 


10oou7 


AA129988 


AI652236 


Hs.49376 




AA130240 


M045088 


Hs.62738 


1 08833 


AA131866 


AF188527 


Hs.61661 


108646 


AA1 32983 


AL1 17452 


Hs.44155 


108057 


A A4f>'90Cn 

AA1o32S0 


AK001468 


Hs.62180 


1o1474 




L46353 


Hs.2726 


4 na DOii 




AK001431 


Hs.5105 




AA14OD00 


M148650 




'inQOco 


A AiC'lI'lA 


AI304870 


Hs.188680 




A A4CC701 

AA155/54 


AW995610 Hs.332436 


luyuui 


A A 4CC40C 


AI056548 


HSJ2116 


131183 


A A 4 CCOOA 

AA15o2o9 


AI611807 


Hs.285107 


109019 


A A 4 f ^ Aft^ 

Ml 56997 


Ml 56755 


Hs.72150 


109022 


AA1 57291 


Ml 57291 


HSJ21479 


109023 


Ml 57293 


Ml 57293 


Hs.72168 


109068 


Ml 64293 


Ml 64293 


Hs.72545 


109072 


Ml 64 676 


AI732585 


Hs.22394 


426981 


M167375 


AL044675 


Hs.173081 


130346 


M1 67550 


H05769 


Hs.188757 


109146 


Ml 76589 


M176589 


Hs.142078 


109172 


Ml 80448 


M180448 


Hs.144300 


428438 


M187144 


NM_001955Hs.2271 


129208 


M189170 


AI587376 


Hs.109441 


109222 


AAA r**T 

M1 92757 


M192833 


Hs.333512 


109300 


AA205650 


M418276 


Hs.170142 


109481 


AA233342 


M878923 


Hs.289069 


103485 


AA233472 


BE619092 


Hs.28465 


109516 


AA234110 


At471639 


Hs.71913 


109537 


D80981 


AI858695 


Hs.34898 


109556 


F01660 


AI9252S4 


.Hs.87385 


109577 


F02208 


F02208 


Hs.296639 


109578 


F02208 


F02208 


Hs.27214 


109595 


F02544 


M078629 


Hs.27301 


109625 


F03918 


H29490 


Hs.22697 


428376 


F04258 


AF119665 


Hs.184011 


109648 


CAjiCAA 

F04600 


H17800 


Hs.7154 


109671 


CAOAAO 

F0o99o 


R59210 


Hs.26634 


4nncnn 
109699 


r09o05 


H18013 


Hs.167483 


4 And OA 

109820 


F11115 


AW016809 


Hs.1 19021 


.109933 


H06371 


R52417 


Hs.20945 


110014 


H10995 


AL109666 


Hs.7242 


110039 


H1 1938 


HI 1938 


Hs.21907 


110099 


HI 6568 


R44557 


Hs.23748 


110107 


HI 6772 


AW151660 


Hs.31444 


110155 


HI 8951 


AI559626 


Hs.93522 


110197 


H20859 


AW090386 


Hs.1 12278 


110223 


H23747 


H19836 


Hs.31697 


110306 


H38087 


H38087 


Hs.105509 


110335 


1 1 A/\00 «t 

H40331 


H65490 


Hs.18845 


110342 


H40567 


H40961 


Hs.33008 


110395 


H46966 


M025116 


Hs.33333 


.110511 


H56640 


H56640 


Hs.221460 


110523 


IJC74CJI 


AI040384 


Hs.19102 


110715 


Line 741 
nSD712 


H96712 


Hs.269029 


110754 


N20814 


AW302200 


Hs.6338 


428454 


N25249 


U55936 


Hs.184376 


431663 


Ai#^v4nn 

N27100 


NM.01656gHs.267182 




N09bl0 


AW973443 Hs.8086 


110938 


N48982 


N46982 


Hs.38034 


110983 


N51957 


N^L015367Hs.10267 


111081 


N59435 


A1146349 


Hs.271614 


111128 


N64139 


AW505364 Hs.19074 


431548 


N66981 


AI834273 


Hs.9711 


111216 


N68640 


AW139408 Hs.152940 


437562 


N69352 


AB001636 


Hs.5683 



cytochrome c oxidase subunit Vile 
gb:zm97ID8.s1 Stratagene colon HT29 (937 
EST 

gb:zn08g12.s1 Stratagene iiNT neuron (937 
gbu^06g09.s1 Stratagene hNT neuron (937 
gbu:m26c08^1 Stratagene pancreas (93720 
ESTs 

stress-associated endoplasmic reticulum 
H.2K binding factor-2 
stanniocalcln 2 
ESTs 

hypothetical protein FU20644 
ESTs 

ESTs. Weakly similar to AF174605 1 F-box 

DKFZP586G1517 protein 

anillin (Diosophila Scraps homolog), act 

high-mobility group (nonhistone chromoso 

hypothetical protein FU10569 

gb:zo09e06^1 Stratagene neuroeplthelium 

ESTs 

EST 

hypothetical protein FU20992 similar to 
hypothetical protein FU13397 
ESTs 

ubinuclein 1 

ESTs 

ESTs 

hypothetical protein FU10893 
KIM0530 protein 

Homo sapiens, done MGC:5564» mRNA. comp 

EST 

EST 

endothelin 1 
MSTP033 protein 
similar to rat myomegalin 
ESTs 

hypothetical protein FU21016 

Homo sapiens cDNA: FLJ21869 fis. done H 

ESTs 

ESTs 

ESTs 

Homo sapiens potassium channel subunit ( 

ESTs 

ESTs 

ESTs 

pyrophosphatase (Inorganic) 

ESTs 

ESTs 

ESTs 

ESTs 

Homo sapiens done 24993 mRNA sequence 

Homo sapiens mRNA full length Insert cDN 

hfstone acetyltransferase 

ESTs 

ESTs 

Homo sapiens mRNA for KIM1647 protein, 

annestin, beta 1 

ESTs 

CTL2gene 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs. Weakly similar to organic anion tr 
ESTs 

KIM0672gene product 
synaptosomal-assodated protein. 23kD 
TBX34S0 protein 

RNA (guanine-7-) methyttransferase 
Homo sapiens cDNA FU12924 fis. done NT 
MIL1 protein 
C6M 12 protein 

LATS (large tumor suppressor. Drosophila 

novel protein 

ESTs 

DEAD/H (Asp-GkMIa-Asp/Hls) box polypep 
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111399 R00138 


AW270776 Hs.18857 


ESTs 


111514 R07998 


R07998 




gb:yf169l1.s1 Scares fetal liver spleen 


428744 R08929 


BE267033 


Hs.192853 


ublquitin-conjugating enzyme E2G 2 (homo 


111574 R10307 


A(024145 


Hs.188526 


ESTs 




AA482478 


Hs.181785 


ESTs 


111831 R3fi083 


R36095 


H5.268695 


ESTs 


426773 R3793& 


NIWL015556HS.172180 


KIAA0440 protein 


iii9n4 R^Qisn 


Z41572 




ab*HSCZYB122 noimalized Infant biatn cDN 


428371 R40816 


AB012193 


Hs.183874 


cunin 4A 


112033 R43162 


R49031 


Hs.22627 


ESTs 


130987 R45698 


BE613289 


Hs.21893 


hypothetical protefn DKFZp761N0624 


112300 R54SS4 


H24334 


H5.26125 


ESTs 


112513 R68425 


R68425 


Hs.13809 


hypothetical protein FU10648 


112514 R68S68 


R68568 


Hs.183373 


src homology 3 domaln-contalnlng protein 


112522 R68763 


R68857 


Hs.265499 


ESTs 


112540 R70467 


R69751 




gb:yi40a10.s1 Scares placenta Nb2HP Homo 


428655 R73565 


H05769 


Hs.1 88757 


Homo sapiens, done MGC:5564, mRNA, comp 


129534 R73R40 


AKa02126 


Hs.11260 


hypothetical protein FU11264 




R78376 


Hs.29733 


EST 


119719 RQ24*53 


R92453 


Hs.34590 


ESTs 


45179fl T03fi65 


BE297587 


Hs.27047 


hvDOthetical orotein FLJ20392 


112888 T03fl72 


AW195317 Hs,107716 


hypothetical protein FU22344 


131863 T10072 


AI656378 


Hs.33461 


ESTs 


112911 T10080 


AW732747 Hs.13493 


like mouse brain protein E46 


132215 T10132 


AL035703 


Hs.4236 


K1AA0478 gene product 


112931 T15343 


702966 


Hs.1 67428 


ESTs 


112984 T23457 


718971 


Hs.289014 


ESTs, Weakly similar to A43932 mucin 2 p 


112QQ8 T23555 


H11257 


Hs.22968 


Homo saolens done IMAGE'451 939 mRNAse 


13337fi 72*^670 


BE618768 


Hs.7232 


ac6tvl-CoBn2vma A carboxylase atoha 


inn9R T91QAA 


AA376654 




eukarvotic translation initiation factor 


inn7ft T'vwR^ 

1 fOU/U I «X>*4D*4 


AB032977 


Hs.6298 


K)AA1151 Dinteln 


H lUf O 1 1 OfM lO 


A1375672 


Hs.165028 


ESTs 


nou/*» io**on 


AK001335 


Hs.31137 






AA828380 


Hs.126733 


CO 1 9 


111-170 T'i'\1R9 


BE622021 


Hs.152571 


FS7s Hkihhr similar to IGP.Il mRNA-bInd 


11000/ IffHiJO 


777453 


H5.302234 




1 101^ 1 1 0**U057 


AI769400 


Hs.189729 


ESTs 


1 10*19*? 1 Ou*i«/0 


A1022166 


Hs.16188 


ESTs 


1114A1 TR7fiQ3 

1 IO*tO 1 1 Of 09i9 


T87693 


Hs.204327 


EST 




AA302862 


Hs.90063 


• neurocakjn delta 


113557 790945 


H66470 


Hs.16004 


ESTs 


113S59 790987 


T79763 


Hs.14514 


ESTs 


113589 791863 


AI078554 


Hs.15682 


ESTs 


113591 791881 


T91881 


Hs.200597 


KIAA0563 aen3 oroduct 


11361Q 793783 


R08665 


Hs.17244 


hvDothetical orotein FU 13605 


113683 796687 


AB035335 


Hs.144519 


T«cell leukemia/lvmohoma 6 


1136.02 7Q6Q44 


AL360t43 


Hs.17936 


DKFZP434H132 orotein 


113709 7Q7307 

1 10/ 1 9f Ovf 


797307 




nh*vp53hi0't si Soares feftal liver snleen 


113717 7Q7764 

1 lO/ • / 1 3» / OH 


T99513 


Hs.187447 


ESTs 


11 3824 W48817 


A1631964 


Hs.34447 


ESTs 


113AdO W58343 


R72137 


Hs.7949 


DKFZP586B2420 orotein 




AI369275 


HS.243D10 


Hnmn <;anlpns cDtsIA FLJ14445 lis done HE 


113002 W74644 


AA340111 


HS.10Q009 


acvt-Coenzvme A oxidase 1 oalmitovl 


m904 W74761 


AF125044 


Hs.19196 


ubiquitin*<x>njugatbig en^fme HBUCE1 


113905 W74802 


R81733 


Hs.33106 


ESTs 


m031 W81205 


BE255499 


Hs.3496 


hypothetical protein MGC15749 


113Q32 W81237 


AA256444 


Hs.126485 


hypothetical protein FLJ12604; KIAA1692 


131065 W90146 


W79283 


Hs.35962 


ESTs 


11^035 W92708 


W92798 


Hs.269181 


ESTs 


114106 Z38412 


AW602528 




ob-RC5-BT0562-2601004)11-A02 BT0562 Homo 


4R7308 738700 


At416988 


Hs.238272 


Inn^itnl 1 4 5-trinhosnhate rec^ntor tv 


liAIRI 738Qnd 


BE548222 


Hs.299883 


hivnnfhtpttnal nrotein FLJ23399 


ilOAQilO 730103 


AF052212 


Hs.153934 


mro>tiinHin(i factor nintdonnin aintia 

^^llT^UII IvIUI^ ICIV«WI| ■ Ul 11 1 Kill 1^ cuuiia 




AWD69534 H&279583 


C6I^1 orotein 


198037 Z39939 


AA251380 


Hs.10726 


ESTs, Weakly sirhllarto ALU1 J1UMAN ALU S 


4395S4 Z40012 


AI479813 


Hs^8411 


NQ<-assodated orotein 1 


114277 Z40377 


AI052229 


Hs.25373 


ESTs» Wealdy similar to T20410 hypothetl 


114304 Z40820 


Aig34204 


Hs.16129 


ESTs 


114364 Z41680 


AL117427 


Hs.172778 


Homo sapiens mRNA; cDNA DKFZp566P013 (fr 


432620 AA005112 


AA777749 


Hs.5976 


UM domain only 7 


129034 AA005432 


AA481157 


Hs.108110 


DKFZP547E2110 protein 


131881 AA010163 


AW361018 Hs.3383 


upstream regulatory element binding prot 


332421 AA026356 


AI909968 


Hs.108106 


transcnption factor 


114465 AA026g01 


BE621056 


Hs.131731 


hypothetical protein FU11099 


451271 M036867 


AK001644 


Hs^156 


hypothetical protein FU10782 


332498 AA044644 


AA303661 




lymphocyte-specific protein 1 
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431555 AA046426 AI815470 Hs.260024 

132944 AA054515 T96641 Hs.6127 

114618 AA084162 AW979261 Hs.291993 

332509 AA085749 AA128376 Hs.153884 

5 114648 AA101055 AA101056 

114658 AA102746 AA102383 Hs.249190 

132456 AA114250 AB011084 Hs.48924 

450847 AA126561 NM^003155Hs.25590 

132225 AA128980 AA128980 

10 437197 AA129757 W38586 

114709 AA129921 AA397651 Hs.301959 

456926 AA133331 AB018284 Hs.158688 

114750 AA135958 AA887211 Hs.129467 

42680B AA136524 T19228 Hs.172572 

15 114763 AA147044 AA810755 Hs.102500 

114767 AA148885 AI859665 Hs.154443 

114774 AA150043 AV656017 Hs.164325 

129388 AA151621 AA662477 Hs.110964 

457742 AA155743 BE561824 Hs.273369 

20 456200 AA156335 AA768242 Hs.80618 

130207 AA156336 AF044209 Hs.144904 

114798 AA159181 AA159181 Hs.54900 

114800 AA159825 219448 Hs.131887 

114828 AA234185 AA252937 Hs.283522 

25 114846 AA234929 BE018682 Hs.166195 

114848 AA234935 BE614347 Hs.169615 

114902 AA236359 AW275480 Hs.39504 

132271 AA236466 AB030034 Hs.1 15175 

114907 AA236535 N29390 Hs.13804 

30 420170 AA236935 U43374 Hs.95631 

132204 AA236942 AA235827 Hs.42265 

114928 AA237018 AA237018 Hs.94869 

132481 AA237025 W93378 Hs.49614 

114932 AA242751 AA971436 Hs.16218 

35 314162 AA242760 BE041820 Hs.38516 

131006 AA242763 AF064104 Hs.22116 

114935 AA242809 H23329 Hs.290880 

408908 AA243133 BE296227 Hs.250822 

437754 AA243495 R60366 Hs.5822 

40 114957 AA243706 AW170425 Hs.87680 

114974 AA250848 AWg66931 Hs.302649 

114977 AA250868 AW296978 Hs.87787 

114995 AA251152 AA769266 Hs.193657 

115005 AA251544 AI760825 Hs.153042 

45 417177 AA251792 NM.0044S8HS.81452 

115026 AA252144 AA251972 Hs.188718 

115045 AA252524 AW014549 Hs.58373 

115068 AA253461 AW512260 Hs.87767 

133138 AA255522 AV657594 Hs,181161 

50 332668 AA255522 AV657594 Hs.181161 

115114 AA256468 AA527548 Hs.7527 

129584 AA256528 AV656017 Hs.184325 

115137 AA257976 AW968304 Hs.56156 

417187 AA258296 AB011151 Hs.334659 

55 115166 AA258409 AF095727 Hs.287832 

115167 AA258421 AA749209 Hs.43728 

436719 AA262077 Y11192 Hs.5299 

115239 AA278650 BE251328 Hs.73291 

115243 AA278766 AA805600 Hs.116665 

60 428419 AA280791 U49436 

115322 AA28G819 L08895 Hs.78995 

413303 AA28a828 AW836130 Hs.75277 

115372 AA282195 AW014385 Hs.88678 

409962 AA283127 U82671 Hs.57698 

65 130269 AA284694 F05422 Hs.168352 

456570 AA291137 AA286914 Hs.183299 

332675 AA291708 BE439944 

407864 AA293495 AF069291 Hs.40539 

115536 AA347193 AK001468 Hs.62180 

70 408799 AA398474 AA059412 Hs.47986 

115575 AA398512 AA393254 Hs.43619 

115601 AA400277 AA14B984 Hs.48849 

434428 AA400896 D14540 Hs.199160 

115683 AA410345 AF255910 Hs.54650 

75 115715 AA416733 BE395161 Hs.1390 

132952 AA425154 AI658580 Hs.61426 



Cdc42 effector protein 3 

Homo sapiens cDNA: FU23020 fts; done L 

ESTs 

ATP binding protein assodated with cell 
gb:zn25b03.s1 Stratagene neuroepithelium 
tumor necrosis factor receptor superfami 
KtAA0512 gene product; ALEX2 
stanniocalcin 1 

gb:zoOga11.s1 Stratagene neuroepithelium 
guanine nucleotide binding protein (G pr 
proline synthetase co-transcribed (bacte 
KIAA0741 gene product 
ESTs 

hypothetical protein FLJ20093 
hypothetical protein dJ511E16^ 
minichromosome niaintenance deficient (S. 
CG)-76 protein 

hypothetical protein FU23471 

uncharacterized hematopoietic stem/proge 

hypothetical protein 

nuclear receptor co-repressor 1 

serologically defined colon cancer antig 

ESTs. Weakly similar to T24396 hypotheli 

Homo sapiens mRNA; cONA DKFZp434J1912 (f 

ATPase, Class I, type 8B. member 1 

hypothetical protein FU20989 

hypothetical protein M6C4308 

sterile-alpha motif and leucine zipper c 

hypothetical protein dJ462023^ 

Human nonnal Iceratinocyte mRNA 

ESTs 

ESTs 

ESTs 

KIAA0903 protein 

Homo sapiens, done MGC:15887, mRNA. com 

CDC14 (cell division cyde 14, S. cerevi 

ESTs. Weakly similar to ALU1_HUMAN ALU S 

serine/threonine kinase 15 

Homo sapiens cDNA: F1J22120 fis, done H 

ESTs 

nucleosome assembly protein 1-Dke 1 

ESTs 

ESTs 

ESTs 

fatty-add-Coenzyme A ligase, long-chain 

ESTs 

ESTs 

ESTs 

Homo sapiens cDNA FU14643 Tts, clone NT 
ESTs 

small fragment nuclease 
CGI-76 protein 
ESTs 

hypothetical protein MGC14139 
myelin protein zero-like 1 
hypothetical protein 

aldehyde dehydrogenase 5 family, member 
hypothetical protein FLJ10881 
KIAA1842 protein 
KIAA1856 protein 

MADS box transcnption enhancer factor 2 
hypothetical protein FU 13910 
ESTs, Weakly similar to Unknown [H.sapie 
Target CAT 

nudeoporin-like protein 1 

ESTs 

ESTs 

chromosome 8 open reading frame 1 
anillin (Drosophila Scraps homolog). act 
hypothetical protein MGC10940 
ESTs 

ESTs. Weakly similar to ALU4 J^UMAN ALU S 
myetoid/lymphoid or mbced-ilneage ieukem 
jundtonal adhesion molecule 2 
pioteasome (proscme, macropain) subunit 
Homo sapiens mesenchymal stem cell prote 
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115819 


AA426573 


AA4B6620 


Hs.41135 


409124 


AA431418 


AW292809 Hs.50727 


115695 


AA436182 






458073 


AA437099 


t\r\ l9ilOQ9 




115982 


AA446585 


AIDuOoO 1 




115967 AA446B87 






115974 AA447224 


DCS lOH*tA 




115985 AA447709 


AA447709 


Hs.268115 


129254 


AM53624 


AA252468 


Hs.1098 


448730 AA455044 


BE384932 


Hs.64313 


116095 AA456045 


AA043429 


Hs.62618 


426856 AA460454 


R19768 


Hs.172788 


116210 AA476494 


BE622792 


Hs.172788 


116213 AA476738 


AA292105 


Hs.326740 


432645 AA481422 


D14041 


Hs.347340 


116265 AA482595 


BE297412 


Hs.55189 


129334 


AA485084 


AW157022 Hs.343551 


116274 


AA485431 


AI129767 


Hs.182874 


426002 


AA489638 


BE514376 


Hs.165998 


116331 


AA491000 


N41300 


Hs.71616 


116333 


AA491250 


AF155827 


Hs.203963 


132994 


AA505133 


AA1 12748 


Hs.279905 


418538 


AA598447 


BE244323 


Hs.85951 


116391 


AA599243 


T86558 


Hs.75113 


116394 


AA599574 


NM_006033Hs,65370 


134531 


AA600153 


AI742845 


Hs.1 10713 


116417 


AA609309 


AW499664 




116429 


AA609710 


AF191018 


Hs.279923 


116439 


AA610068 


AA251594 


Hs.43913 


116459 


AA621399 


R80137 


Hs.302738 


427505 


AA621752 


AA361562 


Hs.178761 


409633 


C21523 


AW449822 Hs.55200 


116541 


D12160 


D12160 


Hs^49212 


132557 


D19708 


AA1 14926 


Hs.169531 


414954 


D25801 


AA337548 


Hs.333402 


116571 


D45652 


D45652 


Hs.211604 


451522 


D60208 


BE565817 


Hs^98 


421919 


D80504 


AJ224901 


Hs.109526 


116643 


F03010 


AI367044 


Hs.153638 


116661 


F04247 


R61504 




116715 


F10966 


AL1 17440 


Hs.170263 


116729 F13700 


BE549407 


Hs.115823 


316709 


H05063 


R52576 


Hs.285280 


418999 


H16758 


NM_000121Hs.89548 


116773 


H17315 


AI823410 


Hs.343581 


116780 


H22566 


H22566 


Hs.63931 


453884 


H48459 


AA355925 


Hs.36232 


116819 


H53073 


H53073 


Hs.93698 


427278 


H58559 


AL031428 


Hs.174174 


407833 


H57957 


AW955632 Hs.66666 


116844 


H64938 


H64938 


Hs.337434 


116845 


H54973 


AA649530 


Hs.348148 


116892 


H69535 


A1573283 


Hs.38458 


116925 


H73110 


H73110 


Hs.260603 


1*16981 


H81783 


N29218 


Hs.40290 


453133 


H86259 


AC005757 


Hs.31809 


117031 


H88353 


H88353 


Hs.347265 


117034 


H88639 


U72209 




431129 


H88675 


AL137751 


Hs.263671 


417861 


H93708 


AA334551 




117280 


N22107 


M18217 


Hs.172129 


117344 


N24046 


R19085 


Hs.210708 


117422 


N27028 


AI355562 


Hs.43880 


117475 


N30205 


N30205 


Hs.93740 


117487 


N30621 


N30621 


H5.44203 


117937 


N33258 


AF044209 


Hs.144g04 


130207 


N33258 


AFa44209 


Hs.144904 


117549 


N333g0 


N33390 


Hs.44483 


117683 


N40180 


N40180 




117710 


N45ig8 


N45198 


H5.47248 


117791 


N48325 


N48325 


Hs.93956 


117822 N48913 


AA706282 


Hs.93963 


422544 


N49394 


AB018259 


Hs.1 18140 


117895 N50656 


AW450348 Hs.93996 


452259 N50721 


AA317439 


Hs.28707 


133057 


N53143 


AA465131 


Hs.64001 



endomucin-2 

N-acetyigtucosamlnidase, alpha- (Sanfill 

KIAA1209 protein 

ESTs 

hypothetical protein MGC10702 
ESTs 

hypothetical protein FU10631 

ESTs* Wealdy similar to T0B599 prot)at)le 

DKFZp434J1813 protein 

ESTs. Weaidy similar to AF257182 1 G-pro 

ESTs 

A1^3 protein 
AI£X3 protein 

hypothetical protein MGC10947 

H-2K binding factor-2 

hypothetical protein 

hypothetical protein FU22584 

guanine nucleotide binding protein (G pr 

PAI-1 mRNA-binding protein 

Homo sapiens mRNA; cDNA DKFZp586N1720 (f 

hypothetical protein FU10339 

ctoneHQ0310PRO0310p1 

exportin. tRNA (nuclear export receptor 

general transcription factor IIIA 

lipase, endothelial 

DEK oncogene (DNA binding) 

Human clone 23826 mRNA sequence 

putative nucleotide binding protein, est 

PIBF1 gene product 

Homo sapiens cDNA: FU21425 fis, clone C 
263 proleasome-assodated padi homotog 
ESTs 

polymerase (RNA) III (DNA directed) (155 
ESTs 

hypothetical protein MGC12760 
gb:HUMGS02848 Human adult lung 3^ direct 
hypothetical protein FU21657 
zinc finger protein 198 
myeloid/lymphoid or mixed-lineage leukem 
gb7h16a03^1 Soares infant brain 1NIB H 
tumor protein p534)indlng protein. 1 
rtbonuclease P. 40kD subunit 
Homo sapiens cDNA: FU220g8 fis. done H 
erythropoieOn receptor 
karyopherin alpha 1 (importin alpha 5) 
ESTs 

KIAA0186 gene product 
EST 

KIAA0601 protein 

ESTs, Weakly similar to S19560 proline-r 
ESTs. Weakly similar to A46010 X-(inked 
gb:ns44fD5^1 NCLCGAPJ^I Homo sapiens 
ESTs 

ESTs, Modefatety similar to A47562 B-cel 
ESTs 

hypothetical protein 

gb:yw21a02.s1 Morton Fetal Cochlea Homo 

YY1 -associated factor 2 

Homo sapiens mRNA; cDNA DKFZp434l0812 (f 

spenn specific antigen 2 

Homo sapiens cDNA: FLJ21409 fis. done C 

Homo sapiens cDNA FU13182 fis, done NT 

ESTs. Weakly slmBar to A46010 X-finked 

ESTs. Weakly similar to 138022 hypotheti 

ESTs 

nudear receptor co-repressor 1 
nudear receptor oo-repressor 1 
EST 

gb:yy44d02.s1 Soaies.mulb*ple_sderosls. 
ESTs. High^ similar to similar to Cdc14 
EST 

ESTs 

KIAA0716 gene product 
ESTs. Highly simnarto SORL.HUMAN SORTI 
signal sequence receptor, gamma (translo 
Homo sapiens done 25218 mRNA sequence 
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no lUo 


KICCOOC 


A A >trt4700 


nS,1o4134 


ESTs 


1 lolll 


MCC/IOO 






gb:yv50c02.s1 Scares fetal liver spleen 




No/ 4yj 


No/4yj 




gb:yy54c08.sl Soares^multi pte_sclerosis_ 






NbZ955 




Homo sapiens cDNA FLJ 11375 ns, clone he 






Nbo52U 




gb:yy62f01 .si Soares_muItjple_sclerosls_ 








ns.4/ ibb 


IJTA04 


Hi r uy© 


No4 lob 


AbUl / obo 


ns.i/Job^ 


inzzied (Drosopnua) nonnotog 7 


•1 4 QOCO 

11oJd3 


N6416o 


Ai183B38 


Hs.48938 


hypothetical protein FLJ21802 


118364 


N64191 


N46114 


Hs.29169 


hypothetical protein FLJ22623 


118475 


N66845 


N66845 




gb:za46c11.s1 Scares fetal liver spleen 


118491 


N67135 


AV647908 


1 t A^ A 

Hs.90424 


Homo sapiens cDNA: FU23285 fis, done H 


118500 


N67295 


W32889 


Hs.154329 


ESTs 


11o5B4 


Ndo9o3 


A lA 14 oenoo 

AW1 36926 




gblJl-H-B11-aap-(H)8-0-Ul.s1 NCLCGAP_Su 


456847 


NS9331 


AI252640 


Hs. 110364 


peptidylpnolyl isomerase C (cydophilin 


ilODOl 


N70777 


AL1 37554 


Lin A^ttfn 

nS.49927 


protein kinase NYD-oP15 


11od84 


N71364 


N71313 


Hs. 163986 


Homo sapiens cDNA: FLJ22765 fis, done K 


4 4QCOO 
1l0Q09 


N71o4D 




LIa <hacaa 
MS. 1 04544 


Homo sapiens, done iiv]AGc:33o5oo3, mRNA, 


4 4 DOHA 


N71571 


N71571 


Hs.269142 


ESTs 


4 4 OTce 
IIO/DD 


N74456 


N7445D 


nS.50499 


EST 


118793 


N75594 


N75594 


Hs.285921 


ESTs, Moderately similar to T47135 hypot 


4 4004*? 

noo17 


N79035 


A lOODOCO 

AlOOODDO 


ns.o0797 


ESTs 


118844 


N80279 


AL035364 


Hs.50891 


hypothetical protein 


118919 


N91797 


AW452696 


Hs. 130760 


myosin phosphatase, target subunit 2 


129558 


N92454 


AW580922 


Hs. 180446 


karyopherin (importin) beta 1 


407604 


N94581 


AW191962 


Hs.288061 


collagen, type vIlK alpha 2 


118996 


N94746 


N94746 


Hs.274248 


hypothetical protein FLJ20758 


119021 


N98238 


N98238 


Hs.55185 


ESTs 


119039 


R02384 


All 60570 


Hs.252097 


pregnancy spedfic beta-1-gIycoprotein 6 


119063 


R16833 


R16833 


Hs.53106 


ESTs, Moderately similar to ALU1_HUMAN A 


332622 


R41828 


R10674 




CSR1 protein 


119111 


R43203 


T02865 


Hs.328321 


EST 


415115 


R46395 • 


AA214228 


Hs.127751 


hypothetical protein 


119146 


R58863 


R58863 


Hs.91815 


ESTs 


449224 


R78248 


AW995911 


Hs.299883 


hypothetical protein FLJ23399 


119239 


Til 483 


T11483 




gb:CHR90049 Chromosome 9 exon Homo sapie 


119281 


T16896 


AI692322 


Hs.65373 


ESTs, Weakly similar to T02345 hypotheti 


119298 


T23820 


NM_001241Hs.155478 


cydin T2 


126502 


T30222 


T10077 


Hs.13453 


hypothetical protein FU 14753 


419983 


W15275 


W55956 . 


Hs.94030 


Homo sapiens mRNA; cDNA DKFZp586E1624 (f 


119558 


W38194 


W38194 




Empirically selected from AFFX single pr 


429641 


W42414 


AW081883 Hs.211578 


Homo sapiens cDNA: F1J23037 fis, done L 


419445 


W49832 


AA884471 


Hs.90449 


Human done 23B08 mRNA sequence 


119850 


W57613 


R82342 


Hs.79856 


ESTs, Weakly similar to S65657 a^ha-1C- 


119654 


W57759 


W57759 




gb:zd20g11.s1 Soares_fetaLheaiLNbHH19W 


119683 


W61118 


W65379 


Hs.57835 


ESTs 


119694 


W65344 


AA041350 


Hs.57847 


ESTs, Moderately similar to 1CE4_HUMAN C 


119718 


W69216 


W69216 


Hs.92848 


ESTs 


410365 


W69379 


AI287518 




Homo sapiens mRNA; cDNA DKFZp586D0923 (f 


119938 


W86728 


AW014862 Hs.58885 


ESTs 


120128 


Z38499 


BE379320 


Hs.91448 


MKP-1 like protein tyrosine phosphatase 


120130 


Z38630 


AA045767 


Hs.5300 


bladder cancer assodated protein 


120146 


Z39494 


F02806 


Hs.65765 


ESTs 


120155 


Z39623 


Z39623 


Hs.65783 


ESTs 


451979 


Z40071 


F06972 


Hs.27372 


BMX non-receptor tyrosine kinase 


120183 


Z40174 


AW082866 Hs.65882 


ESTs 


120184 


Z40182 


Z40182 


HS.65B85 


EST 


120211 


Z40904 


Z40904 


Hs.66012 


EST 


120245 


AA1 66965 


AW959615 Hs.111045 


ESTs 


120247 


A A 4C"7CAn 

AAlb75U0 


AA167500 


Hs.103939 


EST 


120254 


AA1 69599 


W90403 


Hs.111054 


ESTs 


120259 


AA 171724 


AW014786 Hs.192742 


hypotfiet/cal protein FU12785 


120260 


A A 474*70^ 

AA17i739 


AK000061 


Hs.101590 


hypothetical protein 


120275 


AA177105 


AA177105 


Hs.78457 


solute canier family 25 (mitodiondrial 


120284 


AA1 82626 


AA179656 




gb2p54e1 1 .si Stratagene NT2 neuronal pr 


417735 


AA1 00324 


AA188175 


Hs.82506 


KIAA1254 protein 


422137 


Ml 92099 


AJ236885 




zinc finger protein 148 (pHZ-52) 


120302 


AA192173 


AA837098 


Hs.269933 


ESTs 


120303 


A A ii AO 

AA192415 


AI216292 


Hs.96184 


ESTs 


40AQne 




AW295096 Hs.101337 


urKX)upiing proiein o vfniHM^nonQnai, pro 


120319 


AA194851 


T57776 


Hs.191094 


ESTs 


408729 


AA195520 


AA195764 


Hs.72639 


ESTs 


120326 


AA196300 


AA196300 Hs.21145 


hypothetical protein RG083M05^ 


133145 


AA196549 


H94227 


H5.6592 


Homo sapiens, clone IMAGE:2961368, mRNA. 


120327 


M196721 


AK0002g2 


Hs.130732 


hypothetical protein FU20285 


120328 


AAig6979 


AA923278 


Hs.290905 


ESTs. Weak^ similar to protease {H.sapi 


120340 


AA206828 


AA20682& 




gb:zq80b08^1 Stratagene hNT neuron (937 
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417122 AA207123 AI906291 Hs.81234 immunoglobulin superfamily, member 3 

131522 AA214539 AI380040 Hs.239489 TIM cytotoxic granule-associated RNA-bi 

421787 AA226914 AA227068 Hs.108301 nuclearreceptorsubfamily 2. group C. m 

120375 AA227260 AF028706 Hs.111227 Zlc family member 3 (odd-paired DrosophI 

5 120376 AA227469 AA227469 gb:zr18a07^1 Stratagene NT2 neuronal pr 

120390 AA233122 AA837093 Hs.111460 caldum/catmodurm-dependent protein kin 

410804 AA233334 U64820 Hs.66521 Machado-Joseph disease (spinocerebellar 

434223 AA233347 AI825842 Hs.3776 zinc finger protein 216 

312771 AA233714 AA018515 Hs.264482 Homo sapiens mRNA; cDNA DKFZp761 ACM 11 (f 

10 120396 AA233796 AA134006 Hs.79308 eukaryofic translation initiation fodor 

120409 AA235050 AA235050 gb:zs38e04^1 SoaresJlhHMPu.S1 Homo sapl 

120414 AA235704 AW137156 Hs.181202 hypothetical protein FU 10038 

120420 AA236031 A1128114 Hs.1 12885 sf^nal cord-derived growth fector-B 

120422 AA236352 AL133097 Hs.301717 hypothetical protein DKFZp434N 1928 
IS 419326 AA236390 W94915 Hs.42419 ESTs 

120423 AA236453 ' AA236453 Hs.18978 Homo sapiens cDNA: FU22822 fis, done K 
120435 AA243370 AA243370 Hs.96450 EST 

120453 AA250947 AA250947 Hs.170263 tumor protein p53-blnding protein. 1 

120455 AA251083 AA251720 Hs.104347 ESTs, Wealdy similar to ALUC^HUMANIHI 

20 120456 AA251113 AA488750 Hs.8d414 BTB and CNC homology 1. basic leucine zi 

120473 AA251973 AA251973 Hs.269988 ESTs 

126922 AA252023 A1244901 Hs.9589 ubiquilln 1 

120477 AA252414 AA252414 Hs.43141 DKFZP727C091 protein 

120479 AA252650 AF006589 Hs.1 10299 mitogen-actlvated protein kinase kinase 

25 120488 AA255523 AW952916 Hs.63510 KIAA0141 gene product 

120510 AA258128 At796395 Hs.111377 ESTs 

120527 AA262105 AA262105 Hs.4094 Homo sapiens cDNAFU14208fis.ctone NT 

120528 AA262107 A1923511 Hs.104413 ESTs 

120529 AA262235 A1434823 Hs.104415 ESTs 

30 120541 AA278298 W07318 Hs.240 M-phase phosphoprotein 1 

120544 AA278721 BE548277 Hs.103104 ESTs 

120562 AA280036 BE244580 Hs.342307 hypothetical protein FU10330 

120569 AA280548 AA807544 Hs.24970 ESTs, Weakly similar to 834323 GTP-bindi 

120571 AA28073a AB037744 Hs.34892 KIAA1323 protein 

35 120572 AA280794 H39599 Hs.294008 ESTs 

129434 AA280837 AW967495 Hs.186644 ESTs 

130529 AA280886 AA178953 Hs.309648 gb:zp39e03.s1 Stratagene muscle 937209 H 

120575 AA280934 AW978022 Hs.238911 hypothetical protein DKFZp762E1 511: KIAA 

409339 AA281535 AB020686 Hs.54037 ectonudeotidepyrophosphatase^phosphodi 

40 120591 AA281797 AF076847 Hs.191356 general transcripthsn factor IIH, polype 

120593 AA282047 AA748355 Hs.193522 ESTs 

430275 AA283002 Z11773 Hs^37786 zinc finger protein 187 

440303 AA283709 AA306166 Hs.7145 calpain? 

120609 AA283902 AW978721 Hs.266076 ESTs, Weakly similar to A46010X-linked 

45 409702 AA284108 AI752244 eukaryotic transtatton ekmgation fader 

456870 AA284109 AI241084 Hs.154353 nonsetedive sodium potassIunfi/|proton exc 

132614 AA284371 AA284371 Hs.118064 similar to rat nuclear ubiquitous casein 

458750 AA284744 AA1 15496 Hs.336898 Homo sapiens, Similar to RIKEN cDNA 1810 

135376 AA284784 BE617856 Hs.99756 mitochondrial ribosome recycling fador 

50 120621 AA284840 AW961294 Hs.1 43818 hypothetical protein FU23459 

452279 AA286844 AA286844 Hs.61260 hypothetical protein FU131 64 

332484 AA287032 AW172431 Hs.13012 ESTs 

120644 AA287038 AI869129 Hs.96616 ESTs 

120660 AA287546 AA286785 Hs.99677 ESTs 

55 135370 AA287553 BE622187 Hs.99670 ESTs, Weakly similar to 138022 hypotheti 

120661 AA287556 AA287556 Hs.263412 ESTs, Weakly simOar to ALUB.HUMAN til! 
429828 AA287564 A6019494 Hs.225767 IDN3 protein 

452291 AA291015 AF015592 Hs.28853 CDC7 {ceil division cycle 7, S. cerevisi 

120699 AA291716 AI683243 Hs.97258 ESTs, Moderately similar to S29539 ribos 

60 100690 AA291749 AA383256 Hs.1657 estrogen receptor 1 

120726 AA293656 AA293655 Hs.21198 ESTs 

120737 AA302430 AL049176 Hs.82223 chordin-like 

120745 AA302809 AA302609 gb:EST10426 Adipose tissue, white I Homo 

443574 AA302820 U83993 Hs.321709 purinergic receptor P2X.iigand-gated to 

65 120750 AA310499 AI191410 Hs.g6693 ESTs. Moderately slmllarto 21 09260A Be 

120761 AA321890 AA321890 branched chain keto add dehydrogenase E 

120768 AA340589 AA340589 Hs.104560 EST 

120769 AA340822 Ai769467 Hs.9475 ESTs 

135232 AA3424S7 AL038812 Hs.96800 ESTs. Moderately sbnitar to ALU7.HUMAN A 

70 120793 AA342864 AA342664 Hs.96812 ESTs 

120796 AA342973 A1247356 Hs.96820 ESTs 

120809 AA346495 AA346495 gb£ST52657 Fetal heart 11 Homo sapiens 

332633 AA347573 AL120071 Hs.48998 fibronedin leudne rtoh transmembrane p 

120825 AA347614 A1280215 Hs.96885 ESTs 

75 120827 AA347717 AA382525 Hs.132967 Human EST done 122887 mariner transpose 

120839 AA348913 AA348913 gb£Sr55442 Infant adrenal gland II Homo 
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120850 AA349647 AA349647 Hs.96927 

120852 AA349773 AA349773 Hs.191564 

128852 AA350541 R40622 Hs.l06601 

135240 AA357159 AA357159 Hs.96986 

5 120870 AA357172 AA357172 Hs.292581 

120894 AA370132 AA370132 Hs,97063 

435737 AA370472 AF229839 Hs.173202 

120897 AA370867 AA370867 Hs.97079 

120915 AA377296 AL135556 Hs.97104 

10 120935 AA383902 AL048409 Hs.97177 

120936 AA385934 AA385934 Hs.97184 

120937 AA386255 AA386255 Hs.97186 

120938 AA386260 AA386260 Hs.104632 
417832 AA386268 R20855 Hs.5422 

IS 120960 AA398014 AA3g8014 Hs.104684 

120985 AA398222 At219896 Hs.97592 

120988 AA398235 AA398235 Hs.97631 

121008 AA398348 AA398348 Hs.130546 

121029 AA398482 AA398482 Hs,97641 

20 121032 AA398504 AA393037 Hs.161798 

121033 AA398505 AA398505 Hs.97360 

121034 AA398507 AL389951 Hs.271623 

121035 AA398523 AA398523 Hs.210579 
121058 AA398625 AA398625 Hs.97391 

25 121060 AA398632 AA398632 Hs.97395 

121061 AA398633 AA393288 Hs.97396 

121091 AA398894 AA398894 Hs.97657 

121092 AA398895 AA398895 Hs,97658 
121094 AA398900 AA402505 

30 121096 AA398904 AA398904 Hs.332690 

121115 AA399122 AA398187 Hs.104682 

121121 AA399371 AA399371 Hs.189095 

121122 AA399373 A1126713 Hs,192233 
121125 AA399441 AL042981 Hs.251278 

35 121151 AA399636 AA399636 Hs.143629 

121153 AA399640 AA399640 Hs.97694 

121163 AA399680 AI676062 Hs.1 11902 

121176 AA400080 AL121523 Hs.97774 

121192 AA400262 AA400262 Hs.190093 

40 121223 AA400725 AI002110 Hs.97169 

121227 AA400748 AA400748 Hs.97823 

121231 AA400780 AA814948 Hs.96343 

121278 AA401631 AA037121 Hs.98518 

121279 AA401688 AA292873 Hs.177996 
45 121282 AA401695 AA401695 Hs.97334 

121299 AA402227 AA402227 Hs.22826 

121301 AA402329 NIVL006202HS.89901 

121302 AA402398 AA402587 Hs,325520 
121304 AA402449 AA293863 Hs.97316 

50 121305 AA402468 AA402468 Hs.291557 

134721 AA403268 AK000112 Hs.89306 

121323 AA403314 AA291411 Hs.97247 

121324 AA404229 AA404229 Hs.97842 
444422 AA404260 AI768623 Hs.108264 

55 131074 AA404271 U16125 Hs.181581 

121344 AA405026 AA405026 Hs.193754 

121348 AA405182 AA405182 Hs.97973 

121350 AA405237 AA405237 

121400 AA406061 AA406061 H3.98001 

60 121402 AA406063 AA406063 Hs.98003 

121403 AA406O70 AA406070 Hs.98004 

121408 AA406137 AA406137 Hs.98D19 

121431 AA406335 AA035279 Hs.176731 

121471 AA411804 AA411804 Hs.281575 

65 121474 AA411833 AA402335 Hs.168760 

121526 AA412219 AW665325 Hs.98120 

121530 AA412259 AA778658 Hs.9d122 

121558 AA412497 AA412497 

121559 AA412498 AI192044 Hs.104778 
70 121584 AA416586 A1024471 Hs.98232 

121609 AA416867 AA416867 Hs.98185 

121612 AA416874 AA416874 Hs.98168 

121737 AA421133 AA421133 Hs.104671 

121740 AA421138 AA421138 Hs.143835 

75 436032 AA422079 AA150797 Hs.109276 

121784 AA423837 T90789 Hs.94308 



Homo sapiens cDNA FU12573 fis, clone NT 

ESTs 

ESTs 

EST 

ESTs, Moderately simflar to ALUI.HUMAN A 
ESTs 

i>kappa-8-interacting Ras-like protein 1 
ESTs, htodeiately similar to AF174605 1 F 
ESTs 

ESTs, Wealtly similar to ALU1_HUMAN ALU S 
EST. Highly similar to (defllne notavai 
EST 
EST 

gtycoprotein MOB 

EST 

ESTs 

ESTs 

Human DMA sequence from done RP1 1-251 J8 

EST 

ESTs 

ESTs 

nucleoponn50kD 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs, IVIoderately similar to ALU8.HUMAN A 
EST 

gb:zt62h10.r1 SoaresJeslls^NHT Homo sap 

ESTs 

ESTs, WealOy similar to mitochondrial d 

similar to SALL1 (sal (DrosophilaHike 

ESTs, Highly similar to T00337 hypotheH 

KtAA1201 piotein 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs. Weakly similar to dJ667H1Z2.1 [H. 
Homo sapiens mRNA; cDNA DKFZp434D024 (fr 
ESTs. Weakly simtlartoALUC_HUMAN llll 
Homo sapiens cDNA FU1 1490 fis. done HE 
ESTs 
ESTs 

tropomodulin 3 (ubkiuitous) 

phosphodiesterase 4A, cAMP-spedfic (dun 

LATI^TM protein 

EST 

ESTs 

hypothetical protein FLI20105 

ESTs 

EST 

ESTs 

glutamate receptor, lonotropic. kainate 

ESTs 

ESTs 

gb:zt06e10.s1 NCLCGAP.GCBI Homo sapiens 

EST 

ESTs 

EST 

EST 

ESTs 

ESTs 

ESTs. Highly similar to Trad [H.sapiens] 

ESTs 

BSTs 

gbzt95g12^1 Soares_testis.NHT Homo sap 

ESTs 

ESTs 

EST 

ESTs 

eiythrocyte transmembrane protein 
EST 

latexin protein 

RAB35, member RAS oncogene family 
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121602 M424328 AI251870 H5.18889d ESTs 

121803 AA424339 A1336371 Hs.157173 ESTs 

135286 AA424469 AW023482 Hs.97849 ESTs 

332778 AA424469 AW023482 Hs.97849 ESTs 

5 121806 AA424502 AA424313 Hs.g8402 ESTs 

129517 AA425004 AW972853 Hs.1 12237 ESTs 

121845 AA425734 A1732692 Hs.165066 ESTs, Moderately sMar to ALU2_HUMAN A 

121853 AA425887 AA425e87 Hs.98502 hypothetical protein FU14303 

121891 AA426456 AA426456 Hs.g8469 ESTs 

10 121895 AA427396 AA427396 gb:zutf33a02^1 Scares ovary tumor NbHOT H 

121899 AA427555 R55341 HS.5D421 KIAAQ203 gene product 

121917 AA428218 AA406397 Hs.139425 ESTs 

121918 AA428242 BE274689 Hs.184175 chromosome 2 open reading frame 3 

121919 AA428281 AA428281 Hs.98560 EST 
15 121941 AA428865 AA428865 Hs.98563 ESTs 

121942 AA428994 AW452701 Hs.293237 ESTs 

121970 AA429666 AA429666 Hs.98617 EST 

121993 AA430181 AW297880 Hs.98661 ESTs 

418706 AA4301B4 U73524 Hs,87465 ATP/GTP-blnding protein 

20 122022 AA431293 AA431293 Hs.98716 ESTs. Moderately similar to T42650hypot 

122050 AA431478 Ai453076 EUV (embryonic lethal, abnonnal vision. 

122051 AA431492 AA431492 Hs.98742 EST 
122055 AA431732 AA431732 Hs.98747 EST 
122105 AA432278 AW241685 Hs.98699 ESTs 

25 '122125 AA434411 AK000492 Hs.98806 hypothetical protein 

135235 AA435512 AW298244 Hs.266195 ESTs 

122162 AA435698 AA628233 Hs79946 cytochrome P450, subfamily XIX (aromatlz 

422072 AA435711 AB018255 Hs.1 11 138 KIAA0712 gene product 

415106 AA435815 U407&3 Hs77965 peptidyt-prolyl isomerase 6 (cydophDin 

30 122186 AA435842 AA398811 Hs.104673 ESTs 

1 22235 AA436475 AA436475 Hs.1 12227 membrane-assodated nucleic acid binding 

412970 AA436489 AB026436 Hs.177534 dual spedftdty phosphatase 10 

419288 AA442060 AA256106 Hs.87507 ESTs 

1 22310 AA442079 AW192803 Hs.98974 ESTs. Weakly similar to S65824 reverse t 

35 122334 AA443151 BE465894 Hs.98365 ESTs. Weakly similar to 134DJ1UMAN NADP- 

122382 AA446133 AA446440 Hs.98643 ESTs 

122425 AA447145 AB007659 Hs.100955 K(AA0399 protein 

122431 AA447398 AA447398 Hs.99104 ESTs 

122450 AA447643 AA447643 Hs.112095 hypothetical protein DKFZp434F1819 

40 426284 AA447742 AJ404468 Ha2842S9 dynein.axonemat. heavypolypeptkle9 

122477 AA448226 AA448226 Hs.324123 ESTs 

122500 AA448825 AA448825 Hs.99190 ESTs 

122522 AA449444 AA299607 Hs.98969 ESTs 

122536 AA450087 AF060877 Hs.g9236 regulatorofG-protefn signalling 20 

45 122538 AA450211 AA450211 Hs.99239 ESTs 

122540 AA450244 AA476741 Hs.98279 ESTs, Weakly similar to A43932 mucin 2 p 

122560 AA452123 AW392342 Hs.283077 centrosomaf P4.1 -associated protein; unc 

421919 AA452155 AJ224901 Hs.109526 zinc finger protein 198 

122562 AA452156 AA452156 gb:zx29c03.s1 SoaresJotaUetus_Nb2HF8. 

50 122585 AA453036 AI681654 Hs.170737 hypothetical protein FU23251 

122608 AA453526 AA453525 Hs.143077 ESTs 

122635 AA454085 AA454085 gb:zx33a08.s1 SoarBS^totaLfetU5^Nb2HF8_ 

122636 AA454103 AW651706 Hs.99519 hypothetical protein FU14007 
122653 AA454642 AW009166 Hs.99376 ESTs 

55 122660 AA454935 Aiei6827 Hs.180069 nuclear respiratory factor 1 

122703 AA456323 AA456323 Hs.269369 ESTs 

122724 AA457395 AA457395 Hs.99457 ESTs 

122749 AA458850 AA458850 Hs.293372 ESTs, Weakly similar to B34087hypotheti 

122772 AA459662 AW1 17452 Hs.99489 ESTs 

60 430242 AA459658 U66669 Hs.236642 3-hydrDxyisobutyryl-Coen2yme A hydrolase 

429838 AA459679 AW904907 Hs.30732 hypothetical protein FU1 3409; KIAA1711 

122777 AA459702 AK001022 Hs.214397 hypothetical protein FU10160slmilarto 

135362 AA460017 AAg78128 Hs.99513 ESTs. Weakly similar to T17454 diaphanou 

122798 AA460324 AW366286 Hs.1456g6 splidng factor (CC13) 

65 122837 AA461509 AA461509 Hs^93565 ESTs, Weakly similar to putative pi 50 [H 

122860 AA464414 AA464414 gb:zx78g01^1 Soares ovary tumor NbHOTH 

122861 AA464428 AA335721 Hs.213628 ESTs 
122910 AA470084 AA470084 Hs.98358 ESTs 

132899 AA476606 AA476608 Hs^66 SMAD in the antisense ortentation 

70 122967 AA478521 AA806187 Hs.289101 glucose regulated protein. 58kD 

422845 AA478523 AA317841 Hs7845 hypothetteal protein M6C2752 

123009 AA479949 AA535244 Hs.78305 RAB2. member RAS oncogene family 

128917 AA481252 AI365215 Hs.206097 oncogene TC21 

123081 AA485351 AI815486. Hs.243901 Homo sapiens cDNA FU20738 fis, done HE 

75 123133 AA487264 AA487264 Hs,154974 Homo sapiens mRNA; cDNA DKFZp667N064 (fr 

123184 AA48g072 BE247767 Hs.18166 K1AA0870 protein 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



332467 


AA489630 


NM 014700HS.1 19004 


123233 


A A iinnooc 

AA490225 


AW974175 Hs.151875 


1/0234 


AA4yu22/ 


NM_001938Hs.16697 


400000 

12o23o 


AA49025O 


AW968504 Hs.123073 


•lOOOcc 




AA830335 


Hs.105273 


VIOA/14C 


A A >inAn4C 
AA49u91d 


AW768399 Hs.106357 


A AtiOnrt 

44oo92 


AA490925 


AF084535 


Hs.22464 


123259 


AA490955 


AI744152 


Hs.283374 


123284 


AA495812 


AA488988 


Hs.293796 


123286 


AA495824 


AA495824 


Hs.188822 


123315 


AA496369 


AA496369 




457397 


AA504125 


AW969025 Hs.109154 


433049 


AA521473 


AU076668 


Hs.334884 


123421 


AA598440 


AA598440 


Hs.291154 


123449 


AA598899 


AL049325 


Hs.112493 


426981 


AA599244 


AL044675 


Hs.173081 


409986 


AA599694 


NMJ)14777Hs.57730 


123497 


AA600037 


AA765256 


Hs.135191 


123604 


AA609135 


AA609135 


Hs.293076 


123712 


AAG09684 


AA609$84 




123731 


AAG09839 


AA609839 


Hs.334437 


123800 


AA620423 


AA620423 


Hs.112862 


123841 


AA620747 


AA620747 


Hs.1 12896 


123929 


AA621364 


AA621364 


Hs.1 12981 


123978 


C20653 


T89832 


Hs.170278 


133184 


D20085 


AA001021 


Hs.6685 


132835 


D20749 


Z83844 


Hs.5790 


435147 


D51285 


AL1 33731 


Hs.4774 


128695 


D59972 


NM_003478Hs,101299 


124029 


F04112 


F04112 


Hs.312553 


124057 


F 13504 


AA902384 


Hs.73853 


449316 


H01662 


AI609045 


Hs.321775 


130973 


H05135 


AI638418 


Hs.1440 


124108 


H12245 


H12245 




124136 


H22842 


H22842 


Hs,101770 


124165 


H30894 


H30039 


Hs.107674 


429627 


H43442 


NM_015340Hs.2450 


124178 


H45996 


BE463721 


Hs.97101 


129948 


H69281 


AI537162 


Hs.263988 


452114 


H69485 


N22687 


Hs.8236 


124-+0826254 


H69899 


H69899 


129056 


H70627 


AI769958 


Hs.108336 


427580 


H73260 


AK001607 


H5.44143 


426793 


H77531 


X89887 


Hs.172350 


124274 


H80552 


H80552 


HS.1Q2249 


129078 


H80737 


AI351010 


Hs.102267 


457658 


H93412 


AW952124 Hs.13094 


124315 


H94892 


NIVL005402HSJ288757 


437712 


H95643 


X04588 


Hs.85844 


124324 


H96552 


H96552 


Hs.159472 


452933 


H97146 


AW391423 


Hs.288555 


132231 


H99131 


AA662910 


Hs.42635 


421877 


H99462 


AW250380 


Hs.109059 


443123 H99837 


AA094538 


Hs.272808 


132963 


N22140 


AA099693 


Hs.34851 


420473 


N22197 


AL1 18782 


Hs.300208 


417381 


N23756 


AF164142 


Hs.82042 


130365 


N24134 


W56119 


Hs.155103 


456610 


N24195 


AF172066 


Hs.106346 


439311 


N26739 


BE270668 


Hs.151945 


124383 


N27098 


N27098 


Hs.102463 


124387 


N27637 


N27637 


Hs.109019 


129341 


N33090 


AI193519 


Hs.226396 


419793 


N35967 


AI364933 


Hs,l 68913 


124433 


N39069 


AA280319 


Hs.288840 


124441 


N46441 


AW450481 


Hs.161333 


132338 


N48270 


AA353868 


Ks.182982 


436575 


N48365 


AI473114 




124466 


N51316 


R10084 


Hs.1 13319 


408048 


N51499 


NM.007203HS.42322 


124483 


N53976 


AI821780 


Hs.179864 


124484 


N54157 


H66118 


Hs.285520 


124485 


N54300 


AB040933 


Hs.15420 


124494 


r454831 


N54831 


Hs.271381 


129200 


N59849 


N59849 


Hs.13565 


124527 


N62132 


N79264 


Hs^69104 



KIAA0665 gene product 

ESTs, Weakly similar to MAPB^HUMAN MICRO 

down-regulator of transcription 1 , TBP-b 

CDC2-related protein kinase 7 

ESTs 

ESTs 

epilepsy, progressive myoclonus type 2, 
ESTs, Weakly similar to CA15_HUMAN COU^ 
ESTs 

ESTs. Weakly similar to A46010 X-linked 
gb:2v37d10.sl Soares ovary tumor NbHOT H 
ESTs 

SEC10 (S. cerevisiae)-fike 1 

EST. Weakly similar to 138022 hypothetic 

Homo sapiens mRNA; cDNA DKFZp564D036 (fr 

KIAA0530 protein 

KIAA0133 gene product 

ESTs, Weakly similar to unnamed protein 

ESTs 

Homo sapiens cDl^: FU21543 fis. cfone C 

gb:ae62f01s1 StiBtagene lung carcbioma 

EST 

ESTs 

ESTs 

ESTs 

thyroid hormone receptor Interactor 8 

hypoihetical protein dJ37E16.5 

Homo sapiens mRNA; cDNA DKFZp76lC17l2 (f 

cuHinS 

gb:HSC2JH062 nomnaiized infant brain cDN 
bone morphogenetic protein 2 
hypothetical protein DKFZp434D1428 
DEAO/H (Asp-Gfu-Ala-Asp/Hls) box poiypep 
gb:ym17a12.r1 Soares infant brain 1NIB H 
EST 
ESTs 

leucyl-tRNA synthetase, mitochondrial 
putative G protein-coupled receptor 
ESTs 
ESTs 

gb:yu70c12.s1 Weizmann Olfactory Epithel 
ESTs. Weakly similar to ALUE.HUMAN HI! 
Homo sapiens done FLB6914 PR01821 mRI^. 
HIR (histone ceD cycle regulation defec 
EST 



presenifins associated rhombokl-like pro 
v-ral simian leukemia viral oncogene horn 
neurotrophic tyrosine kinase, receptor, 
Homo sapiens cDNA: FU22224 fis. clone H 
Homo sapiens cDNA: FU22425 fis, clone H 
hypothetical protein DKFZp434K2435 
mitochondrial ribosomal protein LI 2 
putative transcription regulatton nudea 
epsilon-tubulin 

Sec23-interactlng protein p125 
solute carrier fami^ 23 (nucleobase tra 
eukaryotic translation initiation factor 
retinoic acid repressibie protein 
mitochondrial ra)osomal protein L43 
EST 
ESTs 

hypothetical protein FU11126 

serine/threonine kinase 24 (Ste20, yeast 

PR01 575 protein 

ESTs 

go!gin-67 

ESTs 

kinesin heavy chain member 2 
A kinase (PRKA) anchor protein 2 
ESTs 

ESTs. Weakly similar to 2109260A 6 ceU 
KIAA1500 protein 

ESTs. Weakly similar to 138022 hypotheO 
Sam68-like phosphotyrosine protein. T-ST 
ESTs 



142 



3NSDOCID: <WO 02079492A2_L> 



wo 02/079492 



PCT/US02/04915 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



124532 


N62375 


N62375 


Hs.102731 


133213 


N63138 


AAg03424 


Hs.67e6 


124539 


N63172 


D54120 


Hs.146409 


129196 


N63787 


BE296313 


Hs.265592 


124575 


N68168 


N68168 




124576 


N68201 


N68201 




124577 


N68300 


Neesoo 


Hs.138485 


124578 


N68321 


N68321 


Hs.231500 


124593 


N69575 


N69575 


Hs.102788 


128501 


N75007 


AL133572 


Hs.199009 


332434 


N75542 


A1680737 


Hs.289068 


128473 


N90066 


T78277 


Hs.100293 


128639 


N91246 


AW582962 Hs.102897 


124552 


N92751 


W19407 


Hs.3862 


133137 


N93214 


AB002316 


Hs.65746 


124671 


N99148 


AK001357 


Hs.102951 


133054 


R07876 


AA464836 


Hs.291079 


425266 


R10865 


J00077 


Hs.155421 


124720 


R11056 


R05283 




124722 


R11488 


T97733 


Hs.185685 


128944 


R23930 


AL137586 


Hs.52763 


132965 


R26589 


AI248173 


Hs.191460 


426504 


R37588 


AW162919 Hs.170160 


438826 


R37613 


AL134275 


Hs.6434 


124757 


R38398 


H11368 


Hs.141055 


124762 


R39179 


AA553722 


Hs.92096 


124773 


R40923 


R45154 


Hs.338439 


135266 


R41179 


R41179 


Hs.97393 


427961 


R41294 


AW293165 Hs.143134 


414303 


R42307 


NM.004427HS.165263 


128540 


R43189 


AW297929 HS.32&317 


124785 


R43306 


W38537 


Hs.280740 


124792 


R44357 


R44357 


Hs.48712 


124793 


R44519 


R44519 




124799 


R45088 


R45088 




124812 


R47948 


R47948 


Hs.188732 


124821 


R51524 


H87832 


Hs.7388 


424123 


R54950 


AW966158 Hs.58582 


124835 


R55241 


R55241 


Hs.101214 


124845 


R59585 


R59585 


Hs.101255. 


124847 


R60044 


W07701 


Hs.304177 


440630 


R6D872 


BE561430 


Hs.239388 


124851 


R66690 


R67567 


Hs.107110 


332503 


R67268 


NIVL004455HS.150956 


124879 


R73588 


R73588 


Hs.101533 


124892 


R79403 


A1970003 


Hs.23756 


124906 


R87647 


H75964 


Hs.107815 


124922 


R93622 


R93622 


Hs.12163 


124940 


R99599 


AF068846 


Hs.103804 


124941 


R99612 


AI766661 


Hs.27774 


124943 


T02888 


AW963279 Hs.123373 


124947 


T03170 


T03170 


Hs.100165 


124954 


T10465 


AW964237 Hs.6728 


456862 


T15418 


U55184 


Hs.154145 


410653 


T15597 


BE383768 


Hs.65238 . 


418133 


T15652 


R43504 


Hs.6181 


440014 


T16898 


AW960782 Hs.6656 


131082 


T26644 


AI091121 


Hs.246218 


124980 


T40841 


T40841 


Hs.98681 


124984 


T47566 


BE313210 


Hs.334798 


124991 


T50116 


T50116 




457222 


T50145 


NiyL004477Hs203772 


125000 


T58615 


T58615 


Hs^887 


132932 


TS9940 


AW118826 HS.G093 


444484 


T63595 


AK002126 


Hs.11260 


125008 


T64891 


T91251 




125009 


T64924 


T64924 


Hs.303046 


445384 


T64933 


T79136 


Hs.127243 


125017 


T68875 


T68875 




125018 


T69027 


T69027 


Hs.269481 


125020 


T69924 


T69981 




437871 


T70353 


AI084813 


Hs.1 14088 


134204 


T79780 


AI873257 


Hs.7994 


125050 


T79951 


AW970209 Hs.1 11805 


125052 


T80174 


T85104 


HSJ222779 


125054 


T80622 


T80622 


Hs.268601 



EST 
ESTs 

cell divteion cyde 42 (GTP-biading prot 
ESTs. Weakly similar to 138022 hypotheti 
gb:za11o01s1 Soares fetal liver spleen 
ESTs, Weakly similar to 138022 hypotheU 
gb:za12907.s1 SoafBS fetal ffver spleen 
EST 
ESTs 

protein containing CXXC domain 2 
Homo sapiens cDNA FU11918 lis; done HE 
O-linked N-acetylgiucosamine (GlcNAc) tr 
CGI-47 protein 

regulator of nonsense transcripts 2; DKF 
KIAA0318 protein 

Homo sapiens cDNA FU10495 fis. clone NT 
ESTs. Weakly similar to T27173 hypotheti 
alpiia-fetoprotein 

gb.7e9icO8.sl Soares fetal Gver spleen 
ESTs 

anaphase-promoting complex subunlt 7 

hypothetical protein M6C12936 

RAB2, member RAS oncogene family-like 

hypothetical protein DKFZp761F2014 

Homo sapiens clone 23758 mRNA sequence 

ESTs, Moderately similarto A46010 X^fin 

ESTs 

KIAA0328.protein 
ESTs 

early development regulator 2 (homolog 0 
EST 

hypothetical protein M6C3040 
hypothetical protein FU20736 
gb:yg24h04.s1 Soares Infant brain INIB H 
gb7g38g04^1 Soares Infant bram INIB H 
ESTs 

ketch (Drosophila)-1ike 3 

Homo sapiens cDNA FU12789 fis, clone NT 

EST 

ESTs 

Homo sapiens clone FLB8503 PR02286 mRNA, 
Human DNA sequence from done RP1-304B14 
ESTs 

exostoses (muHipleHIke 1 
ESTs 

hypothetical protein similar to swine ac 
ESTs 

euka/yotlc translation inittatton factor 
heterogeneous nuclear n*bonudeoprotein 
ESTs. Highly simUar to AF161349 1 HSPCO 
ESTs. Weakly similar to ALU1.HUMAN ALU S 
ESTs 

KIAA1546 protein 

hypothetical protein F1J11585 

95 kDa retinoblastoma protein binding pr 

ESTs 

ash2 (absent small, or homeotic, Drosop 
Homo sapiens cDNA: FU217d1 fis, done H 
ESTs 

eukaryotic translation elongation factor 
gb:yb77c10^1 Stratagene ovary (937217) 
FSHD region gene 1 
ESTs 

Homo sapiens cDNA: FU22783 fis, done K 
hypothetical protein FU11264 
gb:yd60a10.s1 Soares fetal liver spleen 
ESTs 

Homo sapiens mRNA for KIAA1724 protein, 
gb:yc30f05.s1 Stratagene liver (937224) 
sex comb on midleg homolog 1 
gb7cl9d03.r1 Stratagene lung (937210) H 
ESTs 

hypothetical protein FU20551 
ESTs 

ESTs, Moderately similar to similar to N 
ESTs, Weakly similar to envelope [H.sapi 
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125063 
125064 
125066 
416507 
125080 
125097 
125104 
135107 
423122 
125116 
125120 
134160 
125136 
125144 
125150 
450142 
131387 
125178 
125180 
125182 
446888 
125197 
133497 
429922 
129232 
422166 
125209 
125212 
456631 
125223 
125225 
125228 
132393 
125238 
125247 
129296 
125263 
125266 
450862 
452401 
428243 
125277 
125278 
125260 
448205 
131844 
125284 
417111 
445424 
125289 
446313 
431342 
433227 
428305 
424624 
125295 
125298 
125300 
448378 
444582 
130882 
128888 
125310 
125315 
125317 
135096 



T85352 

T85373 

T86284 

T89579 

T90360 

T94328 

T95590 

T97257 

T97599 

T97620 

T97775 

T98152 

W31479 

W37999 

W38240 

W4015D 

W45435 

W58202 

W58344 

W58650 

W68736 

W69106 

W69111 

W69399 

W69459 

W72424 

W72724 

W72834 

W73955 

W74701 

W76540 

W79397 

W85888 

W86038 

W86881 

W87804 

W88942 

W90022 

W92272 

W92764 

W93040 

W93227 

W93523 

W93659 

W94003 

W94401 

W94688 

W94787 

Z38294 

238311 

Z38465 

Z38525 

Z38538 

238551 

238783 

239113 

239255 

239591 

239783 

239920 

240166 

240388 

240646 

241697 

299349 

299394 



AW576389 Hs.335774 
T95590 



AW962364 Hs.129051 
AB037742 Hs.24336 
W38240 

AW207469 Hs.24485 
AW453069 Hs.3657 
W93127 Hs.31845 



T85352 gb:yd82d01s1 Scares fetal liver spleen 

T85373 gb:y(l82f07.s1 Scares fetal liver spleen 

T86284 gb:yd77b07.s1 Scares fetal liver spleen 

AL045364 Hs.79353 transcription factor Dp-1 
T90360 Hs.268620 ESTs. Highly similar to ALU6^HUMAN ALU S 
EST, Moderately similar to S65657 alpha- 
gb:ye40a03.s1 Scares fetal liver spleen 
T97257 Hs.94560 ESTs. Moderately similar to 138022 hypot 
AA845462 Hs.1 24024 deltex (Drosophila) homolog 1 
R10606 Hs.269890 gb:yf35f11.s1 Scares fetal Uver spleen 
T97775 Hs.100717 EST 

T98152 Hs.79432 fibrillin 2 (congenital contractural ara 
ESTs 

K1AA1321 protein 

Empirically selected from AFFX single pr 
chondroitin sulfate proteoglycan 6 (bama 
activity-dependent neuroprotective prote 
ESTs 
ESTs 
ESTs 

hypothetical protein LOC57187 
heterochromatfn-like protein 1 
hypothetical protein MGC4251 
HI histone family, member 0 
sex comb on midleg (Drosophna)-like 1 
S100 calcium-binding protein A9 (calgran 
ESTs, Weakly similar to TSP2„HUMAN THROM 
ESTs 

hypothetical protein MGC2749 
ESTs, Weakly similar to ALUS^HUMAN ALU S 
DKF2P564G2022 protein 
ESTs. Weakly similar to 138022 hypotheti 
hypothetical protein FU14495 
ESTs 
ESTs 
ESTs 

gb:zn45g10.r1 Stratagene HeLa cell s3 93 
ESTs, Highly similar to LCT2_HUMAN LEUKO 
chromodomain hettcase DNA binding protei 
tumor necrosis factor, alpha-Induced pro 
H05317 Hs.283549 ESTs 
W93227 Hs.103245 EST 
AI218439 Hs.129998 enhancer of polycomb 1 
AI123705 H3.106932 ESTs 
W93949 Hs.33245 ESTs 
AI419294 Hs.324342 ESTs 
periilpin 

destrin (actln depotymerizing factor) 
cortactin SH3 domain-binding protein 
Homo sapiens cDNA FU13069 fis, ctone NT 
ESTs, Weakly similar to PC4259 ferritin 
ESTs 

kelch (Drosophila)-like 1 
hypothetical protein FU10210 
Ca2-K!ependent activator protein for seer 
sema domain, immunoglobulin domain (Ig), 
Homo sapiens cDNA: FU21814 fis, clone H 
EST 

Homo sapiens cDt^ FLJ12908 fs, done NT 
R55344 Hs.22142 cytochrome b5 reductase b5R.2 
AA497044 Hs.20887 hypothetical protein aJ10392 
A1760853 Hs.24155d ariadne (Drosophila) homotog 2 
R59161 Hs.124953 ESTs 
R38H0 Hs,106296 ESTs 

299348 H3.1 12461 ESTs. Weakly similar to 138022 hypotheti 
AA081258 zinc finger pnstein 36 (KOX 18) 



W58469 

AA451755 

ALD30996 

AF086270 

BE617303 

297630 

R98881 

W72424 

W72724 

AA746225 

BE383436 

A1916269 

W74169 

AA033982 

AL135094 

N99713 

AA694191 

A1051967 

AA098B78 

W90022 

U91543 



Hs.103120 
Hs.263560 
Hs.16411 

Hs.278554 

Hs.74266 

Hs.226117 

Hs.109655 

Hs.1 12405 

Hs.103174 

Hs.103173 

Hs.108847 

HS.1Q9057 

Hs.16492 

Hs.1 10059 

Hs.47334 

Hs.109514 

Hs.163914 

Hs.1 10122 

Hs.186809 
Hs.25601 



NM.007115HS.29352 



Nl\/L002686Hs.103253 
AW016321 Hs.82308 
AB028945 Hs.12696 
T34530 H5.4210 
H06245 Hs.106801 
AW971018 Hs.21659 
AB040923 Hs.106808 
AB037715 Hs.183639 
AB032947 Hs.151301 
AB022317 Hs.25887 
AW972642 Hs.289008 
239591 Hs.101376 
BE622770 Hs.264915 



144 



BNSCXDCID: <WO 02079492A2_L> 



wo 02/079492 

TABLE 3 A 



PCTAJS02/04915 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



Table 3A shows the accession numbers for those pkeys lacking unigeneID*s for Table 3. The pkeys In Table 7 lacking unlgenelD's are represented within 
Tables 1-6A. For each probeset we have listed the gene cluster number from which the oligonucleotides were designed. Gene clusters were compiled 
using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (IDoubleTwist. Oakland California). The Genbank aocesston numbers for sequences comprising each duster are listed in the "Accession" 
column. 

Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey 

108459 
124105 
108501 
108562 
101300 



CAT Number Accession 



1 16761J AA079487 AA128547 AA128291 AA079587 AA079600 
125446J H12245 AA094769 R14576 

13684_-12 AA083256 

36375J AA100798 AF020589 AA074629 AA075946 AA100849 AA085347 AA126309 AA079311 AA079323 AA085274 

4669.1 BE535511 M62098 AA306787 AW891766 AA348998 AA338869 AA344013 AW956561 AW389343 AW403607 L40391 

AW408435 AA121738 A1568978 H13317 R20373 AW948724 AW948744 AA335023 AA436722 AA446690 C21404 
AW884390 AA345454 AA303292 AA174174 BE092290 T90614 AA035104 R76028 AA126924 AA741086 AW022056 
AW1 18940 AA121666 AI832409 AA683476 AI140901 Ai623576 AW519064 AW474125 AI953923 A1735349 AW150109 
AI436154 AW1 18130 AW270782 AI804073 N27434 AA876543 AA937815 AI051 166 AA505378 AI041975 AI335355 
AI089540 AA662243 AI127912 AI925604 AI250880 AI366874 AI564386 AI815196 A1683526 AI435885 AI160934 H79030 
AI801493 AA448691 AI673767 A(0r6042 AI804327 AA813438 AA680002 AI274492 T16177 AI287337 AI935050 
AA907805 AA91 1493 AI58941 1 AI371358 AW576236 AI078866 AW516168 AA346372 AI560185 AA471009 R75857 
AA296025 AA523155 AA853168 AI696593 AI658482 A1566601 AW072797 AA128047 AAD35502 AW243274 AA992S17 
R43760 

132091 94851J AW954243 AA829930 AA412478 AA828434 AA814538 Ai927418 AI192435 WS2897 AA443666 AA031913 AI683306 

AA918481 AI183314 083907 A1206832 AA876122 D83836 D83838 D82533 AI761290 Aligi125 AI143749 AW771909 
AI241436 AI767267 W56507 AA847787 AA568692 T10502 AI247870 AA715017 AA643304 AA890233 AA611387 . 
AA897470 AA907729 A(708679 AI078010 AA452630 AW419180 AI783713 N80205 W56778 AA676899 A1888718 N69930 
At336935 A(217560 AA639508 AA575836 BE046652 AI312651 AI036405 AA626$49 AA643636 Af493761 AA032024 
W38849 AA340178 AA447052 AA452969 W19369 AA2g6364 H44229 W58767 C05751 C05835 AI741989 Ng8532 
AW102617 AA412583 A1922246 W38495 AA355375 AA928571 C06275 AA352500 N93132 

1 17034 201 13_2 U72209 NIVL005748 AI655607 AI052758 AA385199 AW956794 H88679 AL135153 AI765644 AA384399 AW966458 

AA568443 AA804610 AI873513 H88639 Z25371 R63456 W44919 

100752 33207.21 T81309 BE019033 R94181 BE019198 NiyL000812 J03242 AW41 1299 BE300064 BE297544 R94182 AW630108 T53723 
058853 H78073 H80594 BE299560 T48899 H70196 M17426 N77077 S77035 H58384 H61664 H78540 T84527 C17198 
H60255 H71980 R92644 W79050 XG0910 M29645 R91055 M17863 M17862 T71815 BE299561 BE464561 X06260 
R94741 T54216 C18594 BE262015 X06161 AW409889 AA378400 BE263228 BE313278 R88116 BE313457 H43500 
T48617 BE313761 H77309 AI207601 X06159 H40413 X03425 T87663 R10627 X03562 M141 18 W03982 R97520 H81229 
T83157 H83168 H48762 AA669898 BE263054 H47289 AA022807 R1 1555 H74260 R76968 R28338 H72534 H72464 
H62031 N72478 N45355 AW41 1300 R89113 R69135 H58454 T83281 R93476 H69645 H68015 T82229 H71089 T85121 
H59939 W65299 N78176 H53909 N72373 R21788 H04660 H59639 H61874 BE262219 T53614 N73335 N50464 W00943 
N77189 R89257 AA570502 R89432 R06366 AA553480 AA776271 AA551359 AA551050 H51670 AA601052 BE299081 
H68198 H52276 BE207832 N91 192 H70332 X07858 X07868 H69464 H53782 H73710 R80435 AA553384 AW884176 
N53475 T71662 AW954036 AW954033 AA552931 H93206 AA430218 AA553476 AI918470 T54124 BE207982 BE300177 
N73994 AW882625 N39549 N53838 AA722389 H71878 H58909 H37849 H78435 T47933 R77174 R83814 AA41 1890 
H94199 AA663208 BE205778 AA490137 H70492 R98232 H37800 AA679294 H40341 H74238 H47290 H73231 T48618 
AA025428 A(039521 H92969 N59389 H80538 H72933 T90530 AA411891 N55000 H74225 AA340290 AW957061 T54316 
AA340437 H57125 iH58908 H79027 H63450 M74623 R93425 H68714 H68758 N68396 H48763 N69256 H57320 H53831 
H53589 N68833 N52453 H56048 H69870 H7BQ7A R69253 R83375 T53615 H94330 H58455 H90864 T47934 H74261 
R89258 R97997 R91056 R28339 R86760 H78235 R97521 H67692 H40358 AA022688 H52513 H59601 T88690 H65256 
H63397 W65397 AA553588 R19280 N52645 W73930 R06367 R21743 H72372 N73921 AW883539 AW882639 T40616 
H47084 R95723 AA634316 AA862781 H77310 R91389 H93111 R92767 T54512 R89341 H70333 H57817 H82941 H62032 
N52638 H58385 T91796 H51066 AA340292 T49918 H81230 R36121 N50411 T87664 N62436 N39340 AA665637 
AA340446 H93377 H92973 BE296290 BE269788 H61665 AA34CM44 N54605 AA454101 R10628 R94200 A)200549 
AA342640 BE298855 BE25Q229 T49916 H82008 N28278 AW880662 H71268 N76791 H47685 H652S5 W05198 
AW889144 N76677 H71702 H88036 H71915 R91612 R87807 H68059 AI133328 AI247866 AA621443 AW881050 
AA700847 AA340413 AW878608 AW681181 AW876249 H71916 N54596 BE161581 AW878082 W04212 AW881040 
AW885492 AW880519 AA334887 AW878715 W06882 AW630222 AW885381 H70869 AW381778 H47601 AW889982 
H63868 AW884986 AW878713 AW878685 R36391 AW878694 AA368070 C03393 AW876695 AW878705 AWB78665 
AW878742 AW878620 AW878823 AW876688 R29048 AW878690 AW878686 AW878810 AW878827 AW878733 
AW878659 AW878749 AW876681 AW883353 AW883277 AW883300 AW883565 AW883298 AW883143 AW883045 
AW883482 AW883352 AW883417 AW883357 AW883231 AW883474 AW883355 AW882620 AW882533 AW883754 
AW883139 AW882827 AW883641 AW883567 AW883481 AW882983 AW882982 AW882465 AW883419 AW882466 
AW883639 AW883230 AW882981 AW882534 AW882874 AW882619 AW883480 AW882826 AW882831 AW882835 
AW882830 AW883563 AW882456 AW627642 

1 16417 5418J 1 AW499664 AW500888 AL042095 AW576556 AW265424 AI521500 AA761333 AA761319 AW291137 AA649040 AA769094 

AA489664 AA63531 1 AW070509 AA425658 AI381489 AA609309 AA134476 W74704 AI923640 AW084888 H45700 
AI985564 AW629495 AW614573 AI859571 At693486 AA913892 At806164 AA909524 AW263513 AI356361 Z40708 
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123712 
117156 



125008 
125020 
125066 
116661 
125104 
124575 
125263 
131859 



374423J 
145392J 



1802095.1 
116017 1 
1814993J 
1532859_1 
413347 1 
1666649J 
1547.2 
3672J 



125565 1704098_1 
132983 1192^1 



118584 
133607 



532052.1 
1227 6 



A1332765 A1392620 AA181060 AW118719 AW968804 AW263502 AW505314 AA036967 W74741 R51139 H19364 H45751 

Z44962 AW370823 H25650 T54007 AA453000 AL045739 

AA6096B4AA758732 

W73853 AA928112 W77887 AW8B9237 AA148524 AI749182 AI754442 AI338392 AI253102 A1079403 A1370541 A1697341 

H97538 AW188021 AI927669 W72716 AI051402 AI188071 AI335900 N21488 AW770478 W92522 AI691028 AI913512 

AI144448 W73819 AA604358 N26900 Wg5221 AI868132 H98465 AA14B793 

T91251T64891T85665 

T69981T69924AA078476 

T86284T81933 

R61504 F04247 

T95590AA703278 H62764 

N68168 N69188 N9045D 

AA098878 W88942 

AW960564 AA092457 T55890 D5612D T92525 A1815987 BE182608 BE182595 AW08023B M90657 AA347236 AW961686 
AW176446 AA304671 AW583735 T61714 AA316968 AI446615 AA343532 AA083489 AA488005 W52095 W39480 N57402 
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TABLE 4: 



Pkey: Unique Eos probeset Identifier number 

Accession: Accession number used for previous patent filings 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelO: Unigene number 

Unigene Title: Unigene gene Me 
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ExAccn UniGene UnlgeneTitle 



100405 
100420 
100481 
100484 
100718 
100991 
101097 
101168 
101194 
101261 
101345 
101447 
101485 
101543 
101550 
101560 
101674 
101714 
101741 
101838 
101857 
102012 
102024 
102164 
102241 
102283 
102303 
102564 
102663 
102759 
102778 
102804 
102887 
102898 
102915 
103036 
103037 
103095 
103158 
103166 
103185 
103280 
103554 
103850 
104465 
104592 
104764 
104786 
104850 
104865 
104894 
104952 
104974 
105178 
105263 
105330 
105376 
105729 
105826 
105977 
106008 
106031 
106124 



D86425 
D86963 

HG1098-HT1098 

HG1103-HT1103 

HG3342-HT3519 

J03764 

L06797 

L15388 

L20971 

L35545 

L76380 

M21305 

M24736 

M31166 

M31551 

M32334 

IVI61916 

M68874 

M74719 

M92934 

M94856 

U03057 

U03877 

U18300 

U27109 

U31384 

U33053 

U59423 

U70322 

U81607 

U83463 

U89942 

X04729 

X06256 

X07820 

X54925 

X54936 

X60957 

X67235 

X67951 

X69910 

X79981 

Z18951 

AA187101 

N24990 

R81003 

AA025351 

AA027168 

AA040465 

AA045136 

AA054087 

AA071089 

AA085918 

AA187490 

AA227926 

AA234743 

AA236559 

AA292694 

AA39&243 

AA406363 

AA411465 

AA412284 

AA423987 



AW291567 Hs.82733 
D86983 Hs.1 18893 
X70377 Hs.121489 
NM.005402HS.2B8757 
BE295928 Hs.75424 
J03836 Hs.82085 
BE245301 HS.B9414 
NWL005308HS.211569 
L20971 Hs.188 
D30857 Hs.82353 
N1VL005795HS.152175 
M21305 

AA296520 Hs.89546 
M31166 Hs.2050 
Y00630 Hs.75716 
AW958272 Hs.347326 
N!\A_002291Hs.62124 
M68874 Hs.211587 
NM^003199Hs.326198 
BE243845 Hs.75511 
BE550723 Hs.1 53179 
BE259035 Hs.1 18400 
AA301867 Hs.76224 
NM_0001O7Hs.776O2 
NM_007351Hs.268107 
AW161552 Hs,83381 
U33053 Hs2499 
US9423 Hs.79067 
NM.002270HS.168075 
NM.005100Hs.788 
AF000652 Hs.8180 
NM_002318Hs.83354 
J03836 Hs.82085 
NIVL002205HS.149609 
X07820 Hs.2258 
M13509 Hs.83169 
BE018302 Hs.2894 
NM_005424Hs.78824 
BE242587 Hs.1 18651 
AA159248 Hs.180909 
NM_006825Hs.74368 
U84722 Hs.76206 
AI878826 Hs.74034 
AA187101 Hs.213194 
244203 Hs.26418 
AW630488 Hs.25338 
AI039243 Hs.278585 
AA027167 Hs.10031 
AL133035 Hs.8728 
T79340 Hs.22575 
AF065214 Hs.18858 
AW076098 Hs.345588 
Y12059 Hs.278675 
AA313625 Hs.21941 
AW388633 Hs.6682 
AW338625 Hs.22120 
AW994032 Hs.8768 
H46612 Hs^93815 
AA4767S6 Hs.194477 
AK001972 Hs.30822 
AB033888 Hs.8619 
X64116 Hs.171844 
H93366 Hs.7567 



nidogen2 

Melanoma associated gene 
cystatinD 

v-ral simian leukemia viral oncogene horn 
Inhibitor of DNA binding 1, dominant neg 
serine (or cysteine) proteinase inhibito 
chemokine (C-X-G motif), receptor 4 (fus 
G protein-coupled receptor kinase 5 
pliosphodiesterase 4B, cAMP-specific (dun 
protein C receptor, endothelial (EPCR) 
calcitonin receptor-like 
gb'iJuman alpha satellite and satellite 3 
seiectin E (endothelial adhesion nK)lecui 
pentaxin-related gene, rapidly induced b 
serine (or cysteine) proteinase inhibito 
intercellular adhesion molecule 2 
laminin, beta 1 

phospholipase A2. group IVA (cytosolk:, 
transcription factor 4 
connective tissue growth factor 
fatty acid binding protein 5 (psoriasis- 
singed (Prosophila)-like (sea urchin fas 
EGF-containing fibunn-Rke extracelluta 
damage-specific DNA binding protein 2 (4 
muitimerin 

guanine nucleotide binding protein 11 

protein kinase C-like 1 

MAO (mothers against decapentaplegk:, Dr 

karyopherin Omportin) beta 2 

A kinase (PRKA) anchor protein (giavin) 

syndecan binding protein (syntenin) 

lysyl oxidase-like 2 

serine (or cysteine) proteinase Inhibito 
integrin, alpha 5 (fibronecttn receptor, 
matrix metalloproteinase 10 (stromelysin 
matrix metalloproteinase 1 (interstitial 
placental growth factor, vascular endoth 
tyrosine kinase with immunoglobulin and 
hematopoietically expressed homeobox 
peroxiredoxin 1 

transmembrane protein (63kD), endoplasml 
cadherin 5, type 2, VE-cadherin (vascula 
caveolin 1, caveolae protein, 22kD 
hypothetical protein MGC10895 
ESTs 

protease, serine, 23 
ESTs 

KIAA0955 protein 

hypothetical protein DKF2p434G171 
B^jell Gl-Ulymphoma 6, member B (zinc fi 
phospholipase A2, group IVC (cytosoRc. 
desmoplakin(DPI. DPil) 
bromodomain^ntainir^ 4 
AD036 protein 

solute canier fantily 7, (cationic amino 
ESTs 

hypothetical protein FU10849 

Homo sapiens HSPC285 mRNA, partial cds 

E3 ubkiuitin Dgase SMURF2 

hypothetteal protein FU11110 

SRY (sex determining region Y)-box 18 

Homo sapiens cDNA: FU22296 Tis. clone H 

Homo sapiens cDNA: FLJ21962 fts. clone H 
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106155 
106302 
106423 
106793 
107174 
107216 
107295 
107385 
108756 
108846 
108888 
109001 
109166 
109456 
109768 
110107 
110906 
110984 
111006 
111018 
111133 
111760 
113073 
113195 
113923 
114521 
115061 
115096 
115145 
115819 
115947 
116314 
116339 
116430 
116589 
116733 
117023 
117186 
117563 
117997 
118475 
118581 
119073 
119155 
119174 
119221 
119416 
119866 
121335 
121381 
123160 
123473 
123523 
123533 
123964 
124006 
124315 
124659 
124669 
124847 
124875 
125091 
125103 
125355 
125565 
125590 
423765 
126511 
100286 
126563 
126649 
449602 
126872 
456000 
414221 
127402 



AA425309 

AA435896 

AA448238 

AA478778 

AA621714 

D51069 

T34527 

U97519 

AA127221 

AA132983 

AA135606 

AA156125 

AA179845 

AA232645 

F10399 

H16772 

N39584 

N52006 

N53375 

N54067 

N64436 

R26892 

T33637 

T57112 

W80763 

AA046808 

AA253217 

AA255991 

AA258138 

AA426573 

AA443793 

AA490568 

AA496257 

AA609717 

D59570 

F13787 

H88157 



N34287 

N52090 

N66845 

N68905 

R32894 

R61715 

R71234 

R98105 

T97186 

W60814 

AA404418 

AA405747 

AA488687 

AA599143 

AA608588 

AA608751 

C13981 

D60302 

H94892 

N93521 

N95477 

R60044 

R70506 

T91518 

T95333 

R45630 

R20839 ' 

R23858 

R23858 

AI024874 

W26247 

W26247 

AA856990 

AA856990 

AA136653 

AA136653 

AA136653 

AA358869 



AA425414 Hs.33287 
M398859 Hs.18397 
AB020722 Hs.16714 
H94997 Hs.16450 
BE122762 Hs.25338 
D51069 Hs.211579 
AA186629 Hs.80120 
NIVL005397HS.16426 
AA127221 Hs.1 17037 



AL1 17452 
AA135606 
AI056548 
AA219691 



Hs.44155 
Hs.189384 
HS72116 
Hs.73625 



AW956580 Hs.42699 
F06838 Hs.14763 
AW151660 Hs.31444 
AA035211 Hs.1 7404 
AW613287 Hs.80120 
BE387014 Hs.166146 
A1287912 Hs.3628 
AW580939 Hs.97199 
BE551929 Hs.268754 
N39342 Hs.103042 
H83265 Hs.8881 
AW953484 Hs.3849 
AW139036 Hs.108957 
A)75143B Hs.41271 



A)683069 

AA740907 

AA486620 

R47479 

AI799104 

AK0002gO 

AK001531 

A1557212 

AL157424 



Hs,175319 

Hs.88297 

Hs.41135 

Hs.94761 

Hs.178705 

Hs.44033 

Hs.66046 

Hs.17132 

Hs.61289 



AW070211 Hs.102415 



Hs.42612 
Hs.44553 
Hs.47420 



Hs^79477 
H5.310598 

Hs.250700 



H98986 
AF055634 
N52090 
N66845 
N68905 
BE245360 
R61715 
R71234 
C14322 
T97186 

AA496205 Hs.193700 
AA404418 

AW088642 Hs.97984 

AA488687 Hs.284235 

AA599143 

AA608588 

AA608751 

C13961 

AI147155 Hs.270016 
NM.005402HS.288757 
AI680737 Hs.289068 



AI571594 

W07701 

AI887664 

T91618 

AA570056 

R60547 

R20840 

R23858 

R23858 

T92143 

BE247550 

AA516391 

AA001850 

AA001860 

AW450979 

BE180876 

AW450979 

AA358869 



Hs.102943 
Hs.304177 
Hs.285814 

Hs.122730 
Hs.170098 

Hs.143375 

Hs.143375 

Ks.57958 

Hs.86859 

Hs.181368 

Hs.279531 

Hs.279531 

Hs.1 1614 

Hs.227949 



nuclear factor l/B 

hypothetical protein FU23221 

Rho guanine exchange factor (GEF) 15 

ESTs 

ESTs 

melanoma cell adhesion molecule 

UDP-N-acetyl-alpha-D^aIactosamine:polyp 

podocalyxin-like 

ESTs 

DKFZP5&6G1517 protein 

gb2l10a05^1 Soares_pregnanLuterus.NbH 

hypothetical protein FIJ20992 similar to 

RAB6 interacting, tdnesin-lilce (rabkines 

ESTs 

ESTs 

ESTs 

ESTs 

UDP-N-acetyI-alpha-D^a!actosamlne:polyp 
Homer, neuronai Immediate early gene, 3 
mitogen-acUvated protein kinase kinase 
complement component CI q receptor 
Homo sapiens cDNA FLJ11949 fis, clone HE 
mlcrotubuie-assodated protein IB 
ESTs, Weakly similar to S41044 chromosom 
hypothetical protein FLJ22041 similar to 
40S ribosomal protein S27 isofonn 
Homo sapiens mRNA fuB length insert cDN 
ESTs 
ESTs 

endomucin-2 
KIAA1691 protein 

Homo sapiens cDNA FU11333 fis, done PL 

dipeptldyi peptidase 8 

hypothetical protein FU10669 

ESTs, rVioderateJy similar to 154374 gene 

synaptojanin 2 

Homo sapiens mRNA; cDNA DKFZp586N0121 (f 
ESTs. Weakly sfmOar to ALU1_HUMAIM AliJ S 
unc5 (Celegans homotog) c 
EST 

gb:za46c1 1 ^1 Soares fetal liver spleen 
gb:za69b09.s1 Soares.fetaLlung.NbHL19W 
ESTs 

ESTs, IVIoderately similar to ALUl.HUMAN A 
gb:yl54c08.s1 Soares placenta Nb2HP Homo 
tryptase beta 1 

gb:ye50h09.sl Soaros fetal liver spleen 
Homo sapiens mRNA; cDNA DKFZp586l0324 (f 
gb:zw37e02.s1 SoarBS_totaLfetus^Nb2HF8. 
hypothetical protein FIJ22252 similar to 
ESTs, Weakly similar to 138022 hypotheti 
gb:ae52d04.s1 Stratagene lung cardnoma 
gb:ae54e06.sl Stratagene lung cardnoma 
gb:ae56h07.sl Stratagene lung cardnoma 
gb:C13961 Clontech human aorta polyA-»- mR 
ESTs 

v-ral simian leukemia viral oncogene horn 

Homo sapiens cDI^ FU11918 fis. done HE 

hypothetical protein MGC12916 

Homo sapiens done FLB8503 PR02286 mRNA, 

sprouty (Drosophila) homolog 4 

gb:ye20i05^1 Strategenehing (937210) H 

ESTs, Moderately similar to K1AA121 5 pro 

K1AA0372 gene product 

gb:yg05o08.r1 Soares in^t brain 1NIB H 

Homo sapiens, done IMAGE:3840937. mRNA, 

Homo saptens. done IMAGE:3840937, mRNA. 

EGF-TM7-^trophnin-felated protein 

growth factor receptor-bound protein 7 

tJ5 snRNP-spedfic protein (220 kD). orth 

ESTs 

ESTs 

gb:Ul-H-BI3^la^12^Ul.s1 NCI.CGAP.Su 
HSPC065 protein 

gb:Ul-H-Bl3-ala-a-12-<WJi.s1 NCLCGAP.Su 
SEC13 (S. cerevsiaeHike 1 
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127651 

424806 

128062 

128992 

129046 

129188 

129314 

129371 

129468 

129765 

129805 

129884 

130495 

130639 

130657 

130828 

130972 

131080 

131137 

131182 

131486 

131573 

131647 

131756 

131859 

131881 

132050 

132083 

132164 

132358 

132413 

132456 

132490 

132676 

132687 

132718 

132736 

132760 

132933 

132968 

132994 

133061 

133147 

133161 

133200 

133260 

133363 

133491 

133517 

133550 

133607 

133614 

133627 

133691 

133696 

133913 

133975 

133985 

134039 

134088 

134161 

134299 

134416 

116470 

134656 

134989 

135051 

135073 

135349 

100114 

100130 

100143 

100168 

100208 

100224 

100405 



AI123976 

AI123976 

AA379500 

R49693 

AA195678 

M30257 

AA028131 

M10321 

JG3040 

M86933 

AA012933 

AA286710 

AA243278 

D59711 

T94452 

AA053400 

AA370302 

J05008 

U85193 

AA256153 

X83107 

AA046593 

AA410480 

D45304 

M90657 

AA010163 

AA136353 

Y07B67 

U84573 

X60486 

AA132969 

AA1 14250 

F13782 

AA283035 

AB002301 

AA056731 

U68019 

H99198 

AA598702 

N77151 

AA505133 

AB000584 

D12763 

AA253193 

AA432248 

AA083572 

AA479713 

L40395 

X52947 

W80846 

M34539 

D67029 

U09587 

M85289 

D10522 

W84712 

D29992 

L34657 

S78559 

D43636 

U97188 

AA487558 

M28882 

X70683 

X14787 

AA236324 

C15324 

AA452000 

D83174 

000596 

D11428 

D13640 

D14874 

D26129 

D28476 

D86425 



AA382523 Hs.105689 
AA382523 Hs.105689 
AA379621 Hs.105547 
H04150 Hs.107708 
AB029290 Hs.108258 
NM_001078Hs.109225 
BE62276B Hs.290356 
X06828 Hs.1 10802 
AW410536 Hs.111779 
M86933 Hs.1238 
AA012848 Hs.12570 
AF055581 Hs.13131 
AW250380 Hs.109059 
AI557212 Hs.17132 
AW337575 Hs.201591 
AW631469 H5.203213 
D81866 Hs.21739 
NM_001955Hs^271 
W27392 Hs.33287 
AI824144 Hs.23912 
F06972 Hs.27372 
AA040311 Hs^8959 
AA359615 Hs.30089 
AA443966 H5.31595 
AW960564 
AW361018 Hs.3383 
AI267615 HS.3B022 
BE386490 Hs.279663 
AI752235 Hs.41270 
NM_003542Hs.46423 
AW361383 Hs.260116 
AB011084 Hs.48924 
NM_001290Hs.4980 
N92589 Hs.261038 
AB002301 Hs.54985 
NNL004600HS.554 
AW081883 Hs.211578 
AA125985 Hs.56145 
BE263252 Hs.6101 
AF234532 Hs.61638 
AAl 12748 Hs.279905 
AI186431 Hs.296638 
AA026533 Hs.66 
AW021103 Hs.6631 
AB037715 Hs.183639 
AA403045 Hs.6906 
AI866286 Hs.71962 
BE619053 Hs.170001 
NWL000165HS.74471 
AI129903 Hs.74669 
BE273749 

NM_003003Hs.75232 
NM-002047HS.75280 
M85289 Hs.211573 
AI878921 Hs.75607 
AU076964 Hs.7753 
C18356 Hs.295944 
L34657 Hs.78146 
NM_002290Hs78672 
A1379954 Hs.79025 
AA634543 Hs.79440 
AW580939 Hs.97199 
X68264 Hs.211579 
AI272141 Hs.83484 
A1750878 Hs.87409 
AW968058 Hs.92381 
AI272141 Hs.83484 
W55956 Hs.94030 
AA114212 Hs.9930 
X02308 Hs.82962 
NM.000304HS.103724 
AU076465 Hs^8441 
H73444 Hs394 
NMJ)02933Hs.78224 
AL121516 Hs.138617 
AW291587 Hs.82733 



MSTP031 protein 
MSTP031 protein 

neural proliferation, differentiation an 
ESTs 

actin binding protein; macrophin (microf 
vascular cell adhesion moleciJle 1 
mesoderm development candidate 1 
von Willebrand factor 
secreted protein, acidic, cysteine-rich 
amelogenin (Y chromosome) 
tubulin-specific chaperone d 
lysosomal 

mitochondrial ribosomal protein L12 
ESTs. Moderately similar to 154374 gene 
ESTs 
ESTs 

Homo sapiens mRNA; cDNA DKFZp58611518 (f 
endothelin 1 
nudear factor l/B 
ESTs 

BMX non-ieceptor tyrosine kinase 

ESTs 

ESTs 

ESTs 

transmembrane 4 superfamily member 1 
upstream regutetory element binding prot 

ESTs 
Piiin 

procollagen-lysine, 2-oxoglutarate 5-dio 
H4 hlstone family, member G 
metailoprotease 1 (pitrilysin family) 
KIAA0512 gene product; Al£X2 
Lll^ domain binding 2 
ESTs, Weakly simitar to 138022 hypotheti 
KIAA0303 protein 

Sjogren syndrome antigen A2 (60kD. ribon 
Homo sapiens cDNA: FU23037 fis, done L 
thymosin, beta, identified in neuroblast 
hypothetical protein IVIGC3178 
myosin X 

clone HQ0310 PRO0310pl 
prostate differentiation factor 
interteukin 1 receptor-like 1 
hypothetical protein F1J20373 
hypothetical protein FU10210 
Homo sapiens cDNA: FU23197 fis. done R 
ESTs. Wealdy similar to B36298 proDne-r 
eukaryotic tnauislatton initiatnn fador 
gap Junction protein, alpha 1, 43kD (oon 
vesk^e-assodated membrane protein 5 (m 
FK506-bin(ftig protein lA (12kD) 
SEC14 (S. cerevisiae)-like 1 
glycyl-tRNA synthetase 
heparan sulfate proteoglycan 2 (pedecan 
myristoylated alanine-rich protein kinas 
calumenin 

tissue fador pathway inh9)itor 2 
platelet/endothelial cell adhesion molec 
lamlnin. alpha 4 
KIAA0096 protein 
1GF-(I mRNA-blnding protein 3 
complement component CI q receptor 
melanoma cell adhesion molecule 
SRY (sex detemiining region Y)-box 4 
thrombospondin 1 

nudix (nudeoslde diphosphate linked mol 

SRY (sex detemnlning region Y)-box 4 

Homo sapiens mRNA; cDNA DKFZp586E1624 (f 

serine (or cysteine) proteinase inhibito 

thymidylate synthetase 

peripheral myelin protein 22 ^ 

KIAA0015 gene produd 

adrencmedullin 

ribonudease, RNase A feinuly, 1 (panoea 
tliyroid hormone receptor interactor 12 
nidogen2 
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100420 
100455 
100529 
100618 
100619 
100658 
100676 
100718 
100752 
100828 
100850 
100991 
101097 
101110 
101142 
101156 
101168 
101184 
101192 
101317 
101336 
101345 
101400 
101475 
101485 
101496 
101505 
101543 
101557 
101560 
101587 
101592 
101633 
101634 
101667 
101682 
101714 
101720 
101741 
101744 
101793 
101837 
101838 
101840 
101857 
101864 
101931 
101966 
102012 
102013 
102024 
102059 
102121 
102283 
102300 
102378 
102395 
102460 
102491 
102499 
102523 
102560 
102564 
102589 
102600 
102645 
102687 
102693 
102709 
102759 
102804 
102882 
102907 
102915 
102927 
102960 



D86983 
D87953 

HG1862-HT1897 

HG2614-HT2710 

HG2639-HT2735 

HG2855-HT2995 

HG3044-HT3742 

HG3342-HT3519 

HG3543-HT3739 

HG4069-HT4339 

HG417-HT417 

J03764 

L06797 

L08246 

L12711 

L13977 

L15388 

L19871 

L20859 

L42176 

L49169 

L76380 

M15990 

M23254 

M24736 

M26576 

M27396 

M31166 

M31994 

M32334 

M35878 

M36429 

M57730 

M57731 

M60858 

M62994 

M68874 

M69043 

M74719 

M75126 

M84349 

M92843 

M92934 

M93058 

M94856 

M95787 

S76965 

S81914 

U03057 

U03100 

U03877 

U08021 

U14391 

U31384 

U32944 

U40369 

U41767 

U48959 

U51010 

U51478 

U53445 

U59289 

U59423 

U62015 

U63625 

U67963 

U73379 

U73824 

U77604 

U81607 

U89942 

X04412 

X06985 

X07820 

X12876 

X15729 



DB6983 Hs.1 18893 
AW888941 Hs.75789 
BE313693 Hs.334330 
AI752163 Hs.1 14599 
N24433 Hs.241567 
U55725 Hs.180414 
X02761 Hs.287820 
BE295928 Hs.75424 
T81309 

ALD48753 Hs.303649 
AA836472 Hs.297939 
J03836 Hs.82085 
BE245301 Hs.89414 
AI439011 Hs.86386 
L12711 Hs.8g643 
AA340987 Hs.75693 
NIVL005308HS.211569 
NIVL001674HS.460 
BE247295 Hs78452 
L42176 Hs.8302 
NM^006732Hs.75678 
NM_005795Hs.152175 
M15990 Hs.194148 
BE410405 Hs.76288 
AA296520 Hs.89546 
X127a4 Hs.1 19129 
AA307680 Hs.75692 
M31166 Hs.2050 
BE293116 Hs.76392 
AW958272 Hs.347326 
AI752416 Hs.77326 
AF064853 Hs.91299 
NIVL0D4428HS.1624 
AV650262 Hs.75765 
NM_005381 
AF043045 Hs.81008 
M68874 Hs.211587 
M69043 Hs.81328 
NM.003199HS.326198 
AI879352 Hs.1ie625 
W01076 HSJ278573 
M92843 H5.343586 
BE243845 Hs.75511 
AA236291 Hs.183583 
BE550723 Hs.153179 
BE392588 Hs.75777 
NM.006623HS.75209 
X96438 Hs.76095 
BE259035 Hs.1 18400 
BE616287 Hs.178452 
AA301867 Hs.76224 
AI752666 Hs.76669 
NM^004998Hs.82251 
AW161552 Hs.83381 
AI929721 Hs.5120 
AU076887 Hs.28491 
AU077005 Hs.92208 
U48959 Hs.211582 
U51010 

BE243877 Hs.76941 
U53445 Hs.15432 
R97457 Hs.63984 
U59423 Hs79057 
AU076728 Hs.8867 
AI984144 Hs.66713 
AL1 19566 Hs.6721 
NIVL007019HS.93002 
AA532780 Hs.183684 
AA122237 Hs.81874 
NM^005100Hs.788 
NM.002318HS.83354 
AI767736 Hs.2g0070 
BE409861 H3^Q2833 
X07820 Hs^8 
6E512730 Ks.65114 
AI904738 Hs.76053 



Melanoma associated gene 

N-myc downstream regulated 

calmodulin 2 (phosphorylase kinase, dell 

collagen, type VII!. alpha 1 

RNA binding motif, single stranded inter 

heat shock 70kD protein 2 

fibronectin 1 

inhibitor of ONA binding 1, dominant neg 
insulin-like growth factor 2 (somatomedi 
small inducible cytokine A2 (monocyte ch 
cathepsin 6 

serine (or ^teine) proteinase inhibito 

chemokine (C-X-C motif), receptor 4 (fus 

myeloid cell leukemia sequence 1 (BCL2-r 

transketolase (Wernicke-Korsakoff syndro 

prolylcarboxypeptidase (angtotensinase C 

6 protein-coupled receptor kinase 5 

activating transcriptton factor 3 

solute carrier family 20 (phosphate tran 

four and a half Ltf^ domabis 2 

FBJ nuirine osteosarcoma viral oncogene h 

calcitonin receptor-tike 

v-yes-1 Yamaguchi sarcoma viral oncogene 

catpam 2, (m/il) targe subunit 

selectin E (endothelial adhesion molecul 

collagen, type iV. alpha 1 

asparaglne synthetase 

pentaxin-related gene, rapkily induced b 

aldehyde dehydrogenase 1 family, member 

Intercellular adhesion molecule 2 

Insulin-Iike growth factor binding prote 

guanine nucleotide binding protein (G pr 

ephrin-A1 

GR02 oncogene 

nucleolin 

fiiamin B, beta (adin-blnding protein-2 
phospholipase A2, group IVA (cytosollc, 
nuclear factor of kappa tight polypeptid 
transcription factor 4 
hexokinase 1 

CD59 anttgen p16-20 (antigen identified 
zinc finger protein homok)gous to Zfp^6 
connective tissue growth factor 
ser^e (or cysteine) proteinase Inhibito 
fatty add binding protein 5 (psoifasis- 



piotein kinase (cAA^P-dependent, catalyti 
Immediate earty response 3 
singed (Drosophila)-like (sea urchin fas 
catenin (cadherin-assodated protein), a 
EGF-contalning fibulin-like extraceUula 
nicotinamide l^-methyltransferase 
myosin IE 

guanine nucteotkJe binding protein 11 
dynein, cytoplasmic, light polypeptide 
spemnidine/spennlne N1-acetyitransferase 
a dislntegrin and metalloprotelnase dome 
myosin, light polypeptide kbiase 
gbiHuman nicotinamide N-methyltransferas 
ATPase, Na-WK+ transporting, beta 3 poly 
dovmregulated In ovarian cancer 1 
cadherin 13, H-cadherin (heart) 
MAD (mothers against decapentaplegic, Dr 
cystelne-rich. angiogenic inducer, 61 
hepatitis delta antigen-interacting prot 
lysosomal 

ubiquitln carrier protein E2-C 

eukaiyotk: translation initiatton factor 

microsomal glutathtone S-transferase 2 

A kinase (PRKA) anchor protein (gravin) 

lysylQxhIase-like2 

getsolin (amytoUlosIs, Rnnlsh type) 

heme oxygenase (decyciing) 1 

matrix metaltoproteinase 10 (stromelysin 

keratin 18 

DEAD/H (Asp^hJ^a^p/Hls) box potypep 
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103011 

103020 

103029 

103036 

103056 

103080 

103095 

103138 

103176 

103195 

103347 

103371 

103432 

103471 

103967 

104447 

104764 

104783 

104798 

104865 

104877 

104894 

104952 

105113 

105178 

105196 

105215 

105263 

105271 

105330 

105461 

105492 

105493 

105594 

105727 

105732 

105767 

105882 

105936 

106031 

106124 

106222 

106241 

106263 

106264 

106366 

106454 

106634 

106724 

106793 

106799 

106842 

106868 

106890 

106961 

106974 

107030 

107061 

107086 

107216 

107385 

107444 

107985 

108507 

108695 

108931 

109001 

109195 

109390 

109456 

109737 

110411 

110660 

110906 

111018 

111091 



X52541 

X53416 

X54489 

X54925 

X57206 

X59798 

X60957 

X65965 

X69111 

X70940 

X87838 

X91247 

X97748 

Y00815 

AA303711 

L44538 

AA025351 

AA027050 

AA029462 

AA045136 

AA047437 

AA054087 

AA071089 

AA156450 

AA187490 

AA195031 

AA205724 

AA227926 

AA227986 

AA234743 

AA253216 

AA255210 

AA256268 

AA279397 

AA292379 

AA292717 

AA346551 

AA400292 

AA404338 

AA412284 

AA423987 

AA428594 

AA43010B 

AA431462 

AA431470 

AA443756 

AA449479 

AA459916 

AA465228 

AA478778 

M47g037 

AA482597 

AA487561 

AA48924S 

AA504110 

AA520989 

AA599434 

AA608649 

AA609519 

D51069 

U97519 

W28391 

AA035638 

AA083514 

AA1213t5 

AA147186 

AA156125 

AA188932 

AA219653 

AA232645 

F10078 

H48032 

H82117 

N39584 

N54067 

N59858 



AJ243425 Hs.326035 
X53416 Hs.195464 
AW800726 Hs789 
M13509 Hs.83169 
Y18024 HSJ8877 
AU077231 Hs.82932 
NM^005424Hs.78824 
X65965 

AL021154 HsJ6884 
AA351647 Hs.2642 
AU077309 Hs.171271 
X91247 Hs.13046 
X97748 

Y00815 HSJ5216 
AL120051 Hs.144700 
AW204145 Hs.156044 
AI039243 Hs.278585 
AA533513 Hs.93659 
AW952619 Hs.17235 
T79340 Hs.22575 
At138635 Hs.22968 
AF065214 Hs.18858 
AW076098 Hs^588 
AB037816 Hs.8982 
AA313825 Hs.21941 
W84893 Hs.9305 
AA205759 Hs.10119 
AW388633 Hs.6682 
AA807881 Hs.25329 
AW338625 Hs.22120 
BE539071 Hs.69388 
AI805717 Hs.289112 
AL047586 Hs.10283 
AB024334 Hs.25001 
AL135159 Hs.20340 
AW504170 Hs.274344 
AW370946 Hs.23457 
W46802 Hs.81988 
AI678765 Hs.2l812 
X64116 Hs.171844 
H93366 HsJ567 
AA356392 Hs.21321 
BE019681 Hs.6019 
W21493 Hs.28329 
AL046859 Hs.3407 
AA186715 Hs,336429 
NIVL014038HS.5216 
W25491 Hs^8909 
N48670 Hs.28631 
H94997 Hs.16450 
BE313412 Hs.7961 
AF124251 HS.260M 
BE185536 Hs.301183 
AA489245 Hs.88500 
AW243614 Hs.18063 
A1817130 Hs.9195 
AL117424 Hs.25035 
BE147611 Hs.6354 
NM_012331Hs.26458 
D51069 Hs.211579 
NM_005397Hs.16426 
W28391 Hs.343258 
T40064 HS71968 
AI554545 Hs.68301 
AB029000 HS70823 
AA147186 

AI056548 HS72116 
AF047033 Hs.1 32904 
AW007485 Hs.87125 
AW956580 Hs.42699 
AA055415 Hs.13233 
AW001579 Hs.9645 
AA782114 Hs.28043 
AA035211 Hs.17404 
AI287912 Hs.3628 
AA300087 Hs.33032 



early growth response 1 

filamin A« alpha (aclin-binding protein- 

GR01 oncogene (melanoma growth stimulati 

matrix metailoproteinase 1 (interstitial 

inositol 1,4.5-trisphosphate 3-kinase B 

cydin D1 (PRAD1: parathyroid adenomatos 

tyrosine kinase with immunoglobulin and 

gb:H.sapiens SOD-2 gene for manganese su 

inhibitor of DNA binding 3, dominant neg 

eukaryotic translation elongation factor 

eaten In (cadherln-^ssociated protein), b 

thioredoxin reductase 1 

gbiH.sapfens PTX3 gene promotor region. 

protein tyrosine phosphatasOt receptor t 

ephrin-BI 

ESTs 

ESTs 

protein disulfide isomerase related prot 

Homo sapiens done TCCCIA00176 mRMA sequ 

B-ceU CLl/lymphoma 6, member B (zinc fl 

Homo sapiens done IMAGE:451939, mRNA se 

phosphollpase A2. group IVC (cytosolic. 

desmoplakin (DPI. DPI!) 

Homo sapiens, done !MAGE:3506202. mRNA, 

AD036 protein 

angiotensin receptor-tike 1 

hypothetical protein FLJ14957 

solute carrier family 7, (cationic amino 

ESTs 

ESTs 

hypothettcat protein FU20505 

CGM3 protein 

RHA binding motif protein 8B 

tyrosine 3-monooxygenase/tryptophan 5-mo 

K1AA1002 protein 

hypothetica! protein MGC12942 

ESTs 

disabled (Orosophita) homolog 2 (mitogen 
ESTs 

Homo sapiens cDNA: FU22296 fis, done H 

Homo sapiens cDNA: FLJ21962 fis, done H 

Honrra sapiens done FLB9213 PR02474 mRNA. 

Homo sapiens cDNA: FU21288 fts, done C 

hypothetical protein FU 14005 

protein kinase (cAMP-dependent, catalyti 

RIKEN cDNA 9130422N19 gene 

HSPC028pn3teln 

hypothetical protein FLJ22471 

Homo sapiens cDNA: FU22141 fis, done H 

ESTs 

Homo sapiens done 25012 mRNA sequence 
novel SH2-containing protein 3 
molecule possessing ankyrin repeats Indu 
mftogen-activated protein kinase 8 inter 
Homo sapiens cDNA FLJ10768 fis, ctone NT 
Homo sapiens cDNA FU13698 fis, done PL 
chloride intracellular channel 4 
stroma! cell derived fador receptor 1 
methionine sulfoxide reductase A 
mefanoma ceil adhesion molecule 
podocalyxin-Iike 

proliferation-associated 2G4, 38kD 

Homo sapiens mRNA; cDNA DKF2p564F053 (fr 

ESTs 

KIAA1077 protein 

gb:zo38dD1.s1 Stratagene endothelial eel 
hypothetical protein FU20992 similar to 
solute canier family 4, sodium bicaibon 
EH-domain containing 3 
ESTs 

ESTs. Moderately similar to A47582 B^l 
Homo sapiens mRNA for KIAA1741 protein, 
ESTs 
ESTs 

mitogen-adivated protein kinase kinase 
hypothetical protein DKFZp434N185 
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111356 

111378 

111741 

111769 

112318 

112951 

113057 

113195 

113490 

113542 

113803 

113847 

113910 

113947 

114047 

115061 

115819 

115870 

115964 

116228 

116264 

116314 

116589 

117023 

117112 

117156 

117176 

117280 

119559 

119866 

120655 

121314 

121335 

121822 

121835 

122331 

122577 

123160 

123486 

124059 

124339 

124358 

124364 

124726 

124763 

125167 

125304 

125307 

125329 

107985 

125598 

125609 

116024 

418000 

126399 

127435 

127566 

127619 

434190 

128453 

128495 

128515 

128580 

128623 

128642 

128669 

128903 

128914 

129087 

129188 

129226 

129265 

129345 

129468 

129488 

101838 



N90933 

N93764 

R26124 

R27957 

R55470 

T16550 

T26674 

T57112 

T88700 

T90527 

W427&9 

W60002 

W78175 

W84768 

W94427 

AA253217 

AA426573 

AA432374 

AA446622 

AA478771 

AA482594 

AA490588 

D59570 

H88157 

H94648 

H97538 

H98670 

N22107 

W38197 

W80814 

AA287347 

AA4D2799 

AA404418 

AA425107 

AA425435 

AA442872 

AA452860 

AA488687 

AA599674 

F13673 

H99093 

N22495 

N23031 • 

R15740 

R39610 

W45560 

Z39833 

Z40583 

AA825437 

R66613 

R66613 

AA868063 

AA128075 

AA1 28075 

AA128075 

N66570 

A1051390 

AA627122 

AA627122 

X02761 

AF010193 

AA14g044 

U82108 

D78676 

L35240 

AA598737 

R69417 

AA232837 

N72695 

M30257 

M96843 

X68277 

AA292440 

J03040 

AA228107 

AA449789 



BE301871 Hs.4867 
AW160993 Hs.326292 
AB020653 Hs.24024 
AW629414 Hs.24230 
AW083384 Hs.11067 
AA307634 Hs.6650 
AW194301 Hs.339283 
H83265 Hs.8861 
BE178110 Hs.173374 
H43374 Hs.7890 
AW6807D9 Hs^83683 
NM.005D32Hs.4114 
AA1 13262 Hs.17901 
W84768 

AL035858 Hs.3807 
AI751438 Hs.41271 
AA486620 Hs.41135 
NM-005985HS.48029 
AA987568 Hs,74313 
AI767947 Hs.50841 
D51174 Hs.272239 
AI799104 Hs.178705 
AI557212 Hs.17132 
AW070211 Hs.102415 
AW969999 Hs.293658 
W73853 

H45100 Hs.49753 
M18217 Hs.172129 
W38197 

AA496205 Hs.193700 
AA305599 Hs.238205 
W07343 Hs.182538 
AA404418 
AI743860 

AB033030 Hs.300670 
AL133437 Hs.1 10771 
AA829725 Hs.334437 
AA488687 Hs.284235 
BE019072 Hs.334802 
BE387335 Hs283713 
Hg9093 Hs.343411 
AW070211 H5.102415 
AF265555 Hs^646 
NM.003654HS.104576 
BE410405 UsJBZaa 
AL137540 Hs.102541 
AL359573 Hs.124940 
AW580945 Hs.330466 
AA825437 Hs.58875 
T40064 Hs.71968 
T40064 Hs.71968 
AA868063 Hs,104576 
AAOSBJe? Hs.83883 
AA932794 Hs.83147 
AA088767 Hs.83883 
X6g086 Hs.286161 
AI051390 Hs.1 16731 
AA627122 Hs.163787 
MSm22 Hs.163787 
X02761 Hs.287820 
NM_005904Hs.100602 
BE395085 Hs.10086 
U82108 Hs.101813 
BE076608 Hs.105509 
Z28913 Hs.102948 
W28493 Hs.160414 
AW1S0717 Hs.345728 
AW867491 Hs.107125 
AI348027 Hs.108557 
NM.001078HS.109225 
BE222494 Hs.180919 
AA530892 Hs.171695 
R22497 Hs.1 10571 
AW410538 Hs.1 11779 
AW966728 Hs.54642 
BE243845 Hs.75511 



mannosyl (a[pha-1.3->{|tycoprotein beta- 
hypothetlcalgene DKFZp434A1114 
KIAA0846 protein 
ESTs 

ESTs. Highly similar to T46395 hypotheti 
vacuolar protein sorting 45B (yeast homo 
Human DMA sequence from done RP1-187J11 
ESTs, Weakly similar to 841044 chromosom 
Homo sapiens cDNA FLJ10500 lis. clone NT 
Homo s£y)lens mRNA for KIAA1671 pnotefn, 
chromosome 8 open reading tame 4 
plastkidfTisofonn) 

Homosapiens, done IMAGE:3937015. mRNA, 
gb:zhS3d03.s1 SoaresJetaLliver.spleen. 
FXYD domain-containing ion transport reg 
Homo sapiens mRNA full length insert cDN 
endomudn-2 

snail 1 (dtosophlEa homolog), zinc finge 

KIAA1265 protein 

ESTs 

lysosomal 

Homo sapiens cDNA FUl 1333 fis, done PL 
ESTs. Moderately similar to 154374 gene 
Homo sap)ens mRNA; cDNA DKF2p586N0121 (f 
ESTs 
ESTs 

uveal autoant^en with coiled coil domai 
Homo sapiens cDNA: FU21409 fis, done C 
Empirically selected from AFFX single pr 
Homo sapiens mRNA; cDNA DKFZp58610324 (f 
hypothetical protein PRO2013 
phospholipid scramblase 4 
gb:zw37e02.s1 Soares_totaLfetus_Nb2HF8. 
metallothioneln IE (functional) 
K1AA1204 protein 

Homo sapiens cDNA: FLJ21904 fis. done H 
hypothetical protein MGC4248 
ESTs. Weakly similar to 138022 hypotheti 
Homo saptens cDNA FU14680 fis. done NT 
ESTs. Weakly simHar to S64054 hypotheti 
DEAO/H (Asp^Iu-Ala^p/His) box polypep 
Homo sapiens mRNA; cDNA DKFZp586N0121 (f 
bacuiovbal lAP repeaVcontaining 6 
carbohydrate (keratan sulfate Gal-6) sui 
ca/pain 2. (m/ll) large subunit 
netrtn4 

6TP-blnding protein 

ESTs 

ESTs 

Homo sapiens mRNA; cDNA DKF2p564F053 (fr 
Homo sapiens mRNA; cDNA DKF2p564F053 (fr 
carbohydrate (keratan sulfate Gai-6) sul 
transmembrane, prostate androgen induced 
guanine nudeotide binding prDtein4ike 
transmembrane, prostate androgen Induced 
Homo sapiens cDNA FU13613 fis. done PL 
ESTs 
ESTs 
ESTs 

fibronectin 1 

MAD (mothers against decapentapleglc. Dr 
type I transmembrane protein Fn14 
solute canier family 9 (sodium/hydrogen 
CTL2gene 

enigma (LIM domain protein) 
heat shock 70kD protein 8 
STAT Induced STAT Inhibitor 3 
plasmalemma veside assodated protein 
hypothetteal protein PP1057 
vascular cell adheston molecule 1 
inhibitor of DNA lading 2, dominant neg 
dual spedfidty phosphatase 1 
growth anest and DNA-damage-inductble, 
secreted protein, addle cystelne^lch 
methkxiine adenosyltransferase II. beta 
connective tissue growth factor 
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413731 


AA449789 


BE243845 


Hs.75511 


129557 


W01367 


AL045404 


Hs.45366 


129619 


AA610116 


AA209534 


Hs.284243 


129627 


AA258308 


T40064 


Hs.71968 


129762 


AA460273 


AA453694 


Hs.12372 


129884 


AA286710 


AF055581 


Hs.13131 


130018 


T68873 


AA353093 




130147 


D63476 


D63476 


Hs.172813 


130178 


M62403 


U20982 


Hs.1516 


130282 


X55740 


BE245380 


Hs.153952 


130431 


L10284 


AW505214 


Hs.155560 


130495 


AA243278 


AW250380 


Hs.109059 


130553 


AA430032 


. AF062649 


Hs.252587 


130638 


H16402 


AW021276 


Hs.17121 


130839 


D59711 


AI557212 


Hs.17132 


130857 


T94452 


AW337575 


Hs.201591 


130866 


AA431571 


BE548267 


Hs.337986 


130776 


R79356 


AF167706 


Hs.19280 


130818 


AA280375 


AW190920 


Hs.19928 


130840 


Z49269 


BE048821 


Hs.20144 


130899 


Z41740 


AI07728e 


Hs.296323 


131002 


AA121543 


AL05Q295 


H5.22039 


131080 


J05008 


NWL001955HS.2271 


131084 


AA101878 


NM.017413HS303084 


131091 


T35341 


AJ271216 


Hs.22880 


131107 


N87590 


BE620886 


Hs.75354 


131182 


AA256153 


AI824144 


Hs.23912 


131207 


W74533 


AF104266 


Hs.24212 


131319 


U25997 


NIVL003155HS.25590 


131328 


V01512 


AW939251 


Hs.25647 


131509 


X56681 


X56681 


Hs.2780 


131555 


AA161292 


T47364 


Hs.278613 


131564 


AA491465 


T93500 


Hs.28792 


131573 


AA046593 


AA040311 


Hs.28959 


131692 


D50914 


BE559681 


Hs.30736 


131756 


D45304 


AA443966 


Hs.31595 


131859 


M90657 


AW960564 




131909 


W69127 


NP\/L016558Hs.274411 


131915 


AA316186 


AI161383 


Hs.34549 


132046 


AA384503 


AI359214 


Hs.179260 


132050 


AA136353 


AI267615 


Hs.38022 


132151 


AA044755 


BE379499 


Hs.173705 


132164 


U84573 


AJ752235 


Hs.41270 


132187 


AA058911 


AA235709 


Hs.4193 


132303 


AA620962 


6E177330 


Hs.325093 


132314 


AA285290 


AF112222 


Hs.323806 


132358 


XG0486 


NM.003542HS.46423 


132398 


R31641 


AA876616 


Hs.16979 


132421 


AA489190 


AW163483 Hs.48320 


132490 


F13782 


NM.001290Hs.4980 


132520 


AA257993 


AA2579g2 


Hs.50651 


132546 


M24283 


M24283 


Hs.168383 


132610 


AA443114 


AA160511 


Hs.5326 


132716 


T35289 


BE379595 


Hs,283738 


132840 


N23817 


BE218319 


Hs.5807 


132883 


AA047151 


AA373314 


Hs.5897 


132968 


N77151 


AF234532 


Hs.61638 


132989 


AA480074 


AA480074 


Hs.331328 


132999 


Y00787 


Y00787 


Hs.624 


133071 


T99789 


BE384932 


Hs.64313 


133076 


W84341 


AW946276 Hs.6441 


133099 


L09209 


W16518 


Hs.279518 


133147 


D12763 


AA026533 


Hs.66 


133149 


T16484 


AA370045 


Hs.6607 


133161 


AA253193 


AW021103 Hs.6631 


133200 


AA432248 


AB037715 


Hs.183639 


133220 


X82200 


NM.006074HS.318501 


133260 


AA083572 


AA403045 


Hs.6906 


133295 


L00352 


AI147861 


Hs.213289 


133349 


N75791 


AW631255 Hs.8110 


133391 


X57579 


AW103364 Hs.727 


133398 


X02612 


NM.000499HS.72912 


133436 


H44631 


BE29406d 


Hs.737 


133454 


AA090257 


BE547647 


Hs.177781 


133478 


X83703 


X83703 


Hs.31432 


133491 


L40395 


BE619053 


Hs.170001 



connective tissue growth factor 
KIAA0948 protein 
tetraspan NET-6 protein 

Homo sapiens mRNA; cDNA DKFZp564F053 (fr 

tripartite motif protein TRIM2 

lysosomal 

metailothioneinIL 

PAK-interacting exchange factor beta 
Insulin-like growth factor-binding prote 
5' nucleotidase (C073) 
calnexin 

mitochondrial ribosomai protein L12 
pituitary tumor-transfomning 1 
ESTs 

ESTs, Moderately similar to 154374 gene 
ESTs 

Homo sapiens cDNA FLJ10934 fis, clone OV 

cysteine-rich motor neuron 1 

hypothetical protein SP329 

snfiaO inducible cytokine subfamily A (Cy 

serum^iucocorticotd regulated kinase 

Kf AA0758 protein 

endothelDi 1 

apelin; peptide ligand for APJ receptor 

dipeptidylpeptidase 111 

6CN1 (general control of amino-add synt 

ESTs 

latrophllin 

stanniocalcin 1 

V'fos FBJ murine osteosarcoma viral onco 

jun D proto-oncogene 

interferon, alpha-inducible protein 27 

Homo sapiens cDNA FU1 1041 fis. done PL 

ESTs 

KIAA0124 protein 
ESTs 

transmembrane 4 superfamily member 1 

SCAN domain-containing 1 

ESTs, Highly similar to S94541 1 done 4 

chromosome 14 open reading frame 4 

ESTs 

Homo sapiens cDNA: FLJ22050 fis. done H 

procollagen-lysine, 2-oxoglutarale 5^^k> 

DKF2P58601624 protein 

Homo sapiens cDNA: FLJ21 21 0 fis, ctone C 

pinln. desmosome associated protein 

H4 histone family, member 6 

ESTs, Weakly simDar to A43932 mudn 2 p 

double ring-finger protebi, Dorfin 

UM domain binding 2 

Janus kinase 1 (a protein tyrosine kinas 

Intercelhilar adhesion molecule 1 (CD54) 

amino add system N transporter 2; porcu 

casein kinase 1, alpha 1 

GTPaseRab14 

Homo sapiens mRNA; cDNA DKFZp586P1622 (f 

myosin X 

hypothetical protein FU13213 

interleukin 8 

ESTs. Weakly similar to AF257182 1 G-pro 
Homo sapiens mRNA; cDNA DKFZp586J021 (fr 
amyloid beta (A4) precursor-like protein 
interleukin 1 receptor-like 1 
AXIN1 up-regulated 
hypothetical protein FIJ20373 
hypothetical protein FLJ10210 
Homo sapiens mRNA full length Insert cDN 
Homo sapiens cDNA: FU23197 fis, done R 
low density lipoprotein receptor (famili 
L-3-hydFoxyacyl-Coenzyme A dehydrogenase 
inhibin, beta A (activin A, activin AB a 
cytochrome P450. subfamily I (aromatic c 
immediate early protein 
hypothetical protein M6C5618 
cardiac ankyrin repeat protein 
eukaryotic translatton initiatton factor 
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133510 AA227913 AW880841 Hs.96908 

133517 X52947 NM_000165Hs.74471 

133526 M11313 AU077051 Hs.74561 

133538 L14837 NMJ03257Hs.74614 

5 133562 M60721 M60721 Hs74870 

133584 D90209 D90209 HS.1B1243 

133590 767986 T70956 Hs.75106 

133617 AA148318 BE244334 Hs.75249 

133651 U97105 AI301740 Hs.173381 

10 133671 T25747 AW503116 Hs301819 

133678 K02574 AW247252 

133681 D78577 AI352558 

133722 X53331 AW969976 Hs.279009 

133730 S73591 BE242779 Hs.179526 

15 133750 X95735 BE410769 Hs.75873 

• 133802 L16862 AW239400 Hs.76297 

133625 U44975 BE616902 Hs.285313 

133838 M97796 BE222494 Hs.180919 

133859 U86782 U66782 Hs.178761 

20 133889 AAa99391 U48959 Hs.211582 

133980 M19267 M19267 Hs.77e99 

133975 D29992 C18356 Hs.295944 

133977 L19314 AI125639 Hs.250666 

134039 S78569 Nt\L002290Hs.78672 

25 134075 U28811 NIVL012201HS.78979 

134081 L77886 AL034349 Hs.79a05 

134164 C14407 AW245540 Hs79516 

134203 M60278 AA161219 Hs.799 

134238 R81509 AA102179 Hs.160726 

30 134299 AA487558 AW580939 Hs.97199 

134332 D86962 D86962 Hs.81875 

134339 AA478971 R70429 Hs.81988 

134343 D50683 D50683 Hs.82028 

134381 U56637 AI557280 Hs.184270 

35 134403 M61199 AA334551 

134416 M28882 X68264 Hs^11579 

134493 X15183 M30627 Hs.289088 

134558 S53911 NM_001773Hs.85289 

134817 U20734 AU076592 Hs.198951 

40 134983 D28235 D28235 Hs;ig6384 

134989 AA236324 AW968058 Hs.92381 

135052 AA148923 AL136653 Hs.93675 

135062 AA174183. AK000967 Hs.93872 

135069 AA456311 AA876372 H5.93961 

45 135071 L08069 W27190 Hs.94 

135073 AA452000 W55956 Hs.94030 

135170 AA282140 T53169 Hs.9587 

135196 J02854 C03577 Hs.9615 

135348 AA442054 U80983 Hs.268177 



p53-mduced protein 

gap junction protein, alpha 1, 43kD (con 

alpha-2-macrog)obuiln 

tight junction protein 1 (zona occludens 

H2.0 (Drosophila)-lii(e homeo box 1 

activating transcription factor 4 (tax-r 

dusterin (complement lysis Inhibitor, S 

ADP-ribosylation factor-lilce 6 interadi 

dihydropyrfm(c(fnase4ike 2 

zinc finger protein 146 

nucleoside phosphorytase 

tyrosine 3-monooxygenase/tryptophan 5-mo 

matrix Gla protein 

upregulated by 1.25-dihydroxyvitamin D-3 
zyxin 

G protein-coupled receptor icinase 6 
core promoter element binding protein 
inhibitor of DNA binding 2. dominant neg 
268 proteasome-assodated padi homolog 
n^osin^ light polypeptide idnase 
tropomyosin 1 (alpha) 
tissue factor pathway Inhibitor 2 
hairy (Drosophiia)-homolog 
laminin, alpha 4 
Gotgi apparatus protein 1 
protein tyrosine phosphatasOt receptor t 
brain abundant membrane attached signal 
diphtheria toxin receptor (heparin-blndi 
Homo sapiens cDI^ FU1 1680 fis, done HE 
complement component Clq receptor 
growth factor receptor-bound protein 10 
disabled (DrosophBa) homolog 2 (mitogen 
transforming growth factor, beta recepto 
capping protein (adin filament) muscle 
sperm spedfic antigen 2 
melanoma cell adhesion molecule 
heat shock 90kD protein 1, alpha 
CD3A antigen 
iun 8 proto-onoogene 
prostaglandin^doperoxide synthase 2 (p 
nudix (nucleoside diphosphate linked moi 
decidual protein induced by progesterone 
KIAA1682 protein 

Homo sapiens mRNA; cDNA DKFZp667D095 (fir 
DnaJ (Hsp40) homolog. subfiamily A. membe 
Homo sapiens mRNA; cDNA DKFZp588E1624 (f 
Homo sapiens cDNA: FU22290 fis. done H 
myosin regulatory light chain 2. smooth 
phospholipase C, gamma 1 (fonnerty subty 
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TABLE 4A 

Table 4A shows the accession numbers for those pkeys lacking unigenelD's for Table 4. The pkeys in Table 7 lacking unigenelD's are represented within 
Tables 1-6A. For each probeset we have listed the gene cluster number from which the oligonucleotides were designed. Gene clusters were compiled 
5 using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence similarity using Clustering and 

Alignment Tools (DoubleTwist. Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the "Accession" 
column. 



1 0 Pkey. Unique Eos probeset identiHer number 

CAT number Gene cluster number 
Accession: Genbank accessfon numbers 



15 Pkey 



CAT Number Accession 



100752 33207.21 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



117156 145392_1 
131859 367^1 



125565 1704098J 
133607 1227_6 



T81309 BE019033 R94181 BE019198 NMJD00612 J03242 AW411299 BE300064 BE297544 R94182 AW630108 T53723 
D58853 H78073 H8G594 BE299560 T48899 H70196 M17426 N77077 S77035 H58384 H61664 H78540 T84527 C17198 
H60255 H71980 R92644 W79050 X00910 M29645 R91055 M17863 M17862T71815 BE299561 BE464561 X06260 
R94741 T54216 C18594 BE262015 X06161 AW409889 AA378400 BE263228 BE313278 R881 16 BE313457 H43500 
T48617 BE313761 H77309 AI207601 X06159 H40413 X03425 T87663 R10627 X03562 M14118 W03982 R97520 H81229 
T83157 H83168 H4876? AA669898 BE263054 H47289 AA0228D7 R1 1555 H74260 R76968 R28338 H72534 H72464 
H62031 N72478 N45355 AW411300 R89113 R69135 H58454 T83281 R93476 H69645 H68015 T82229 H71089 T85121 
H59939 W65299 N78178 H53909 N72373 R21788 H04660 H59639 H61874 BE262219 T53814 N73335 N50464 W00943 
N77189 R89257 AA570502 R89432 R06366 AA553480 AA776271 AA551359 AA551050 H51670 AA601052 BE299081 
H68198 H52276 BE207832 N91192 H70332 X07868 X07868 H69464 H53782 H73710 R80435 AA553384 AW884176 
N53475 T71662 AW954036 AW954033 AA552931 H93206 AA430218 AA553476 A1918470 T54124 8E207982 BE3a0177 
N73994 AW882625 N39549 N53838 AA722389 H71878 H58909 H37849 H78435 T47933 R77174 R83814 AA411890 
H94199 AA653208 BE205778 AA490137 H70492 R98232 H37800 AA679294 H40341 H74238 H47290 H73231 T48618 
AA025428AI039521 H92969 N59389 H80538 H72933 T90630 AA411891 N55000 H74225 AA340290 AW957061 T54316 
AA340437 H57125 H58908 H79027 H63450 N74623 R93425 H68714 H68758 N68396 H48763 N69256 H57320 H53831 
H53589 N68833 N52453 H56048 H69870 H78074 R69253 R83375 T53615 H94330 H58455 H90864 T47934 H74261 
R89258 R97997 R91056 R28339 R85760 H78235 R97521 H67692 1440358 AA022688 H52513 H59801 T88690 H65256 
H63397 W65397 AA553588 R19280 N52645 W73930 R06367 R21743 H72372 Isl73921 AW883539 AW882639 T40616 
H47084 R95723AA634316AA862781 H77310 R91389H93111 R92767T54512 R89341 H70333 H57817 H82941 H62032 
. N52638 H58385 T91796 H51086 AA340292 T49918 H81230 R36121 N50411 T87664 N62436 N39340 AA665637 
AA340446 H93377 H92973 BE296290 BE269788 H61665 AA340444 N54605 AA454101 R10628 R94200 AI200549 
AA342640 BE298855 BE250229 T49916 H82008 N28278 AW880662 H71268 N76791 H47685 H65255 W05198 
AW8B9144 N76677 H71702 H68036 H71915 R91612 R87807 H68059 AI133328 A1247866 AA621443 AW881050 
AA700847 AA34041d AWe78608 AW881 181 AW878249 H71916 N54596 BE161581 AW878082 W04212 AW881040 
AW6854g2 AW880519 AA334887 AW878715 W06882 AW630222 AW885381 H70869.AW381778 H47601 AW889982 
H63868 AW884986 AW878713 AW878685 R36391 AW878694 AA368070 C03393 AWB78695 AW878705 AW878665 
AW878742 AW878620 AW878823 AW878688 R2904d AW878690 AW878686 AW878810 AW878827 AW878733 
AW878659 AW878749 AW878681 AW883353 AW883277 AW883300 AW883565 AW883298 AW883143 AW883045 
AW883482 AW883352 AW883417 AW883357 AW883231 AW8B3474 AW883355 AW882620 AW882533 AW8B3754 
AW883139 AW882d27 AW883641 AWd83567AW883481 AW882983 AW882982AW862465 AWd83419 AW882466 
AW883639 AW883230 AW882981 AW882534 AW882874 AW882619 AW883480 AW882826 AW882831 AW882835 
AW882830 AW883563 AW882456 AW627642 

W73853 AA9281 12 W77887 AW889237 AA148524 Ar749182 AI754442 AI336392 AI253102 AI079403 AI370541 AI697341 
H97538 AW188021 AI927669 W72716 AI051402 AI188071 AI335900 N21488 AW770478 W92522 A1691028 AI913512 
AI144448 W73819 AA604358 N28900 W95221 A1888132 H98465 AA148793 

AW960564 AA092457 T55890 D56120 T92525 A)815987 BE182608 BE182595 AW080238 M90657 AA347236 AW961686 
AW176446 AA304671 AW583735 T61714 AA316968 AI446615 AA343532 AA083489 AA488005 W52095 W39480 N57402 
D82638 W25540 W52847 D82729 D58990 BE619182 AA315188 AA308636 AA112474 W76162 AA088544 H52265 
AA301631 H80982AA1 13786 BE620997 AW651691 AA343799 BE613669 BE547180 BE546656 F11933 AA376800 
AW239185 AA376086 BE544387 BE619041 AA452515 AA001806 AA190873 AA180483 AA159546 F00242 AI940609 
AI940602 AI189753 T97663 T66110 AW062896 AW062910 AW062902 AI051622 A1828930 AA102452 A1665095 AI819390 
AA557597 AA383220 AI804422 AI633575 AW338147 AW603423 AW606800 AW750567 AW510672 AI250777 AA083510 
AW629109 AW513200 AA921353 AI677934 AI148698 AI955858 AA173825 AA453027 AI027865 AW375542 AA454099 
AA733014 AI591384 R79300 R80023 AA843108 AA626058 AA844898 AW375550 AA889018 A1474275 AW205937 
AI052270AW388117 AW388111 AA699452 AI242230 N47476 H38178 AA366621 AA113196 AA130023 H39740 T61629 
AI885973 AW083671 AA179730 AA305757 AI285455 N83956 AA216013 AA338155 AW999959 T97525 AA345349 
T91762 AA771981 AI285092 AI5913B6 BE392486 BE385852 AA682601 AI682884 AA345840 T85477 AA292949 
AA932079 AA098791 D826Q7 T48574 AW752038 C06300 
R20840 R2083g 

BE273749 BE397561 BE387188 Alj037858 AL037878 AI963094 BE259216 AA01 1363 AL036189 BE562325 AA251 169 
BE617431 N98537 AA158093 AL047800 M34539 NM.000801 AA312140 D16971 AA158904 AA3071 14 AA312803 
T09203 AW629686 AL048504 BE388578 AA220957 AA158364 BE267385 AA294971 C18055 BE241757 AA1 15056 
A1936769 BE37a435 BE208971 AW674g24 BE622060 AA604674 AA115273 AW402159 AA338608 BE568819 M80199 
X55741 AA37511 1 AA376016 BE612671 AA805742 AW405568 N25850 N44580 H08031 AW403549 BE536552 AA056726 
BE543239 AA082517 AI201645 AI201642 AI192622 N40104 AA370921 BE547569 AI969602 AA302038 AI197890 
AW268354 AI014938 W45448 AI541395 AA037272 BE538826 AL039613 BE538130 AA299355 AW805147 AW974624 
. H53220 AI471471 AA399303 AA007386 W35106 BE613277 R12739 R12738 AA304342 AA687802 BE4095B1 AI498844 
AV662092 AW904105 AA011375 BE315214 H99302 BE537893 N32299 AW855829AI291320 BE078322 AI301395 
AA303362 N32719 AA358328 AA357877 AI952540 H56279 H02758 H02048 AW805233 R82224 AA410772 AA291352 
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BE171 109 N69935 BE169248 AA361173 H44978 BE617887 D52560 AA084043 W03595 R67219 N36477 N42924 R67104 
H44901 H79695 W21 105 AA393988 W30e99 AA316096 BE622896 W46872 AA442678 BE544893 BE5401 12 BE621873 
AA338067 N55052 BE398154 BE621210 AA740760 C03739 C03206 BE396692 AA482370 AA031614 AA301575 
AA304710 AA132153 AA029796 AA994960 H19587 AA442989 H49781 H46871 AA035395 AA056185 AA149378 
AA643080 AL135479 AA292329 AA654337 AA041228 AA454888 AA025039 W58331 AA625981 T94941 AA302448 
H19900 AA218956 AA513790 AA563962 AA39B076 W44441 AA293276 W47373 AA625879 W30688 AA043029 T64284 
R79151 AA304340 AA485186 AA604939 R82470 AA421425 AW771456 AI339329 AA304424 AA605236 AA936934 
AA587673 AI209162 A1697301 AM79995 AI679814 AI361950 AW189125 At955888 AI986019 BE301019 AI084792 
AI310211 AW189307 At022070 AW977204 AI146825 AW190163 AW303281 AI82B345 BE(M6043 AW029257 AA482268 
AI246507 AI420729 AW084932 AW439514 AI890487 AW439692 AI523896 AI186812 A1659953 AI889773 AA687527 
AW072694 AW262153 AW467371 AI613269 AI679238 D54404 AA158103 AW105527 AW149739 AW150361 AW268387 
AWl 1770B A1951682 AI687440 AW674285 AA67B365 AI587082 AA732095 AA019899 W45661 AA627300 BE613304 
AA765691 AAei2935 AI814658 AW316916 R66594 AA514640 AAQ25040 AA031472 AW732076 AA029797 AI244560 
AI128734 AW381720 AI092360 AI263283 AW613175 AI890675 A1720156 AW63134B AI635106 A1278045 AA303979 
AA703505 W45449 AW078661 AI292052 AW381707 AI147854 AW381743 AA158905 AA303258 AA888144 AW195967 
AA428706 AA989559 AA617731 H19882 BE543418 AA830386 AA421302 W5B652 T94995 AI869743 AI679145 
AW085971 N98425 AA765136 AI347027 AI356955 AA928038 AI679717 AA458459 AA679281 AI367973 AI270041 
AA765135 AA732793 AI798447 AA668646 AA251008 AI984538 AI401737 AA056186 BE043308 AW662375 AI3021 10 
N50724 W96332 BE537047 N26983 AB67172 AA765296 AW673237 N29784 AA534275 AA084044 AW067973 
AW300766 T63398 W46823 R39790 AI364185 AW298582 AA454814 AW069878 N67751 H05982 N23140 AI362647 
AI302086 AI767772 N25755 H531 14 AA706133 T9351 1 AA429291 AA935294 AA987647 W02803 R66595 AI680795 
W23673 AW440794 AA722872 H49538 AW131042 AA531603 AA908665 AA040791 AA235312 W52205 N93444 R82180 
H02759 H79696 AW088894 H56079 AA961143 AW067776 AW973745 AA01631 1 AW071227 AA017511 AI753994 
W47374 T64165 AA296092 AI698626 AA558158 AA296088 AW794259 H01963 AA149267 AA485076 AA975856 H44938 
AA035396 AI955555 H46289 AA486161 AI631222 AA359047 AW794253 AI806962 AW243930 AA526145 AW878734 
AA018464 AA132031 R67220 R79152 AA296093 H54300 AI005160 BE242548 AW992803 AW878644 AW878666 T27742 
R82471 AW517604 AW472738 A1282904 R39791 AA486098 AW467891 AW960520 AA551736 AA056621 AW945197 
R66373 AA554236 BE242202 AI904376 A1832590 H19484 R00890 A1627677 AA302287 AI869451 A1734855 A(708073 
AI832902 AA585184 AW204299 AA055565 D12417 D11975 T63543 AW664099 R54423 BE612712T96340 T63985 
AA598917 T40735 T64053 AA149284 AW272548 AA363445 AA042893 AW300697 BE261973 T53501 T53500 AW878729 
AW878657 AW794391 AA0S9193 R01553 H44875 AA385406 AA533968 M93060 AL135600 W96331 AA017651 
AA018849 AA017692 H85337 BE278690 AA731598 AA018512 A!076813 AI022644 R02585 X52220 AW296894 AA825671 
AI699321 AI393601 AW592611 AI146747 AA60B921 AA168365 AW590007 AA354519 D20081 R02704 AW798339 
M92422 AA094903 AA007676 

133681 13893 1 AI352558 282248 X78138 NM-003405 AU077248 AA223125 S80794 D78577 AI124697 AW403970 BE614089 BE296713 

BE621334 L20422 X80536 D54224 D54950 X57345 N29226 AA127798 AA340253 F08031 AA192540 H67636 AA321827 
AW950283 AA0841 59 BE538808 AW401 377 AA256774 C03366 W46595 W47608 AA305009 He9431 H69456 AL120082 
H1 1 706 AA303717 AA361357 H22042 H78020 AW999584 AA134368 AA32291 1 AA322961 H60980 N85248 N31547 
H79624 T1 1718 W85826 AW894663 AW894624 BE167441 BE170015 AA304626 AW602163 AW998929 AA1566B1 
AA151067 BE002724 AA608688 H82692 BE155392 AW383636 6E155394 AA4&7004 AW383504 AI342365 R62553 
W1649d BE155344 At143938 R69901 AA322873 AW340648 R25364 AA367935 AI559406 AA033522 AA374252 
AW835019 A19221 33 AI697089 N99862 AW189078 A1199076 AW151598 W59944 AA662875 W94Q22 AA299055 
AI039008 AI829449 AA583503 A1635674 AW131G65 At473820 AW273118 AW900930 AA908944 AI68803S AW170272 
AI082545 AW468176 AI608761 AI082748 A1911682 AI248943 AI831016 AA192465 AI218477 AA938406 AA385288 
AI809817 AA905196 AI191245 AI470204 AI188296 AI421367 AI125315 AI087141 AA629032 AA740589 AI554181 
AA150830 AI248541 AI077943 AA775g58 AA864930 A1261476 AI123121 At310394 AA862331 AA872478 BE537084 
AI205606 AA72a684 AI872093 AW150042 AL120538 AA219627 AA988608 C21397 AI359337 H25337 A1089749 
AA605146 AI359620 AA150478 AI359738 AW383642 AW995424 AI766457 R56892 AI089839 W61343 N69107 W46459 
AA565955 N20527 AI279782 W46596 AA776573 H23204 AI866231 AI083995 N21530 AA126874 D82630 W65437 
AI086917 AW382095 AI086877 H69844 AW340217 W85827 L08439 AA262704 AA5053B0 W47413 W94135 AA223241 
AW089153 AA084101 BE538000 AA096126 T28031 AA491574 R84813 AA774536 AW383522 AA155615 AW383529 
AA491520 AW028427AA171496A1469689AW664539AI811102AI811116BE464590BE350791 H78021 T15405H21979 
AA219489 H13301 AA505883 A1864305 AI423963 AW084401 F04963 R69858 H67097 AI917740 AI655561 He9864 
AA033631 AW383484 A1886261 H25293 AA513281 AW271 187 H1 1617 N79982 AI174338 AI904207 AI904208 BE614558 
W94127 W65436 A1272249 AA700018 AI579932 AI085941 AW152629 

134403 17037 1 AA334551 BE008229 AA307537 AW961156 AW995894 AW995826 NM.006751 M61199 AA045603 AL036372 AV645606 

AI688095 AW351901 AA101337 AA101345 N73342 BE018030 BE569044 AW841975 AA373388 BE090412 H95440 
N53845 R67867 AA093441 AA363427 H93708 AW023134 A\W994986 AW994989 BE090429 R23614 A1567932 H03726 
HO1 101 H01867 AA548743 AI671806 AW872949 AW872941 AA742447 AI199788 AA045604 AI637465 AI741796 
AW242217 AW131463 A1765302 AI683923 AA889762 AI804889 AI986437 C08049 BE502340 AI595651 A1491970 
AA496804 AA281008 AA665699 AI473814 BE301445 AA707837 AA551925 AI017348 AI208185 AA775203 AA156296 
AA5574e3 H95441 AA768547 AW769358 AA991197 AA181954 AI091389 A1147289 AW771837 AI638582 AA84441 1 
AI374750 T29320 AW951272 AW085923 H02834 AA843259 AA814696 AW183290 AA158453 N68125 N69039 AA100423 
AA101346 AI918720 H01102 R67868 H01868 N66438 R46580 AI858433 AA599560 AA187577 AA157481 AA361520 
AL047827 AA158452 R21688 AW964874 AA325161 R40871 AW752395 AW375924 R1 3355 AA281 174 AA428908 

126872 142696 1 AW450979 AA1 36653 AA136656 AW419381 AA984358 AA492073 BE168945 AA809054 AW238038 BEOl 1212 BE01 1359 
BE011357 BE011368 BE011362 BE011215 BE011365 BE011363 

121335 279548J AA404418 AI217248 

130018 18986 1 AA353093 AW957317 AW872498 AI560785 AI289110 AW135512 X97261 T68873 

121822 24439? 1 AI743860 N49543 AW027759 BE349467 AI656284 BE463975 R35022 AA370031 AW955302 AL042109 N53092 AI611424 
AL079362 A1969290 AI928016 6E394912 BE504220 BE467505 AI61 1611 A1611407 A1611452 W56437 AI284566 
AI583349 AW183058 A130B085 AI074952 AA437315 AA628161 AW301728 A1150224 AA400137 AA437279 AI223355 
AA639462 AI261373 AI432414 AI984994 A1539335 AA401550 AA358757 AI6D9976 AA442357 AA359393 AA437046 
AA370301 AA429328 AW272055AI580502 AI832944A1038530AA425107At014986AI148349AW237721 AW77g756 ^ 
AW137877 AI125293 AA4004G4 R26554 

123523 genbanlLAA608588 AA608588 
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123533 genban)eAA608751 AA608751 

125091 genban!eT91518 T91518 

123964 genbank_C13951 C13961 

102491 enlfez^USIOlO U51010 

1 18475 genbanleN65845 N66845 

118581 genbanleN68905 N68905 

1 13947 genbanLW84768 W84768 

101447 entrez^M21305 M21305 

101667 13349J NM^005381 M60858 AW373732 AW373724 AW373689 AW373629 AW373609 AW373776 AA187806 AW386946 

AW374207 T05235 AA216203 AW385556 AA306940 AA306526 AA315461 A1036757 AW373711 AW403124 AW403640 
AW377084 T27360 H62638 F06957 AW377051 AA554779 AA378568 AA096007 AW352407 AW302637 F07929 H17433 
AW382712 H05665 F07292 N39875 AA089729 H62556 N42842 R12952 AW373735 AW364155 AA056183 W39185 
AW382708 N32488 AF114096 AW375993 AI133569 W52561 AA603040 AA133710 AI928796 AW176370 AA827519 
AW338437 AA521142 T29341 AI800461 AW317002 AA703914 AA860830 A1859203 AI445772 AA714334 AI817066 
AI832027 AW510442 AI635802 AW088306 AW068672 AW408555 AW467542 AA552657 AA152367 W32081 AA582124 
AA074040 AA931657 AI051154 AW410203 AI921644 H17434 AI832330 AW404836 AI925038 AA088423 AA954166 
AA580453 AW021292 AI267215 AW0800B2 AW383778 AI933053 AI919097 W31557 N90245 AA931591 AA563995 
F36352 AA056184 AA476294 AA641327 AA533550 AI749630 W58323 AA5691 19 AA508573 AI809050 AI378996 
AA411362 AW407505 AA938104 AA074041 AA632876 AW193748 AA507873 AI270128 AI472365 AA411363 AI523216 
AI719965 AI816302 AA182681 AI707990 AA133588 Ai758537 W60253 AI460308 AA135423 AI083904 F04188 N89693 
AW408776 AI678595 AI270568 AA722059 W58234 F33650 AA090547 AA285108 AA425981 N85079 D20218 AI273980 
AA159028 F03226 AW247914 N26918 AW272741 N90109 H05666 N23327 AW247953 R44748 AAg62015 F03558 
At752394 AW409913 AW248396 AI816463 AI752393 AA325370 AA263089 AI570130 AI971951 AI160658 AI357360 
AW168685 AL121075 AW050536 N21672 W67748 AA514242 AI127386 H14607 AI185752 W79364 AA088520 AA152476 
AW351940 AW373883 A1940524 AW374953 T56500 N24329 AI940720 AW374933 AW374947 AW391913 AL138337 
AW376241 AW062943 F26666 AW410202 AW0829S8 F34529 AW381807 AW39331S W17147 AW176359 AA664576 
AW380424 AA306040 AI745674 AW300951 AI188579 AI438973 AI305271 AA4338ia AA612807 AI831809 AI940409 
AA158663AI57298& 

108931 genbanH_AA147186 AA147186 

103138 entrez_X65965 X65965 

103432 entrez^X97748 X97748 

119174 genbanK.R71234 R71234 

133678 1 1235.1 AW247252 AA346143 NM.000270 AA381085 N91995 X00737 AA381079 AA298473 AA2961 10 AA315735 AA31 1617 

AA326750 AA376804 AW403290 T95231 M13953 T47963 H82039 AA279899 AA627997 N76320 N99527 H37842 
W20095 AA457308 AW469547 AA724143 H83220 AA319496 W86334 W30892 R89169 R99427 N41854 H47286 
AA348094 AA045089 R63016 AI922219 A1024906 AI096488 A1885005 AA194872 N90489 A1452544 H7241 1 AA282427 
AA430735 R68963 R22453 H70385 AW129369 AW467320 AW519082 AA345018 AA582183 AI961789 R65918 N30611 
A1979189 AI280889 AW273191 R66531 AI285845 A1675927 AI421990 AW190879 H37794 AA699667 H68427 AA954388 
AI188757 AI140048 AA430382 AI204151 AW247864 AA559099 AI431420 AA548276 AI149466 AA772669 AA694388 
AA724168 AA301651 AA281952 AA779925 AA234760 W86290 AA913603 AW511745 AI500697 AA814922 AA835040 
T47964 H53998 AA975804 R98710 AI077604 N70252 R98084 AW250171 H69268 A1597614 AA970746 AA972548 
AI377116 R62962 H16737 R89070 AA731329 R66532 N54354 AI818832 H81944 N71567 T95122 W86463 AA437095 
AI431999 AI915724 W63851 A(674743 AA457307 AA211475 N64444 AI799146 H72853 R99335 H60413 AA770367 
AA156105 AI269937 H64029 H89728 R65819 AW470496 AI873318 AI735713 H82987 C02447 AI478666 T27651 
AI699770 AW025156 H59719 A1984717 N69225 AI459856 AA953577 AI424691 H13843 R22404 AI873796 AI336C02 
N70898 A1420854 AA541792 AA346142 AI000814 AI828348 AA045090 T51257 N90434 H13890 N73184 AI708083 
AA781606 AA329050 AA339985 R68964 H64795 W04186 H16845 

1 19416 genbanlL.T97186 T97186 

119559 NOT„FOUND_entre^W38197 W38197 

123473 genbanIUl<V599143 AA599143 
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TABLE 5: 



10 



15 



20 



25 



30 



Pkey: Unique Eos probeset identifier number 

Accession: Accession number used for previous f 

ExAccn: Exen^plar Accession number, Genbank accession number 

UnlgenelD: Unlgene number 

Unigene Title: Unigene gene title 



Pkey Accession ExAccn UniGene UnigeneTftle 



115819 
132837 
101545 
102898 
101192 
102915 
105330 
107385 
102024 
134416 
103036 
104885 
106124 
105330 
109001 
104764 
133200 
105263 
105178 
109456 



AA426573 

D58024 

M31210 

X06256 

L20859 

X07820 

AA234743 

U97519 

U03877 

1VI28882 

X54925 

AA045136 

AA423987 

AA234743 

AA156125 

AA025351 

AA432248 

AA227926 

AA187490 

AA23264S 



AA486620 Hs.41135 
AA370362 Hs.57958 
BE246154 Hs.154210 
NI\L002205Hs.149609 
BE247295 Hs.78452 
X07820 Hs.2258 
AW338625 Hs.22120 
NM„005397Hs.16426 
AA301867 Hs.76224 



X68264 
M13509 
T79340 . 
H93366 



Hs.211579 
Hs.83169 
Hs.22575 
Hs.7567 



AW338625 Hs.22120 
AI056548 Hs.72116 
A1039243 Hs.278585 
AB037715 Hs.183639 
AW388633 Hs.6682 
AA313825 Hs.21941 
AW956580 Hs.42699 



AA486620 

AA370362 

BE246154 

NM_002205 

BE247295 

X07820 

AW338625 

NIVL005397 

AA301867 . 

X68264 

M13509 

T79340 

H93366 

AW338625 

A1056548 

A!039243 

AB037715 

AW388633 

AA313825 

AW956580 
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TABLE 5A 

Table 5A shows the accession numbers for those pkeys lacking unigenelD's for Table 5. The pkeys In Table 7 lacking unigenelD's are represented within 
Tables 1-6A. For each probeset we have listed the gene cluster number from which the oligonudeolides were designed. Gene clusters were compiled 
5 using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence similarity using Clustering and 

Alignment Tools {DoubleTwist Oakland California). The Genbank accession numbers for sequences comprising each duster are listed in the "Accession' 
column. 



1 0 Pkey: Unique Eos probeset identifier number 

CAT number Gene duster number 
Accession: Genbank accession numbers 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



Pkey CAT Number 
115819 10241J 

102024 14505 1 



101545 24607J 
109456 180633J 
103036 17145J 



133200 28960^1 



132837 256666J . 



102898 24023.1 



Accession 

AA486620 AF205940 AA297524 AB034695 AA081335 NM.016242 AA188323 AA297537 H88204 AW953081 W31695 
AW582203 AA248250 AW68121 1 AA426230 AA464807 AA426155 N44141 AA347390 AA770661 AI333225 N36136 
AW665724 AA431894 AI374976 AI400254 AI338446 AA18669S H88205 W04527 AA487066 AI051414 AA918383 
AA426573 AA425620 AW438654 AA090513 BE167284 BE167291 AI301726 

AA301867 AW957981 1^614 AA155808 AI920990 AI740711 AA301026 AA301015 AI220981 AI857670 AI537140 
AW015210 AA030000 W46890 H44021 AI355967 AI651735 AA058479 AA146932 T58265 R85890 AA047810 AA017387 
AW026093 AA971133 A1827263 AI056416 AI355994 AI127691 H46603 U03877 NM_004105 AA157357 H42844 
AA146824 AA187709 AA187269 AA304348 AA147292 AA361687 AA156041 AA330636 R32929 AA321130 AW950260 
AA082157 AA029129 AA303708 AA028155 D31561 T84689 AA302493 BE153057 BE153181 W39408 AA187200 
BE153260 AW383337 AW382622 AW382647 AW750072 BE153060 AW382630 AW371865 AW392464 AW382664 
AW382658 AW382650 H61647 AW365075 AW365049 AA373397 BE072779 BE072781 Z30254 W24381 BE153254 
AA040442 BE072729 BE072731 N94740 AA146945 AW802737 Ai826799 AI085395 R34D34 H65140 AA082800 H88275 
AA147824 R63882 W80899 AA296413 AI765300 AI862426 AW022055 AW3000D3 AI743784 AI862635 AI985428 
AA147764 AW573245 AW190290 AI04Q898 D57613 N63457 AA148082 AI02B458 AA1481 10 AW814489 N75105 
AW629443 AA704122 AW582220 AA181240 AA057495 AI418224 AI261751 AW388595 AI472205 AW470672 AA102546 
AA789046 AA182416 AA06266B AW300732 AI286220 AA181982 AA146825 AA028130 AI985522 AA3D3344 AA081313 
N69082 AA182035 AI867128 AA100902 AA605087 N67178 AW020324 AW890446 A1472191 AI335691 AI597837 
A1081143 AI335681 AA040443 A11 28067 A1678244 AA018303 AA157260 W80792 AI934590 AI096430 T54343 AI446350 
AA165195 AA780683 AA603631 AA0477B7 AA968580 AA912645 AW890504 AW026913 D56983 H52088 AA156121 
R30848 AW023036 AI590960 N67345 AI753225 AI753283 AI183768 AA147818 H89101 AI362141 H89205 AI14771 1 
AA321 129 AA668622 AA343479 AW069438 A1422376 AW629270 AA013413 AI221948 AA97Q605 N52335 H38366 
T91180 AA657841 AA017386 AA152227 AA187593 AI913340 AI719313 AI969943 AI701271 AI004328 A1868348 N93659 
H65093 H25736 D57007 D56957 C00987 D61839 D56661 AI472137 Ai971002 D56971 BE048830 D57972 AI589286 
AI361055 AI361071 AI292223 AA155898 D57139 D57981 D57345 AI420034 D57332 057959 AA875933 R33493 N67558 
D5&3S3 AA188394 AA147966 At160640 AI363165 H4063B AA578137 AW950265 AA300343 At128999 H46584 AA917355 
N57820 AA320504 H51959 H25737 

BE246154 M31210 NM.001400 AA193392 NIVL016537 AF233365 AFQ22137 H27787 AA370448 R)5373 T27666 W21494 
AAD369D7 AI249966 N93476 F01623 AA304390 AA308808 

AW956580 AA886361 AI147670 At090115 AI168683 AA232645 H99504 AA374707 AA380875 AW139567 AI735132 
BE439385 AW629780 N28322 AA232789 AA232790 N73285 

M13509X54925NM_002421 1^16567 X05231 M15996 W39354 AA186634AA852324 AA187507AA081 149 AA1 86524 
AA187264 AA187361 AA386155 AA186973 AA374217 U78045 AA081230 AA188049 AA186393 W56827 AA852602 
AA157468 AA308204 AA186754 AA186808 AA082516 AA3D4334 AW376428 BE439384 AW376420 AA156273 T18504 
AA186521 W49496 AW084608 AA083575 AA372360 AW963590 AA132297 W47445 AA186376 AA157628 AW003999 
AI037890 AI858060 AI589010 AI743739 AI452673 AW304188 AW117854 BE439933 AA157416 AW778966 A1038497 
AA0B1006 AA100829 AA181048 C02231 T27821 W23980 AW954802 A1471432 AW801296 AW801289 AW8016D3 
AW801523 AW801292 AW801542 AW801601 AA181134 AI445147 AA191501 AA582862 N94407 A1147810 AA181880 
W49497 W52714 AA188249 AI932881 AI082493 AA503656 AA182582 AW801393 AA182830 AA181882 AA182826 
AI613182 N94510 W47343 AI085755 AI076956 AI918426 AA081208 AI282835 AA147528 A(081490 AI654536 AA181875 
AA081282 AA186389 C06085 AA083542 AI800644 AA157642 AA101069 AA157752 AA158121 AA143331 AA081283 
AA852603 AA188296 A1932880 AW449B28 AA187348 C02091 AA514656 AA082736 AA308786 AA143201 M16567 
AB037715 AI351347 AI375796 AI884765 AL121124 W01068 AI807275 T95240 R42807 AW515645 AI057314 AI033520 
AA057671 N70215 AA054215 AW204183 AA552149 T95130 AW796310 AI866520 AW275564 AW796308 AI637901 
AW197404 T78405 AA456232 AW206463 AA779800 AID52696 AA026744 AA454623 AW470729 R45490 AW77025B 
AI038393 AI290170 AA722734 AL121125 R41608 AI862414 AA838611 R45582 AI278083 BE466849 BE219944 
AA418030 BE041555 AA578572 T16528 AW006344 Z39782 A1244848 AW137344 AA707400 AI032028 BE540464 
A1094265 AI184281 AA931890 AW382744 AW382729 AW02D448 AW827237 AA431226 A1672059 AW772345 N70172 
AW022003 AI862704 H19344 R61511 AI080204 H16566AA432248 AI767980T1 6688 A1984342AI21 7478 AI767095 
Z38551 AI359566 AI361437 AI041000 R07033 H16608 H19054 R12874 R61567 N98368 BE221 199 Z42320 AA094554 
R07078 AW860885 AA418090 R41262 

AA370362 AA364110 AW959554 AW371737 AW3&2068 AW604716 AW604713 AA487827 AW371674 AA429137 
BE503321 T93570 W72803 AI093076 AA487977 AI241S62 BE439445 AW204065 R51635 AI802994 T10362 W68553 
AI866215 AW152154 AA700716 A1127443 R15624 AI537587 AA9531 10 D58024 AI52081 1 AA693670 AI453280 W763^ 
AW023g55AW022563 

NM.002205 X06256 M13918 BE070866 AW239485 AW996127 BE273894 BE272590 BE410252 R25975 T1 1786 T1 1787 
AA301142 AA30116S AW960506 BE272819 AA386086 T39391 AA285303 AA370580 D58585 T58668 AA156213 W24142 
AA343323 AW7g6067 AA151197 AA376121 R94782 AA302363 Hg0357 R82621 AA301677 H55997 AW796059 W92358 
AL0484S8 AA47119d AA301952 R46287 R82694 H03186 AA187706 R32562 R27094 R25947 R25320 AWg49809 H13505 
H7g049 R32403 H11213 R39710 H49765 H21142 H21006 AA417664 W52075 N56771 AA2B4240 N98556 N30907 
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AA707335 AW603781 AI340367 AI814584 AA524182 AA370076 AA418785 AA704082 AI806851 H25513 T56388 
AA419627 H03986 H20963 T58245 AI459715 AW973768 AI33409B A1693020 T63414 R82646 AW167251 H55998 
AI274916 AA778367 AI755253 At033667 AW083222 AA181979 R26865 AA661627 AA706329 AI798648 AA612799 
AI160180 AI274973 A1039264 AA301880 AI042429 AA307832 AI085688 AI278366 A1498890 AA303065 AI954844 
AA502380 AA156334 AA723480 AI803584 A1581026 AA304584 N51038 R94702 R69814 AW150962 AI570049 AA588807 
AA151198 T53400 A1567709 A1185326 AA309205 AW33B969 R53903 AA991891 AA301643 AI493337 AI026049 H25514 
AI741075 R28632 AW166445 A1333068 H49978 H91267 AA558193 AW079663 AA627380 AA807401 AI199956 AA6661 IB 
AI718216 AW193228 A1077745 A1500496 AI266059 AW080383 R0646B R26757 R32404 AA716599 W92322 AI077734 
A1270181 R46198 AI217540 AA304045 AA305421 AW074445 AM68256 AW089568 AW571605 BE162930 H41009 
AA578313 AW874497 AA181284 AA861947 T29451 D20841 T58618 AA418731 AI282500 AW081407 AA604560 
AA729855 AI262538 A1580225 

102915 2903 2 X07820 NM_002425 BE271570 AI263526 AW296143 AI829878 AI973162 AI085155 AA857496 AA709305 C02220 

134416 30694 1 X68264 NM_006500 AF089868 BE257461 BE275425 AW997154 AI902799 AI902803 M78206 AA085e91 AW392972 

AA325490 BE006161 AA349269 AA323568 AL042S48 AA191 148 AA187703 AA322791 AJ297452 T1 1625 AW366487 
AA303513 AA186961 AA173480 N28330 N28379 W40320 AA187118 H03695 AA4027Q9 BE407476 H06354 BE2765B9 
AA351284 AA379921 AL138060 BE410587 AA113094 AA340481 BE277483 R21191 R79518 N86170 AA320505 
AA296065 AW9519aO AA658897 AA650052 AA654304 AA191691 N26649 AW080983 AI265800 N72019 AI45345d 
AA092563 AA402310 AW39450 AI061054 AA302358 T71566 AA302D47 AA303432 N21289 H27357 AA303504 AI174583 
AW151762 AA181958 AW880618 AA630773 AI889539 AW901058 A13734Q5 AA341941 AA086217 AI675590 A1653936 
AA633570 AA987619 A1270656 N93847 N40689 AW517517 N20030 W95985 AA303955 H89170 AA309917 N21642 
AA373132 W38517 AI687806 W76182 AA101065 AA036916 N45635 AI744510 AI669803 A1039157 AI1 26355 AA634607 
AW131120 AW196838 AA190601 AA911 130 BE221320 N92355 AA036752 H03696 AA588873 AI458868 AI041818 
* AA090477 AI093248 AA304755 AL137942 AL044688 A1083709 AI1509B5 N88691 AA635675 AA594898 W94657 
AA182823 AW166205 F27886 R79246 F37329 AA565697 AI075739 AI088654 AI094287 AI204256 AA095203 T93020 
AA688298 AA057324 N23442 AA075411 AA305046 AI031688 A1191503AA1 11887 AA1 12264 N27929 AA1 87509 
AI375522 AW74006 H06297 AI826177 N48880 H28333 AA075490 R22809 W79542 AI055934 AA042901 AA173481 
AA301986 W74531 AI051747 AA187715 AI888888 AA993017 A1057530 T92954 N80227 AW273595 AI351260 AW170643 
AW292979 AA302605 AA302330 BE349495 AA328602 AA302361 AI470984 AA155943 AA155914 
105178 779^1 AA313825 AW960347 AF223468 NI^016613 AA186345 AA186508 AA081195 AA147972 AA346943 AW961667 

AA187222 AA187207 AW371052 AW449751 AW748803 AW391606 AW371047 AW371057 AW371085 AW362895 
AW371092 AW377556 BE010930 A1016882 AA247878 C04398 C05158 F11398 AA188315 H23385 R55086 H15346 
AA029106 AA2281 14 H17005 F08498 243376 AA095582 AA055186 AA463361 R15218 AA299132 AW103578 W21538 
AA428131 AA187115 AA157197 AA157167 AW371371 AA363562 AW965995 N55663 Z17878 AA228023 AI140342 
AA100927 AA496988 AA055917 A1089303 AW014967 AW090248 AW338371 AW131066 D62963 D79713 A1583950 
A)336781 AI500705 AI471485 AW090239 D79784 D61847 D62789 D61842 AI086327 AI273381 D61815 D63043 AI913548 
A1280560 AI510828 AA029996 C16343 C16513 AI075741 AW516308 AI804764 AA948068 AI356588 AW103452 
AW573063 Z39445 C16489 A1949870 F04712 AA147823 AWa26284 AI151538 AA081303 AA613890 AI251B65 AW086499 
AA992111 AI862091 A1373465 BE502094 AI922270 AA884288 AA157079 N56963 AW189145 AA428080 R55056 
AA884068 AW771716 AA186662 C16364 H15723 AI921181 AA156888 H17006 AA187490 AI400994 AA346942 H28533 
AW129047 R41656 H14636 AA995041 D58370 Z21131 D58186 AI383271 AA643977 D58044 AI934302 AW779425 
F09065 H14930 AA890693 H23274 

105263 178672 2 AW388633 AW378440 AW38B283 AW388339 AW38B333 AW388414 AW388413 AW388607 AW388453 AW3886B7 
AW388480 AW388591 AW388711 AW388511 AW388438 AW388570 AW388449 AI694363 AW23714S AI652991 
AI964041 AW366319 AW366321 AW96ig38 AW46921 1 AI634155 A)492186 AI624430 Ai677965 N26502 Aig63871 
AW378431 AW378421 AI015391 AW352126 N59336 AI352317 AW197113 N67998 AW778935 AI476054 A1206626 
R37116 R4021 1 AA227926 AA639698 R38073 AI001745 T32854 AI619649 AI423703 F10774 AW38861 5 T16595 H05894 

105330 182497 1 AW338625 R43226 R51640 AI307645 A1308100 AI085787 AI420357 AI692610 AA877160 AI953366 AA234743 

104764 90967 ? AI039243 R68234 AA025351 AA971063 A1537757 AA025362 R81636 T86650 

104865 102037 1 T79340 Ar742317 AW182676 AW451460 AI420964 R43284 AA088179 AW590886 AW269529 AA045187 AI521736 

AI827455 AA045136 AW271709 AI004344 AA639631 AA744417 AA744218 AA045357 AA045351 

106124 5454^.1 H93366 AI653547 AA336265 AW966175 BE566451 R71 178 A1630556 AA234331 N55039 AA305632 AW960431 R34044 

R32254 AW020970 AW451281 AW275041 AI636933 AI655640 AA423986 AA642466 AI684063 AI633876 A1624897 
AA814795 AW590328 A1889166 AW243541 AI439691 AW473445 A1475516 AA741228 All 27534 AA165143 AI074714 
A1654076 AA400674 AI560249 N50709 AW438621 Af806810 AI434579 AI308184 AA423987 A1141272 A1565586 
AI338440 AA219628 AI246643 AI985809 AA724260 AA633988 AI364172 AI798439 AI650801 R33503 AI435891 
AA903649 T96161 AA665538 AA219620 AI309962 AA400707 BE247066 R32178 A1275962 AA661602 AW003197 
BE466649 AA831198 AI620052 A1825387 AI634037 AI670978 AI670979 AI655092 R32304 AA828858 AI382428 
AW023660 AA262892 T26891 AW089917 T26926 R32227 

107385 6976 1 NWL005397 U97519 AW899329 A1902387 AA077792 AA078525 AW376607 AA077946 AA070415 BE208721 AW167958 

BE293050 BE208240 A1648698 AA101314 BE393348 BE305122 AA077591 BE274036 AA313687 BE392220 BE378954 
AA171461 AA464821 AW938242 AW938224 AW938243 AW938232 AA147953 N64294 AA205218 AW305065 AW517478 
AA307983 AA377023 BE563629 R99976 N80294 T87719 T87928 AA495849 AA486344 AA204938 AW370448 AA318242 
AW964384 H92423 W95317 BE378774 BE391156 AA349138 AA173095 AW513198 AA037672 AA148029 AA169726 
W04791 AA075508 BE382937 BE395034 AF139793 AA961734 rM8612 H64714 AW151251 A1565113 AI566881 
AW087370 AA631168 AA622014 AW513098 AI857810 AW152287 AI052596 AI983246 AA024856 AI912456 AI677938 
AW026403 AA972537 A1088497 AW999869 W94582 AI140166 AI160659 AI566868 AA101263 AW190390 AW166466 
AI401207 A1418156 AI625265 AI146298 AW008592 BE223020 N58926 AI308797 AA037673 AI935992 AI304706 
AA024939 AI216589 AI610423 AI354621 AI500677 AI679389 AI799310 N64508 AI128756 At679897 AW589535 
AA989333 A1500527 AA565479 AA913529 AI923295 F21691 AA989376 AI699064 AA902447 A1690910 AA772659 
AA204983 AI337895 R99975 H65205 AA340766 AI339441 AI913855 AA450293 AW192010 AA070416 N72401 A1371481 
A)247108 A1371261 At364987 AI280171 A)269104 AI868756AA909836 AA983840A1973271 AA913092AI868205 
AI1441 12 Alig0975 N58085 AI56663& N93405 AW150504 AW296846 AI687036 AA902g84 AI824460 AI625047 AA653148 
A161 1228 AW131922 AA862687 AA902519 C01732 AW796045 AL044660 

101192 15367 1 BE247295 AW068092 AU)41313 AA159244 NMJ)05415 L20B59 AL135570 W47073 AW516906 BE388271 BE408629 

W46972 BE293646 6E256647 AI075010 AL041095 AA285300 AL039560 AA368740 W26602 AA399344 AA039235 
W27631 AW834898 AW834914 R9339D AA378039 AV649660 T53674 N98824 AA399974 AW843378 AA368267 R08256 
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AV653575 R27900 N48215 AW366371 N45500 AV652967 AI889251 AI080457 N39021 AI738542 AW242849 AI857471 
AI859775 AI582830 R75850 N86564 AW341636 AI499006 A1887217 AW026594 AW182840 AA039313 AA831346 
AI393465 AW069210 AI743830 AA744243 AA401310 AW439758 AW088152 R93391 AA291379 AA225220 AW009358 
AI192879 AA291202 AI565089 AA225089 AA807688 A105205B AI341641 A1066625 AA333864 AA159147 A1923912 
5 R75851 AI761 143 AW768588 AA394195 AI2B8450 AW512564 AI452775 A1056520 AA468602 AA872566 AI434739 

AA291838 AI948523 AW768614 AI374753 AW068174 AA884908 AI199346 A)199347 W94946 AI159995 AA877642 
AI280646 AI307610 AA403310 R08205 AW182123 AI000999 R27808 AW026571 D20816 A1560350 T27667 AW960271 
AI174628 AI432042 AI424528 AA909562 T17342 A1783865 
109001 146370_3 AI056548 AW409843 AW263540 AA723669 AA909334 AA156120 AA157141 AA156125 AW409866 W19499 AA157229 
10 AW887435 
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TABLE 6: 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number . 

UnigenelD: Unigene number 

Unigene TiUe: Unigene gene title 

AUC1 : 70i» percentile of average intensity (Al) for probeset at each of 2,6.15,24,48. and 96 hour timepoints minus 70^ percentile At at 0 hrs, 
summed over 5 experiments. 

AUC2: AUCiySO*^ percentile of Ai for aorta, aortic valve, vein, and artery. 



Pkey 


Ex.Accn 


unigeneiu 


1 IntnofiATillA 
111119 clBI IUb 










nS.l JUD9U 


col S 


IUJ0 


Q 


327414 






preaicieu exon 




^n *K 


321911 


AF026944 




CO 1 S 




AO Q 


331578 


AI246482 




CO 1 9 


fi77 A 
Or / .*t 


lU.O 


332466 


AB0ia259 


no. 1 lo i*tu 


I^IAAHTIR nono nmrtiirt 
rMA\MUr lO ycilo piUUUM 




39.5 


313513 


AW298600 


nb. IH IO*»U 


PCTc XMosAht crnrtllar tn Q'^Q'^Hi tntorfom 

CO 1 0| Wcdfuy oirHifcti lo OD^Kjyj i iiucnciu 


'V>A 


^9 A 


320635 


N50617 




small nudear ribonucieoprotein polypept 


394.8 




326230 






predicted exon 


357.2 


^'^ 7 


313556 


AA628517 


ns. ilooU^ 




433.6 


Ic 


313665 


AW751201 




ESTs 


-83 


v.O 


324852 


AI380792 


nS.ioi) iU*t 


ESTs 


348.2 


ft 


314372 


AL040178 


HS. 14^00 J 


ESTs. Weakly similar to The KIAA0149 gen 


^9.2 


n K 
U.d 


311877 


AA08424e 


nS.o5j39 


G protein-coupled receptor 39 


-1309 




322262 


AA632012 


Hs. 168740 


ESTs 


-247.8 


1 


312173 


At821409 


onAA'74 

nS.ou4471 


ESTs, Highly similar to AF1 16865 1 hedge 


-1025.8 


A 

1 


319795 


AB037821 


nS. i4Ddoo 


protocadherin 10 


203.6 




313350 


AW591949 


HS.o795o 


ETL protein 


183.8 


10.4 


326759 






predicted exon 


1654.4 


1.2 


300318 


AW444502 


Lin OCCADO 


ESTs. Highly similar to AP1 16665 1 hedge 


-346 


4 

1 


313978 


AI870175 


HS. 13957 


ESTs 


576.6 




306840 


AI077477 


Ufa* 'iniM'i 

nS. 00/91^ 


EST 


56.4 


nA 


310272 AF216389 


Lie ^AAOVi 


semaphorin Rs, short form 


-127.6 


n 
U 


315044 


BE547674 


Lie OnAAtSQ 


ESTs 


-102.6 


A 

u 


321325 


AB033100 


nS.ouUD4b 


KIAA protein (similar to mouse paladin) 


1080.6 


•A ft 
■1.0 


303251 


AF240635 


nS.115o97 


protocadherin 12 


1270.8 


D.O 


302378 


AL109712 


Hs.296506 


Homo sapiens mRNA fuD length insert cDN 


915.8 


4fl ft 
19.0 


315060 AA551104 


Hs.189048 


ESTs. (Moderately simDar to AlAJCHUMANi 1236.8 


A 0 


332048 


AW337575 


Hs.201591 


ESTs 


522.6 


A 7 


337214 






predicted exon 


269 


OR Q 


311598 


AW023595 


Hs.232048 


ESTs 


796.4 


Ofl 0 


304782 


AA582081 




gb:nn32h08.s1 NCi.CGAP.Gas1 Homo sapiens 


OIO.4 lU.9 


312802 


AA644669 


Hs.193042 


ESTs 


349.6 


/ .D 


302680 


AW1 92334 


Hs.38218 


ESTs 


638.6 


ft*) Q 


317452 


AA972965 


Hs.1 35568 


ESTs 


360.8 


'\R 1 
Ow. 1 


318558 


AW402677 


Hs.146381 


RNA binding motif protein. X chromosome 


700.2 


O.D 


312149 


T90309 


Hs.269651 


ESTs 


274.2 


7 *\ 


319267 


F11802 


Hs.6818 


ESTs 


238.2 


23.8 


321510 


H75391 


Hs.255748 


ESTs 


231.8 


9<i 0 


326198 






predicted exon 


581.6 




315730 


H25899 


Hs.201591 


ESTs 


281.6 


Q 7 


310442 


AW072215 


Hs.208470 


ESTs 


-213 


0.3 


331237 


W87874 


Hs.25277 


hypothetical protein nJ21065 


285 


0.5 


300469 


BE301708 


Hs.233955 


hypothetical protein FU20401 


26.6 


0.3 


338316 






predicted exon 


1494.2 


347 


330968 


R44557 


Hs.23748 


ESTs 


975.8 


1.8 


331019 NI^006033Hs.65370 


lipase, endotheiiat 


201.2 


0.9 


331261 


BE539976 


Hs. 103305 


Homo sapiens mRNA; cDNA DKFZp434B0425 (f 


478.61.3 


301822 


X17033 


Hs.271986 


Integrin, alpha 2 (CD49B. alpha 2 subunl 


356.2 


1.7 


325544 






predicted exon 


1014.6 


9.4 


328700 






predicted exon 


627.4 


627 


322882 AW248508 Hs.279727 


Homo sapiens cDNAFU14035fis. done HE 84.8 


5.7 


336034 






predicted exon 


782.6 


78.3 


316580 AA938198 Hs.146123 


hypotiietical protein FLJ12972 


746.4 


13.8 


309931 


AW341683 




gb:hd13d01j(1 Soares.NFU.T.GBC.S1 Homos 


134.813.5 


330692 R39288 


Hs.6702 


ESTs 


137 


137 


319962 


H06350 


Hs.135056 


Human DMA sequence from done RP5-850E9 


14.6 0.5 


338033 






predided exon 


540.6 


14 


314943 


y00272 


Hs.184572 


celt drvision cyde 2, G1 to S and G2 to 


-494.8 


1 


332640 


BE568452 


Hs.5101 


protein regulator of ^kinesis 1 


-600 


1 


338158 






predicted exon 


311.2 


31.1 


327036 






predicted exon 


351.8 


35.2 
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302655 
327568 
324801 
317850 
322818 
324626 
317224 
310955 
315240 
338388 
338442 
318617 
338645 
313135 
324716 
330305 
308248 
308886 
315622 
323675 
312164 
300376 
317478 
317559 
317207 
334834 
320925 
303289 
328548 
317108 
318013 
314299 
317702 
316094 
323706 
325843 
316012 
309687 
323329 
312853 
313070 
314096 
338728 
316609 



AJ227892 Hs.146274 



AW770553 

AI681545 

AW043782 

A1685464 

X73608 

AI476732 

R38772 



Hs.14553 

Hs.152982 

Hs.293616 

Hs.292638 

Hs.93029 

Hs.263912 

Hs.172619 



AW247252 Hs75514 



N58907 
BE169746 

AIS60919 

A)833240 

A1796144 

R43240 

T91980 

Z45270 

At343569 

AW452344 

AI873348 

D62892 
ALl 21460 

AA884000 

All881d3 

AW382682 

AW173339 

AW975920 

AA377578 

AA764950 

AW236154 

AL134744 

W05086 

AI422023 

AW977642 



Hs.162430 
Hs.12504 



Hs.258188 
Hs.272168 
Hs^1074 
Hs.235873 
Hs.107000 
Hs.129977 
Hs.214505 



Hs.272673 

Hs.8173 

Hs.144078 

Hs.154840 

Hs.135665 

Hs.283361 

Hs.65234 

Hs.1 19898 

Hs.77385 

Hs.10852 

Hs.1 14256 

Hs.161338 

Hs.291742 



312642 
339236 
317058 
311137 
310178 
320745 
317336 
309871 
302038 
332237 
312362 
331558 
316215 
336059 
302790 
328416 
304229 
331606 
338962 
317959 
336228 
313534 
317404 
311943 
314680 
331484 
338116 
329863 
315555 
317039 
331138 



AW292520 Hs.122082 

AA888220 

AW052128 



AI217713 

AW207582 

AI936450 

H51696 

AW014637 

AW300366 

AC004076 

N52883 

AW015994 

N62401 

AI684535 

AJ245245 

AK000149 
AW273285 



HS.147S88 

Hs.196042 

Hs.147482 

Hs.89278 

Hs.130212 

Hs.1 29709 
Hs.102676 

Hs.48531 
Hs.200811 



Hs.29493 
Hs.50802 



AI204202 Hs.130264 



AW072916 

AI806867 

AI469911 

AI247425 

N29696 



AW452866 
AA868583 
R63616 



HS78743 

Hs.126594 

Hs.26498 

Hs.152182 

Hs.44076 



Hs.239107 
Hs.126153 
Hs.28445 



ESTs 180.2 18 

predicted exon 229 22.9 

sterol 0-acyltransferase {acyl-Coenzyme 1 61 .2 1 6.1 

hypothetical protein FU131 17 -690 1 

ESTs 126.4 4.5 

ESTs 170.2 17 

sparc/osteonectin, cwcv and kazal-like d -80 0 

ESTs 466.8 467 

KIAA1 105 protein 277 277 

predicted exon 267.6 26.8 

predicted exon 256 25.6 

nucleoside phosphorylase 1247.8 24.2 

predicted exon 206 20.6 

ESTs 204.8 20.5 

hypotheUcal protein DKFZp761D081 203.6 20.4 

predicted exon 199.8 20 

gb:tq4l9l0.x1 NCLCGAP.Ull Homo sapiens 199.4 19.9 

gb3t76d10j(1 Barstead colon HPLRB7 Homo 198.2 19.8 

Homo sapiens cDNAFU11674rB, done HE 191.2 19.1 

tumordlfferentially expressed 1 189.2 18.9 

ESTs 187.6 18.8 

hypothetical protein RJ22672 271.6 187 

Homo sapiens mRNA for WDC146, complete c 187 187 

ESTs 184.2 18.4 

ESTs 182.8 18.3 

predicted exon 178.8 17.9 

gb:HUM337C07B Clontech human aorta poiyA 177.2 177 

hypothetical protein R.J20508 316.4 17.6 

predicted exon 174.6 17.5 

hypothetical protein FU10803 172.4 17.2 

ESTs 326 17,2 

ESTs 170.8 17.1 

ESTs 169.8 17 

ESTs 169.4 16.9 

hypothetical protein fU20596 169.2 16.9 

predicted exon 321.4 16.9 

ESTs 1047.2 16.9 

myosin,lightpolypeptide6»allcali,smoothmu 168.2 16.8 

ESTs 168 16.8 

ESTs 167.4 167 

ESTs 298.6 16.6 

ESTs 165.6 16.6 

predicted exon 165.4 16.5 

ESTs 165 16.5 

gb:oj15h01.s1 NCLCGAP_Kid5 Homo sapiens 164.6 16.5 

gbnmc26c02j(1NCLCGAP_,Kid11Homosaplen 164 16.4 

predicted exon 163.6 16.4 

ESTs 161.8 16.2 

ESTs 582.2 162 

ESTs 161.2 16.1 

hypothetical protein FU11186 161 16.1 

ESTs 160 16 

gb:xs63b05j(1 NCLC6AP.Kid11 Homo sapten 159.8 16 

Homo sapiens chromosome 19, cosmM R3021 159 15.9 

EST 159 15.9 

gb:UI-H-BI0p^bh-g-09^l.s1 NCLCGAP.S 158.6 15.9 

EST 158.6 15.9 

ESTs 158.4 15.8 

predicted exon 157.4 15,7 

gb:Homo sapiens mRNA for immunoglobulin 155.8 1 5.6 

predicted exon 153.8 15.4 

hypothetical protein FU20142 153.6 15.4 

ESTs 153 153 

predicted exon 664.4 15.3 

ESTs 152.6 15.3 

predicted exon 152.4 ^52 

zinc finger protein 131 (clone pHZ-10) 152.2 15.2 

ESTs 152.2 15.2 

hypothetical protein FU21657 152 15.2 

ESTs 151.4 15.1 

EST 151.2 15.1 

predicted exon 151.2 15.1 

predicted exon 150.6 15.1 

ESTs 149.6 15 

ESTs 149.6 15 

ESTs 149.6 15 
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316561 AI917222 Hs.121655 ESTs 149.4 14.9 

328695 predicted exon 149.2 14.9 

302282 BE396283 Hs.173987 eukaryoUc translation inifiation factor 148.4 14.8 

318781 F118Q2 Hs.6818 ESTs 146.2 14.8 

5 323709 AW297246 Hs.288546 Homo sapiens CDNARJ14190 lis, done KT 148 14.8 

310790 AW192063 Hs.248865 ESTs . 147.6 14.8 

316833 AW292614 Hs.124367 ESTs 147.8 14.8 

323176 NM-007350HS.82101 pleckstfin homology-nke domain, family 229 14.8 

324188 AW274439 Hs.252709 ESTs 147.6 14,8 

10 317441 AA922798 Hs.196583 ESTs 147.4 147 

317584 AI825890 Hs.220513 ESTs 146.8 147 

321798 AI308206 Hs.181959 ESTs 146.8 147 

304363 AA206O45 gb:zq77f05.s1 Stratagene hNT neuron (937 146.6 147 

313952 F20956 gb:HSPD05390 HM3 Homo sapiens cDNA clone 146.6147 

15 301909 AI702609 Hs,15713 ESTs 263.8 147 

309198 AI904895 Hs.g614 nucleophosmin (nucleolar phosphoprotein 146.2 14.6 

321860 N47474 Hs.212631 ESTs 146.2 14.6 

330187 predicted exon 146 14.6 

323042 AA463571 Hs.172550 polypyrimkline tract binding protein (he 145.6 14.6 

20 313636 AA262397 Hs.201366 ESTs 145.2 14.5 

302437 AB024729 Hs.227473 UDP-N-acetylglucosamine:a-1.3-D-mannosid 145 14.5 

31S197 A1473096 Hs.133403 ESTs 144.8 14.5 

302749 IVI16951 gb:Human Ig mu-chainmRI^ VDJ4-region»5144.6 14.5 

322357 AI734258 Hs.245367 ESTs. Wealdy similar to ALU1.HUIV1AN ALUS 144.614.5 

25 300391 AI927371 Hs.288839 hypothetical protein FU12178 144.4 14.4 

326077 predicted exon 144.4 14.4 

302004 Y18264 Hs. 123094 sal (DrDSOphila)-Iike 1 * 144 14.4 

320668 AA805666 Hs.146217 Homo sapiens cONA: FU23077 Os. done L 144 14.4 

331212 T8B693 Hs.226410 ESTs 144 14.4 

30 311268 AI959727 Hs.231859 ESTs 143.2 14.3 

305159 AA659166 Hs.275668 EST.WeaklyslmilartoEF1D.HUMANELONGATIONF 143 14.3 

304510 AA457391 Hs.1 19122 ribosoma!piioteinL13a 14^8 14.3 

320852 AA772920 Hs.303527 ESTs 142.8 143 

330854 AW291944 Hs.122139 ESTs 142.8 14.3 

35 318275 AW449952 Hs.190125 bas{c4ierix-loop-herD(-PAS protein 14Z6 143 

314992 Ai824879 Hs.21 1286 ESTs, Weakly similar to 1207289AreveiSB 142.2 14.2 

322631 AA001697 Hs.293565 ESTs. Wealc^ similar to putative pi 50 [H 142.2 14.2 

332283 R40855 H5.100839 EST 142 14.2 

302894 AA719572 Hs.274441 Homo sapiens mRNA; cDNA DKFZp434t4011 (fr 141.214.1 

40 301808 R35391 Hs.252831 reticulonS 141 14.1 

318608 AI204491 Hs.151502 ESTs 141 14.1 

316499 AW292947 Hs.122872 ESTs 140.8 14.1 

317011 AI248760 Hs.150276 ESTs 140.8 14.1 

321840 N4560D Hs.46534 Homo sapiens mRNA; cDNA DKFZp434P0714 (f 140.814.1 

45 327365 predicted exon 140.8 14.1 

331264 AA278B98 Hs.225979 hypothetical protein similar to small G 140.8 14.1 

324545 AW501944 Hs.127243 Homo sapiens mRNA for KIAA1724 protein. 140.4 14 

312986 AA211586 gb:zn56d05.s1 Stratagene musde 937209 H 140.2 14 

316053 AA825814 Hs.149065 ESTs 140.2 14 

50 330723 BE247449 Hs.31082 hypothetical protein FU10525 140.2 14 

304876 AA595765 gb:nj28g06.s1 NCI,CGAPJ^1 Homo sapiens 139.8 14 

311379 AW134766 Hs.202450 ESTs 139.8 14 

318265 AW019873 Hs.146840 ESTs 139.8 14 

324137 AA393127 Hs.222762 ESTs 139.8 14 

55 328262 predicted exon 139.6 14 

322349 AK001279 Hs.18017l Homo sapiens cD^^AFU10417fis. done NT 139.4 13.9 

323504 AA280223 Hs.130865 ESTs 139.4 13.9 

304261 AA059387 gb:2f66d01.s1 Scares retina N2b4HR Homo 139.2 13.9 

310489 AW451493 Hs.235516 hypothetical protein PR02955 139.2 13.9 

60 335946 predicted exon 139^ 13.9 

318155 AI041546 Hs.132133 ESTs 1383 13.9 

313796 AI797169 Hs.208486 ESTs 138.6 13.9 

333977 predicted exon 138.6 13.9 

324845 AW989635 Hs.283718 ESTs 138.2 133 

65 331139 R65706 gb.7ll6gi2.sl Soares placenta Nb2HP Homo 138.2 133 

331131 R54797 gb7g87b07^1 Soares Infent brain 1 NIB H 6693 133 

321250 H58539 Hs.151692 ESTs 138 133 

312498 AA668782 Hs.191284 ESTs. Wealdy similar to ALUIJ^UMAN ALUS 1373133 

331252 W52470 Hs.34578 alpha2.3-slalyltransferase 1373 133 

70 337407 predicted exon 1373 133 

303973 AW512014 gbja68a03.x1 NCtjCGAPJLym12 Homo sapien 137.4137 

314582 AA412258 Hs.1 88817 ESTs 137.4 137 

327373 predicted exon 137.2 137 

323367 AA234591 Hs.304123 ESTs 1363 137 

75 316207 AA832065 Hs.120260 ESTs 136.4 13.6 

315231 AA705809 Hs.1 19922 ESTs 136.2 133 
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318592 


T39310 Hs.1 139 cold shock domain protein A 


136.2 


13.6 


320906 


AW969706 Hs.293332 ESTs 


136.2 


13.6 


328937 


pr^ictedexon 


136.2 


13.6 


329073 


predicted exon 


136.2 


13.6 


318231 


AV659082 Hs.134228 ESTs 


136 


13.6 


311992 


AL360200 Hs.1 14145 ESTs 


135.8 


13.6 


316497 


AA766457 Hs.l36849 ESTs 


135.8 


13.6 


317677 


AA96a594 Hs.127868 ESTs 


135.8 


13.6 


321680 


W02848 Hs.93704 ESTs 


135.8 


13.6 


326080 


predicted exon 


135.8 


13.6 


330938 


AF036943 Hs.172619 KIAA1 106 protein 


135.8 


13.6 


306573 


AL134878 Hs.119500 ribosomal protein, large P2 


135.6 


13.6 


307383 


AI223207 Hs.147888 EST ' 


135.6 


13.6 


311114 


AW449382 Hs.195297 ESTs 


135.6 


13.6 


320579 


R15138 Hs.165570 Homo sapiens done 25052 mRNA sequence 135 


13.5 


301328 


AA884104 H5.125546 ESTs 


134,8 


13.5 


312063 


N58198 Hs.162898 ESTs 


134.8 


13.5 


323036 


H09604 H5.13268 ESTs 


134.6 


13.5 


332776 


AF241850 Hs.151428 ret finger protein 2 


134.4 


13.4 


332494 


AA282330 HS.14S668 ESTs 


134.2 


13.4 


334376 


predicted exon 


1342 


13.4 


313264 


N93416 Hs.118228 ESTs 


133.6 


13.4 


313669 


AA351109 Hs.5437 Taxi (human T-cell leukemia virus type 1 


133.2 


13.3 


312083 


T87398 Hs^816 ESTs 


132.6 


13.3 


319354 


AA993807 Hs.167367 ESTs 


132.6 


13.3 


307414 


AI242106 gb:qh92a02jc1 Soares_NFUJ.GBC.S1 Homo s 


132.213.2 


312771 


AA018515 Hs.264482 Apg12 (autdphagy 12. S. cerevisiae)-Bke 


131.8 


13.2 


313004 


A1274963 Hs.145900 ESTs 


131.2 


13.1 


300995 


AW510641 Hs.258018 ESTs 


220.6 


13 


319323 


F12650 Hs.13287 ESTs 


125.4 


12.5 


329451 


predicted exon 


123.4 


12.3 


337603 


predicted exon 


572 


12.2 


312480 


R68651 Hs.1 44997 ESTs 


121.4 


12.1 


324934 


AW452051 Hs.147546 ESTs 


119,4 


11.9 


320723 


BE178025 Hs.7942 tiypothetical protein FLJ20080 


117 


11.7 


318188 


AI792566 gb:qi74f02,y5 NCLCGAP_Ov26 Homo sapiens 


116.6117 


320873 


AF238869 Hs.283955 Homo sapiens clone GLSH-2 similar to gli 


112.8 


11.3 


331005 


BE003191 Hs.1 19555 ESTs 


112.6 


11.3 


304969 


AA614406 gb:np46f05.s1 NCI_CGAP_Br1 1 Homo sapiens 


112.411.2 


319799 


AI139253 Hs.227767 zinc finger protein 41 


111.2 


11.1 


302610 


AA347945 Hs.256024 ESTs 


111 


11.1 


309485 


AW130320 Hs.108124 ribosoma|proteinS4,X-iinl(ed 


111 


11.1 


311880 


AW419225 Hs.256247 ESTs 


110.2 


11 


313981 


AW452334 Hs.128148 ESTs 


110.2 


11 


322442 


W49701 Hs.29667 ESTs 


109.4 


10,9 


315099 


AA806536 Hs.2gi841 ESTs 


109 


10.9 


304793 


AA583264 Hs.182g79 rib05omatproteinL12 


108.8 


10.9 


330815 


AA019211 Hs.236463 KIAA1 238 protein 


108.8 


10.9 


304044 


T81656 Hs.252259 ribosomal protein S3 


714.8 


10.8 


325222 


predicted exon 


135 


10.8 


325889 


predicted exon 


814.6 


10.8 


321447 


AW891130 Hs.38173 ESTs 


107.8 


10.8 


302990 


AA496212 Hs.180182 ESTs 


106.2 


10.6 


308106 


AI476803 gb:tj77e12j(1 Soares_NSF_F8_9W.OT_PA.P„S 


270.6 10.6 


310536 


A1301041 Hs.150174 ESTs 


106 


10.6 


315257 


AW157431 Hs.248941 ESTs 


233 


10.6 


318787 


242313 Hs.22657 ESTs 


105.8 


10.6 


312306 


AI927226 Hs.l75610 ESTs 


105.2 


10.5 


326788 


predicted exon 


104.4 


10.4 


312234 


AA830640 Hs.208934 ESTs 


104 


10.4 


314482 


AW085525 Hs.134182 ESTs 


234 


10.4 


323597 


AI185693 Hs.135119 ESTs 


102.4 


10.2 


302623 


AW838724 Hs.194110 hypothetical protein PRO2730 


162.4 


10.2 


323594 


AI791531 Hs.129993 ESTs 


101 


10.1 


324315 


N55761 Hs.194718 zinc finger protein 265 


100.2 


10 


314217 


AA255465 Hs.188725 ESTs 


99.2 


9.9 


320932 


AA554913 Hs.162297 ESTs 


98.2 


9.8 


327876 


predicted exon 


98.2 


9.8 


319736 


R17424 Hs.6650 vacuolar protein sorting 458 (yeast homo 


98 


9.8 


327747 


predicted exon 


97.6 


9.8 


327844 


predicted exon 


97.4 


9.7 


318200 


Ai061192 Hs.166517 ESTs 


97.2 


9.7 


329414 


predicted exon 


97.2 


9.7 


316296 


AI089867 Hs^70713 ESTs 


121.4 


.9.7 


307010 


AI140014 gb:qa6Sf09j(1 Soares.fetaLhearLNbHH19W295 


9.7 


319792 


AI138635 Hs^968 ESTs 


385.4 


9.6 
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305671 AA811688 Hs.62113 dUTPpyrophosphatase 96 9.6 

329440 predicted exon 93.8 9.4 

310381 AI263059 Hs.145594 ESTs 93.4 9.3 

318824 F06771 Hs.27226 ESTs 93.4 9.3 

5 328957 predicted exon 92.2 9.2 

318804 Z42549 HS.160B93 ESTs 92 9.2 

330836 AA055611 Hs.226568 ESTs, Moderately similar to ALU4^HUMAN A 92 9.2 

324592 AW752437 Hs.325708 ESTs 91.8 9.2 

311820 AW274545 Hs.254333 ESTs 91.4 9.1 

10 321614 H86161 gb:ys94b01.r1 SoaresrefinaN2b5HRHomo 91 9.1 

330306 predicted exon 91 9.1 

303096 AL080276 Hs.268562 regutatorof G-proteinsignairing 17 90 9 

313275 A1027604 Hs.159650 ESTs 110.4 8.8 

302593 H54855 Ks.36958 ESTs 88 8.8 

15 321421 BE465115 Hs.171688 ESTs 86.2 8.6 

330832 AI13353D Hs.62930 ESTs 456.4 8.6 

311847 AW301807 Hs.297260 ESTs 86 8.6 

322036 BE002723 Hs.30ig05 Homo sapiens cDNAFU14080ris. done HE 145.8 8.6 
328688 predicted exon 85.6 . 8.6 

20 325251 predicted exon 85.4 8.5 

329088 predicted exon 85.4 8.5 

322524 W79027 Hs.271762 ESTs 84 8.4 

337953 predicted exon 451 8.3 

323529 AA284397 Hs.201485 Homo sapiens clone FLC0664 PR02866 mRNA, 82.6 8.3 

25 307041 AI144243 gb:qb85b12.x1 Scares JetaLhearLNi)HH19W 306.88.2 

316285 AI332454 Hs.158412 ESTs 81.4 8.1 

312021 AA759263 Hs.14041 ESTs 81 8.1 

329350 predicted exon 81 8.1 

326169 predicted exon 80.4 8 

30 338038 predicted exon 1024.2 7.9 

312549 A1214510 Hs.146304 ESTs 77.4 7.7 

312542 D60076 gb:HUM084E10A Clontech human fetal brain 76.8 7.7 

320992 AB026891 Hs.225972 solute carrier family 7, (cationic amino 76 7.6 

318596 AI470235 Hs.172698 EST 150.6 7.5 

35 315650 AA649042 Hs.269615 ESTs 73.4 7.3 

324328 AA447276 Hs.292020 ESTs 210.4 7.1 

332622 R10674 Hs.128856 CSRl protein 70.2 7 

328229 predicted exon 69.4 6.9 

319110 T75260 Hs.9d321 hypothetical protein FU141 03 68.6 6.9 

40 316133 AI187742 Hs.125562 ESTs 308.6 6,9 

303992 AW515800 gb:hd88g01jc1 NCI_C6AP_GC6 Homo sapiens 67,8 6.8 

322675 AA017656 Hs.146580 enolase 2. (gamma, neuronal) 377.2 6.7 

325753 predicted exon 105,2 6.6 

312539 AI004377 Hs.200360 Homo sapiens cDNAFU13027fis. done NT 92.2 6.4 

45 302592 AA294921 Hs.250811 v-ral simian leukemia viral oncogene horn 361.6 6.3 

314578 AA410183 Hs.137475 ESTs 201.6 6.1 

335986 predicted exon 108.6 6 

321478 AW402593 Hs.123253 hypothetical protein FU22009 528 6 

305192 AA666019 gb:ag44a04.s1 Jia bone manow stroma Horn 58.6 5.9 

50 304275 AA070605 gb:zm53h09.s1 Stratagene fibroblast (937 78.6 5.6 

302779 AJ235667 gbiHomo sapiens mRNA for immunoglobulin 278.8 5.5 

301976 T97905 Hs.77256 enhancer of zeste(Drosophila)homok^ 2 479.2 5.4 

316021 AW293399 Hs.144904 nudear receptor co-repressor1 792.4 5.3 

320802 BE336699 Hs.185055 BENE protein 2423.8 5.3 

55 317282 AI733112 Hs.176101 ESTs 523.2 5.1 

316827 AI380429 Hs.172445 ESTs 578 5.1 

303190 BE280787 Hs.1607g hypothetical protein FU10233 223 5.1 

315587 A1268399 Hs.140489 ESTs 136.2 5 

333122 predicted exon 399 5 

60 310214 AI220072 Hs.l65893 ESTs 234.4 4.9 

320089 D43945 Hs. 113274 transcriptfon factor EC 68 4.9 

309328 AW024348 Hs.233191 EST, Weakly similar to A27217 glucose tr 258.8 4.8 

318971 Z44067 Hs.10957 ESTs 376.6 4.8 

327220 predicted exon 47.4 4.7 

65 315757 AW014605 Hs.179872 ESTs 177.4 4,7 

320730 R68869 Hs.1 51072 ESTs 205.2 4.6 

313339 A1682536 Hs.163495 Homo sapiens cOIMA FU 13608 fis. done PL 260 4.5 

318634 T49598 Hs.156832 ESTs 475.2 4.5 

320955 AW820035 Hs.27867g a dlsintegrin and metatloproteinase dome 388.6 4.4 

70 306605 AI000497 Hs.119500 ribosomalproteinJargeP2 ,81.6 4,4 

309349 AW051913 gb.'wx24a09.x1 NCi^CGAP^Kidl 1 Homo sapien 102.4 4.3 

306004 AA889992 Hs.2186 eukaryotictranslattonelongationfactorlga 451.2 4,2 

330020 predtetedexon 61.2 4.1 

302308 AW327279 Hs.91379 ribosomal protein 1^6 342 3.9 

75 314648 AW979268 gb:EST391378 MAGE resequences. MA6P Homo 56.4 3.8 

315131 AI753709 Hs.152484 ESTs 130.4 3.7 
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313690 A1493591 Hs7B146 platelet/endothelial cell adhesion molec 3179.6 3.6 

333585 predicted axon 175.4 3.5 

312911 H93366 Hs.7567 Homo sapiens cDNA: FU21962 fis, done H 219 3.5 

322966 AA633669 Hs.235920 Homo sapiens cell recognition molecule C 3507 3.4 

5 312492 R71072 Hs.191269 ESTs 322.8 3 

318988 Z44203 Hs.26418 ESTs - 25 2.5 

332363 AI123705 Hs.106932 ESTs 773.4 2.5 

324181 AI025476 Hs.131628 ESTs 634.8 2.4 

311717 AW205369 Hs.312830 ESTs 54.2 2.4 

10 321342 AA127984 Hs.222024 transcription factor BMAL2 23.4 2.3 

308852 AI829848 Hs.1 82937 peptidylprolylisomeraseA(cyclophillnA) 92 2.3 

331466 AA373210 Hs.43047 Homo sapiens cDNA FU13585 fis. done PL 494 2.3 

320279 AB033062 Hs.134970 DKF2P434N1 78 protein 76.2 2.2 

322221 N24236 Hs.179662 nudeosome assembly protein l-like 1 253.2 2.1 

15 302925 AL137449 Hs.126666 homeoboxB4 136.6 2.1 

331384 AB041035 Hs.93847 NADPH oxidase 4 720 1,8 

' 300938 AA514416 Hs.152320 ESTs, Weakly similar to 1605244A eiythro 27 1.8 

312695 AW196663 Hs.200242 ESTs 303.8 1.6 

320223 W35132 Hs.267442 ESTs 189 1.5 

20 332743 AW247977 Hs.87595 translocase of inner mitochondrial membr 14.4 1.4 

331039 AW378685 Ms.18625 Mitochondrial Acyl-CoA Thioesterase 529.8 1.4 

333123 predicted exon 396.2 1.4 

328455 predicted exon 91.8 1.3 

334458 predicted exon 406.4 1.3 

25 313478 AA643008 Hs.192775 ESTs 413.4 1.1 

309899 AW338564 Hs.217493 annexinA2 ^0.8 1 

311735 AW294416 Hs.144687 Homo sapiens cDNA FU 12981 fis. done NT -62,8 1 

312953 NM_001992Hs.128087 coagulation factor II (thrombin) recepto -73.6 1 

313055 AW367295 Hs.241175 ESTs -43.8 1 

30 313291 AI267970 Hs.150614 ESTs, Weakly similar to ALU4_HUMAN ALU S -63 1 

315059 AW275110 Hs.271106 ESTs -67 1 

322284 AI792140 Hs.49265 ESTs ^95.2 1 

322450 AL121278 Hs.25144 ESTs -1.6 1 

324803 AW975183 Hs.292663 ESTs 4.4 1 

35 331495 AW970939 Hs.291039 ESTs -282.8 1 

333610 predicted exon -152.6 1 

335093 predicted exon -23.2 1 

339403 predicted exon -331.2 1 

302820 X04588 Hs.85844 neurotrophic tyrosine kinase, receptor, 591.2 1 

40 302270 R56151 Hs.93589 Homo sapiens mRNA;cDNA DKFZp564B1 162 (f 276.61 

323755 AW300094 Hs,136252 ESTs 135 0.9 

326946 predicted exon 727.4 0.9 

315343 BE144306 Hs.179891 ESTs. Weakly similar to P4HA_HUMAN PROLY 122.80.9 

311168 AK001270 Hs.196086 hypothetical protein FU 10408 304 0.9 

45 329732 predicted exon 109J! 0.9 

321415 BE621807 Hs.3337 transmembrane 4 superfamHy member 1 414.8 0.7 

333121 predated exon 87.8 0.7 

333120 predicted exon 379.8 0.7 

330392 AW797956 Hs.75748 proteasome (prosome. macropain) subunit 589.2 . 0.7 

50 314711 AA769365 Hs.126058 ESTs -87 0.6 

330865 6E409857 Hs.69499 hypothetical protein 347.4 0.6 

333169 predicted exon -1182 0.6 

335095 predicted exon 106.4 0.6 

335815 predated exon -156 0.6 

55 330232 predicted exon 102.6 0.6 

330823 AA031565 Hs.221255 ESTs, Moderately simOar to ALU5_HUMAN A -62 0.5 

331704 F04225 Hs.66032 ESTs -14.6 0.5 

302642 NM_016428Hs.130719 NESHprotebi 267.6 0.5 

304484 AA432067 Hs.258373 ESTs 85 0.5 

60 310230 AK000377 Hs.144640 homoiog of mouse C2PA -70 0.4 

301531 A(077462 Hs.134084 ESTs -195.4 0.4 

306337 AA954221 Hs.73742 ribosomalprotein.large.PO ^.4 0.4 

331327 N46436 Hs.1 09221 ESTs -392 0.4 

332961 predicted exon -5.6 0.4 

65 322796 W31178 Hs.154140 Homo sapiens ovary-specific acidic prote -880.6 0.3 

326857 predicted exon 55.2 0.3 

316342 AA743935 Hs.202329 ESTs 43.4 0.3 

331263 AW780192 Hs.267596 ESTs -180.4 0.3 

335987 predicted exon -134 0.3 

70 311923 T60843 Hs.189679 ESTs 12.2 0.3 

310522 AW134529 Hs.244647 ESTs -187.8 0.3 

315363 AA759190 Hs.121454 ESTs. Weakly similar to otfactoiy recept 80 0.3 

302032 NM.001992HS.128087 coagulation factor II (thrombin) recepto -877 0.3 

313140 BE265133 Hs.217493 annexinA2 95.4 0.3 

75 310860 AW015920 Hs.161359 ESTs -239 0.3 

317899 AI952430 Hs.150614 ESTs. Weakly similar to ALU4.HUMAN ALUS -715.2 0.3 



170 

3NSDOCID: <WO 02070492A2_L> 



wo 02/079492 



PCT/US02/04915 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



328520 
302406 
311804 
315065 
314129 
335697 
335989 
320608 
329745 
313628 
334616 
308820 
320416 
335211 
323629 
331420 
315984 
332833 
332607 
313467 
323333 
330775 
333188 
332079 
322724 
303652 
303131 
320716 
300454 
312757 
312391 
308877 
311275 
302363 
321717 
302638 
306352 
313798 
320807 
320931 
332450 
332535 
335990 
330746 
316820 
337429 
331192 
330609 
323593 
302704 
330534 
332374 
333221 
335988 
330574 
312052 
319568 
337113 
335149 



NM^012099Hs^11956 
AI866921 Hs.203349 
AK001122 Hs.105859 
AA228366 Hs.1 15122 



AW867943 Hs.127216 

AW419069 Hs.209670 

A1821267 Hs.207243 
AI026984 Hs.293662 

AA375957 Hs.6682 
AW452904 

AI015862 H5.131793 

NM.002314HS.36566 
AA004879 Hs.187820 
AV651680 Hs.208558 
AW247020 Hs.250747 



predicted exon 

CD3-epsilon-as5ociated protein; antisens 
Homo sapiens cDNA FLJ12149 fis. clone MA 
hypothetical protein FU10260 
ESTs 

predicted exon 
predicted exon 

hypothetical protein FU13465 
predicted exon 
ESTs 

predicted exon 
EST 
ESTs 

predicted exon 
ESTs 

gb.-UW-BI3^)y-h.11-0-Ul.slNCLCGAP_Su 83 
ESTs 



A1308876 
AF161442 
AI799111 
AW081061 
AI479439 
AA659037 
AI285970 
R43707 
A1832519 
Ai659166 
AW1 63799 
AW955580 
AA463798 
AA961367 
AI292148 
AA1 35370 
AW262836 
AW288085 
AF167706 



Hs. 103849 
Hs.191591 
Hs.64341 
Hs. 103180 
Hs. 171532 
Hs.163780 
Hs.183817 
Hs.133159 

Hs.207144 
Hs. 198365 
Hs.42699 
Hs.102696 



UM domain kinase 1 

ESTs 

ESTs 

SUIViO-1 activating enzynie subunit 1 

predicted exon 

ESTs 

Homo sapiens HSPC324 mRNA, partial cds 
ESTs 

DC2 protein 
ESTs 
ESTs 
ESTs 

ESTs, Weakly similar to PIHUSD salivary 
gb:at69h03.x1 Barslead colon HPLRB7 Homi 
ESTs 

2,3-bisphosphoglycerate mutase 
ESTs 

IVICT-1 protein 



Hs.71622 SWI/SNF related, matrix associated, acti 
Hs.188536 Homo sapiens cDNA:FU21635 fis. done C 
Hs.252844 ESTs 
Hs.1 1156 hypothetical protein 
Hs.19280 cystelne-rteh motor neuron 1 
predicted exon 

AB033888 Hs.6619 SRY (sex determining region Y).box 18 
AI627912 Hs. 130783 Forssman synthetase 
predicted exon 

ESTSp H^hly similar to IGF^I mRNA-bInd 
ubiquitin caiboxyl-temiinal esterase LI 
ESTs 

hypothetical protein MG0257A 
mitogen-activating protein kinase kinase 
phosphollpase C. beta 3, neighbor pseudo 
predicted exon 
predicted exon 

hepatitis delta antigen-interacting prot 
nucleolar protein family A. member 3 (H/ 
hypothetical protein FU12442 
predicted exon 
predicted exon 



BE622021 Hs.152571 
At346201 H5.76118 
AI739435 Hs.39168 
AA531133 Hs.4253 
NM.004579HS.82979 
X91195 Hs.100623 



AI984144 Hs.66713 
BE621697 Hs.14317 
AF131781 Hs.84753 



•ina o 


n o 
U.2 


lU 


U.2 




U.2 
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-HO.C 


n 0 
U.2 


-JUO.O 
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U.2 
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U.2 


AO 


n 9 
U.2 


one R 


n 9 
U,2 




Uu! 


47r 0 
-If/.O 


A 9 

0,2 


-93D.D 


0,2 


-7,2 


0,2 


-18,4 


0.2 


-142 


0.2 


-100 


0.1 


83 


0.1 


-250.6 


0.1 


-374,2 


0.1 


-27.6 


0.1 


•288.2 


0.1 


-735.6 


0.1 


53.6 


0.1 


-1041.8 


0.1 


19.4 


0.1 


-123.6 


0.1 


-46.4 


0.1 


-156.4 


0.1 


-146.6 


0.1 


-304 


0.1 


-445 


0.1 


-m .0 


A 4 

0.1 


1-149.6 


0 


-o2.D 


0 


-15 


0 


-1059.6 


0 


-332.2 


0 


IS 


21.8 0 


-97.2 


0 


-2222 


0 


-881.6 


0 


28.4 


0 


-722 


0 


-421 


0 


35.4 


0 


-373.6 


0 


-257 


0 


-33 


0 


-280 


0 


-3627.0 


0 


-278.6 


0 


-244 


0 


4'ifiA *) 

-1204.2 


0 


-189.6 


0 


-122.6 


0 


-2257.4 


0 


-359.2 


0 


-874.6 


0 


-24.6 


0 


-191.8 


0 
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TABLE 6A 



10 



Table 6A shows the accession numbers for those pkeys lacking unigenelD's for Table 6. The pkeys in Table 7 lacking unlgenelD's are represented within 
Tables 1-6A. For each probeset we have listed the gene cluster number from which the oligonucleotides were designed. Gene clusters were compiled 
using sequences derived from Genbank ESTs and mRNAs, These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist. Oakland California). The Genbank accession numbers for sequences comprising each duster are listed in the "Accession" 
column. 



15 



Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT Number Accession 



320925 1525201,1 
20 321614 87866J 

313952 136885_1 

314648 293660_1 

302749 458.107 

312362 764066J 
25 312542 1522649J 

312642 1005225.1 

312986 171879_1 

329350 cjUns 

329414 c^y.hs 
30 329440 c^.hs 

329451 c.y.hs 

338033 CH22.6528FG_L1NK_EM:AC00 

338038 CH22.6535FG_LINK^EM:AC00 

338116 CH22„6650FG_LINK-.EMACOO 
35 338158 CH22L6700FG„LINK-EM:AC00 

329732 c14j>2 

329745 c14j)2 

308106 AI476803 

329863 c14.p2 
40 338316 CH22.6944FG_LINK,Efifl:AC00 

308248 AI560919 

338388 CH22L7034FG_UNK_EM:AC00 
338442 CH22.71O9FG_LINK^EI^:AC00 
338645 CH22^7410FG_LlNieEM:AC00 
45 338728 CH2a.7527FG_LINK.EM:AC00 
308077 AI832519 

338962 CH22.7838FG_LINK_DJ32I10 
308886 A1833240 

333120 CH22.349FG.81.3_LINK^EfVI:A 
50 333121 CH22.350FG_81_4_UNKLEMA 

333122 CH22.351FG.81_6_LINK_EM:A 

333123 CH22.352FG_81.7_L1NK^EMA 

333168 CH22.400FG.94J.LINK.EMA 

333169 CH22L,401FG.94.2.LINK-EMA 
55 333221 CH22»458FG,105_1.UNK_EM: 

326077 c17_hs 
326080 Cl7.hs 
326169 c17_hs 
326198 c17.hs 

60 326230 Cl7_hs 

333585 CH22^846FG_203 4 UNK.EM: 
333610 CH22.871FG.217.5_UNK^EM: 
335093 CH22_2423FGJ92.3^LINK.EM 
335095 CH2^2425FGJ92 5_LINK-EM 

65 335149 CH22_2484FG,499 5.UNK_EM 
326759 c20.hs 

333977 CH22^1254FG.309_6.LINK-EM 
326788 C20„hs 

335211 CH2^550FG_511_2_UNK_EM 
70 305192 AA666019 

303973 AW512014 

303992 AW515800 

326946 c21.hs 

328229 c_6.hs 
75 328262 c,6.hs 



062892 D79755 D62760 
H86161 AA054308AA018955 
F20956 AA129374 AA133740 AW8ig878 
AW979268 AA878419 AA431342 AA431628 
M16951 M16952 M16948 M16949 M16950 
AW015994 R39d98 AW000978 A1598202 Af521706 
D60076 D60259 D61037 
AW052128 H51439 H51481 

AA21 1586 F35799 AA21 1641 F29720 AW937387 AW937408 
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328418 c_7_.hs 
328455 c„7„hs 

335697 CH22_3058FG_596J^_L1NK.E 
328520 c_7.hs 
5 328548 c^7.hs 

335815 CH22^3187FG_618„3_UNK-EM 
328688 c_7_hs 
328695 c_7^hs 
307010 AI140014 
10 337113 CH2?_5058FGJ93J. 
307041 AI144243 
328700 c_7^hs 

335346 CH22„3324FG„646^0_LINK.D 
335986 CH22.3366FG_654J0.UNK.D 
15 335987 CH22^3367FG_654J1_UNK_D 

335988 CH2?.3368FG.654J2.UNKJD 

335989 CH22.3369FG_655JLUNK_DJ 

335990 CH22.3370FG,655j*„LINK-DJ 
337214 CH22.5288FG.613.7. 

20 330020 c16j)2 

305989 AA888220 

328857 c_7^hs 

328937 c^8_hs 

328957 c 8_hs 
25 330187 cjj>2 

337407 CH22L5607FG.755J. 

337429 CH22.5633FG_762.3. 

330232 c_5j)2 

307414 AI242106 
30 330305 cj^2 

330306 c^7_p2 

337603 CH22„5896FG_LINK^C20H12. 
337953 CH22^6395FG_L1NK^EM:AC00 
339236 CH22L8181FG_L1NK.BA354I1 
35 339403 CH2^8384FG_UNK.BA232E1 
309349 AW051913 
325222 c10.hs 
325251 clO.hs 

318188 956161 1 AI792566 AI053836A1054127 AI792489 A1268324 
40 309871 AW300366 
325544 c12.hs 
309931 AW341663 

332833 CH22-50FGJ7J^LINK.C20H1 

302779 33837 1 AJ235667 AJ235666AJ235664 AJ235665AJ235668AJ235669AJ235670 
45 302790 34168"l AJ245245 AJ245247 AJ245257 AJ245248 AJ245254 AJ245256 AJ245253 AJ245203 AJ245250 AJ245252 AJ245243 AJ245204 
AJ245201 AJ245206 AJ245246 AJ245255 AJ245205 AJ245202 AJ245251 AJ245249 AJ245207 AJ245244 

332961 CH2^185FG.48J8.UNK.EM: 

325753 c14Ji8 

327036 c21Jw 
50 325843 c16Jw 

325889 c16_hs 

304261 AA059387 

304275 AA070605 

334376 CH22_1670FG_379^8„UNieEM 
55 327220 c_1_hs 

304363 AA206045 

334458 CH22„1757F6.391.2^LINK-EM 
327365 cj.hs 
327373 c^2_hs 
60 334616 CH22^1923FG.411J5.LINICE 
327414 c.2^hs 
327568 c_3_hs 

336034 CH22.3419FG.678_5_LINK_DJ 
336059 CH2;L3445FG^684^.LINieDJ 
65 334834 CH22_2148FG.439_3_LINK_EM 
304782 AA582081 
304876 AA595765 
327747 c_5_hs 

336228 CH22_3626FG^730.4.LINK.DA 
70 329073 cjuhs 

329088 cjehs 

304969 AA614406 

327844 Q.5_hs 

327B76 C.6J1S 
75 308352 AA961367 

331131 genbanK.R54797 R54797 
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331139 9enbanK_R65706 R6S706 

3314» 675963.1 AW452904AW449414BE467906AI298565BES49932BE326357F04362 
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TABLE 6B 

Table 6B shows the genomic positioning for ihose pkeys lacking unigene ID's and accession numbers in Table 6. The pkeys in Table 7 lacking 
5 unigenetD*s are represented within Tables 1 -6B. For each predicted exon, we have listed the genomk: sequence source used for predk:tion. Nucleotide 
locatk)ns of each predicted exon are atso listed. 



10 



15 



Pkey: 
Ref: 



Strand: 
NLposItton: 



Unk)ue number conesponding to an Eos probeset 

Sequence source. The 7 digit numbers in this column are Genbank Identifier (61) numbers. "Dunham i. et al.' refers to the pubftcafton 
entitled "The DNA 

sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495. 
Indicates DNA strand from which exons were predicted. 
Indicates nucleotide positbns of predicted exons. 



Pkey Ref 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



332961 
333221 
333585 
333610 
334376 
334458 
334616 
335149 
335211 
335697 
335986 
335987 
335988 



335990 
336034 
337953 
336033 
338038 
338316 
338442 
338962 
332833 
333120 
333121 
333122 
333123 
333168 
333169 
333977 
334834 
335093 
335095 
335815 
335946 
336059 
336228 
337113 
337214 
337407 
337429 
337603 
338116 
338156 
338388 
338645 
338728 
339236 
339403 
325222 
325251 
325544 
325753 
329745 
329732 
329863 
325889 



Dunham. I. etat. 
Dunham. I. etal. 
Dunham. I. etal. 
Dunham, I. etal. 
Dunham. I. etal. 
Dunham, I. etal. 
Dunham, t etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etat. 
Dunham. I. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham. I. etat 
Dunham^ I. etal. 
Dunham. I. etal. 
Dunham, I. etal. 
Dunham. 1. et.at 
Dunham. I. etal. 
Dunham. I. etal. 
Dunham. I. etat 
Dunham, I. etal. 
Dunham. I. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham. I. etat 
Dunham. I. etal. 
Dunham. 1. etal. 
Dunham, I. etal. 
Dunham. I. etal. 
Dunham. I. etal. 
Dunham. I. etal. 
Dunham, I. etal. 
Dunham. 1. etal. 
Dunham, t. etat. 
Dunham. I. etat 
Dunham, t etat 
Dunham, 1. etat 
Dunham, tetat 
Dunham. I. etat. 
Dunham. I. etal. 
Dunham. K etat 
Dunham, I. etat 
Dunham. 1. etat 
Dunham. 1. etal. 
Dunham. 1. etat 
Dunham, I. etal. 
6525287 
6682448 
6682452 
6682474 
6065779 
6065783 
6691797 
5867087 



Strand 


Nt_j30sition 


Plus 


2521424-2521555 


Plus 


3978070-3978187 


Plus 


6234778-6234894 


Plus 


6547007-6547116 


plus 


13902218-13902331 

1 vr ■ t\J 1 w ww^ ww w 


Plus 


14353496-14353572 


Plus 


151761 23-1 51 76470 


Plus 


21 497441-21497587 


Plus 


21774611-21774680 


Plus 


25481456-25481 649 


Plus 


27967791-27967852 


Plus 


27971 41 3-27971 481 


Plus 


27977912-2797801 3 


Plus 


27983788-27983860 

Mm « wUvf Vlw^fcf WWW WWw 


Plus 


27988532-27988608 

0uu*#wfc cf www www 


Plus 


29014404-29014590 




6827029-6827125 




8092128-8092271 




ft138219.8138392 




1706971 1-17089^88 


Plus 


1 9980640-19980898 

1 wwwww* W IWwwwV/wV 


Plus 


2958 1 892-29582020 


Minus 


1119848-1119705 


Minus 


3307508-3307427 


Minus 


3308446-3308358 


Minus 


3309596-3309531 


Minus 


3310817-3310749 


Minus 


3729896-37297B8 


Minus 


3730864-3730767 


Minus 


8722928-8722725 


Minus 


17182681-17182535 


Minus 


21297367-21297214 


Minus 


21292546-21292381 


Minus 


26320518-26320421 


Minus 


27487203-27487035 


Minus 


29184079-29183969 


Minus 


30904602-30904497 


Minus 


21233344-21233237 


Minus 


26095902-26095502 


Minus 


31886652-31886567 


Minus 


32086238-32086079 


Minus 


1299296-1299194 


Minus 


10614071-10613814 


Minus 


11794465-11794343 


Minus 


18662403-18662305 


Minus 


24063839-24063775 


Minus 


25949039-25948927 


Minus 


32773355^2773202 


Minus 


34050728-34050625 


Minus 


22332-22473 


Minus 


411693-411751 


Plus 


171228-171286 


Plus 


398512-398621 


Plus 


174774-175142 


Plus 


161252-161322 


Plus 


196801-196971 


Plus 


223829-223891 
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10 



15 



20 



25 



30 



35 



40 



loco A O 

325843 


6552453 


Minus 


7126-7232 


330020 


6671887 


Plus 


172397-172491 


326198 


5867215 


Minus 


80295-80674 


326230 


5857230 


Minus 


301868-301972 


326169 


5867255 


Minus 


128321-128388 


326077 


6682495 


Minus 


312108-312168 


32bOoO 


6682495 


Plus 


478644-478847 


326759 


6249610 


Plus 


97216-97311 


3^d7oo 


6682503 


Plus 


277132-277335 


326946 


6004446 


Minus 


116677-116967 


327036 


6531965 


Plus 


319951-320040 


327220 


5867525 


Minus 


65701-65781 


327365 


6552412 


Minus 


118133-118198 


327414 


5867750 


Plus 


102461-102586 


327373 


5867792 


Minus 


8186-8742 


327568 


5867811 


Minus 


46152-46287 


330187 


6706138 


Plus 


212923-213020 


327747 


5867947 


Plus 


115322-115498 


327844 


6249582 


Minus 


18895-18958 


330232 


6013526 


Plus 


113655-113830 


328229 


5868105 


Minus 


120936-121053 


327876 


5868140 


Plus 


103882-104034 


328262 


6381906 


Plus 


11867-12027 


328688 


5868262 


Plus 


626030-626094 


328700 


5868264 


Plus 


764089-764203 


328695 


5868264 


Plus 


318632-318695 


328418 


5868409 


Minus 


258811-258894 


328455 


5868431 


Plus 


385576-385633 


328520 


5868477 


Plus 


1942075-1942246 


328548 


5868487 


Plus 


72301-72397 


328857 


6381927 


Minus 


80557-81051 


330305 


4877982 


Minus 


52269-52365 


330306 


4877982 


Plus 


96161-96233 


328937 


5868500 


Minus 


1448241-1448333 


328957 


6456773 


Plus 


219195-219297 


329073 


5868596 


Plus 


37838-37956 


329088 


5868608 


Plus 


116738-116950 


329350 


6456785 


Plus 


98911-98969 


329414 


5868874 


Plus 


942555-942643 


329440 


5868885 


Plus 


21943-22063 


329451 


5868887 


Plus 


25974-26048 
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TABLE 7: 

Table 7 depicts Seq 10 No., UnigenelD, UnigeneTHIe, Pkey, and ExAccn for aU of the sequences in Table 8. Seq ID No Vnks the nucleic add and protein 
sequence Infonnation InTable 6 to Table 7. 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



Pkey: 
ExAccn: 
UnigenelD: 
Un^ene Title: 
Seq.lD.No.: 



Unique Eos probeset identifier number 

Exemplar Accesston number, Genbank accession number 

Unigene number 

Un^ne gene title 

Sequence ktenHricatkin Number found In Table 8 



Unigene Tittle 











pp9^C1<vd 
DC4*4v7 lirt 


Hs 154210 


1 I90 19 


fv\H OUU4 w 


Hs 41135 






H<; 14QR0Q 


109017 


MiU ID# 14 


Ue 9A77Q7 
nd»40f /«// 


1AOQ1C 


aU/ 04U 


nb.^4uo 


inRQ'Jft 


AW000040 


Ue 99190 
nD.44 1 4U 


1 m*iQC 
\\Ji6oD 


lMlVL00009f 


Mc 1AA^ 




AAOUlODf 


Wq 7R99d 
nb.r D44*t 




AAoOlOD/ 


ns./ D44S 




Ab04D4 


Wc 911*^70 




1144'] cno 
Ml 0009 


ns.ooiDs 


tf\AOCC 

104 000 


T7O0>in 

1 / 9d40 


We 99»;7*i 
nS.440rO 


lUD 144 


n9^ooo 


Uq 7Cft7 
no. / oOf 


4 nOAA4 

10901)1 


AIOooo4o 


Uc 791 1ft 

ns.f 4 1 10 


104764 


A10o944o 


Uo 97A(^A(l 
nS.4r0000 


iiioon 


A 00*577 1R 
ADUOr t 10 


He lA.'^R^Q 
fib. lOOOOIl 


lOozoo 


AWOOOOOO 


Ue ccao 


104:094 


DC*HiUUn4 


Ue irrioc 
no.voo4v 


109400 


AW90000U 




4 4 nOAC 


A An^ROII 

AAOoO^ll 


ns. If ■*w*i 


119070 


DC«400DU 


Me 97CIA77 

115.4/9*1/ / 


132050 


AliCD/DiO 


nS.00U44 


4 01 Jinn 
132490 


MAX nmoon 
NlvL00l49O 


Ue 4onn 


102263 


AtAI4C4CeO 


Ue SlViM 


101714 


M68874 


nS.4llOO/ 


133975 


/^40QCC 


Ue OQRQ^ 


106793 


1194997 


Ue IR^M 
ns. 1D40U 


4'IOC44 


N70D^0 


Ue Al^l^ 


10144f 


M01Qni\ 
IVl4lOU0 




04^QA1 


MAO l3pLF4 


no.. louiMv 


OJ^'tOO 


MDU I0439 


no. 1 lo IHU 


01 <301 J 


AW9QAfifin 
rWV490uUu 


Uq 141840 
no. i*t io*ti/ 


01 JOOO 


A ACOOC47 


Ue 1 18509 
no. 1 io«^u4 


Ol ODDO 


AW/ a I4U 1 


Ue *?i9ri 
nO'W I400 


Q1>l^79 
J 140/4 




Ue 14900.') 
noi i»»4vuo 


AIQTrR 
4494rD 




Uc IQARI? 

no. I900 14 


lUlo*IO 


KIM nn'^7Q'^ 


Me 1<i9175 

no. )34 !/«/ 


A 4 oao>i 
415994 


AA49004U 


Mc AQt;4R 
no.osoMO 


lUOOWU 


AA187101 


Hs.213194 


133260 


AA403045 


Hs!6906 


101097 


BE245301 


Hs.89414 


104786 


AA027167 


Hs.10031 


132173 


X89426 


Hs.41716 


100420 


086983 


Hs.1 18893 


111018 


AI287912 


H$.3628 


108507 


AI554545 


Hs.68301 


104894 


AF065214 


Hs.18858 


118511 


N75620 


Hs.43157 


125609 


AA868063 


Hs.104576 


101543 


M31166 


Hs.2050 


102241 


NNL007351 


Hs^68107 


101560 


AW958272 


Hs.347326 


103280 


UB4722 


Hs.76206 


105826 


AA478756 


Hs.194477 


102604 


Nti/L002318 


H8.83354 


131647 


AA359615 


Hs.30089 


103095 


NWL005424 


Hs.78824 


103037 


BE018302 


Hs.2894 


100405 


AW291587 


Hs.82733 


102012 


BE259035 


Hs.1 18400 



endothelial differentiation. sphingoDpI 
endomucin-2 

integrin. alpha 5 (fibronectin receptor, 
integrin. beta 1 (fibionectin receptor, 
matrix metalloprotelnase 10 (stromelysin 
ESTs 

podocalyxin-like 

EGF-contalnlng fibulln-like extraceliula 
EGF-containing fibulin-like extraceliula 
mebnoma cell adhesion molecule 
matrix metailoproteinase 1 (interstitial 
B-ceil Cli/Iymphoma 6. member B (zinc ft 
Homo sapiens cDNA: FU21962 fis. clone H 
hypothetical protein FU20992 similar to 
ESTs 

hypothetical protein FU10210 

solute carrier family 7. (cationlc amino 

matrix metailoproteinase 3 (stromelysin 

ESTs 

ESTs 

ESTs 

ESTs 

LIM domain binding 2 

guanine nucleotide binding protein 11 

phospholipase A2, group IVA (cytosollc, 

tissue factor pathway inhibitor 2 

ESTs 

ESTs 

gb:Human alpha satellite and sateiate 3 
ESTs 

KIAA0716 gene product 

ESTs. Weakly similar to S59501 interfero 

ESTs 

ESTs 

ESTs 

G proteln-coupted receptor 51 

calcitonin receptor-filte 

selectin E (endothelial adhesion motecul 

hypothetical protein IWGC10895 

Homo sapiens cDNA: FU23197 fis, clone R 

chemokine (C-X-C motif), receptor 4 (fus 

KIAA0955 protein 

endothelial cen-spedfic molecule 1 

Melanoma associated gene 

mitogen-actlvated protein kinase kinase 

ESTs 

phospholipase A2. group IVC (cytosollc, 
ESTs 

carbohydrate (keratan sulfate Gal^] sul 
pentaxln-related gene, rapidly induced b 
multimerin 

Intercellular adhesion molecule 2 

cadherin 5. type 2, VE-cadherin (vascula 

E3ubfquItinl^aseSMURF2 

lysyl oxidase-like 2 

ESTs 

tyrosine kinase with immunoglobuiin and 
placental growth factor, vascular endoth 
nidogen 2 

singed prosophila)-like (sea urchin fas 



SEQ ID NO 

Seq ID1&2 
SeqID3&4 
Seq tD5&6 
Seq iD7&8 
Seq ID g & 10 
Seq ID 11 & 12 
Seq ID 13 & 14 
Seq ID 15 & 16 
Seq ID 17 & 18 
Seq ID 19&20 
Seq ID 21 &22 
Seq ID 23 & 24 
SeqlD25&26 
SeqlD27&28 
Seq ID29&30 
Seq ID31&32 
Seq ID33&34 
SeqlD35&36 
SeqlD37&38 
SeqlD39&40 
Seq ID 41 & 42 
Seq ID 43&44 
Seq ID 45 & 46 
Seq ID 47 & 48 
Seq ID 49 & 50 
Seq ID 51&52 
Seq ID53&54 
Seq ID 54 & 55 
Seq ID56&57 
Seq ID 58&59 
Seq ID 60 & 61 
Seq ID 62 & 63 
Seq ID 64 & 65 
Seq ID66&67 
Seq ID68&69 
Seq ID 70 & 71 
Seq ID 72 & 73 
Seq ID 74 & 75 
SeqID76&77 
Seq ID 78 & 79 
Seq ID 80 & 61 
SeqlD82&83 
SeqlD84&85 
SeqlD86&87 
SeqlD88&89 
SeqlD90&91 
SeqlD92&93 
SeqlD94&95 
SeqlD96&97 
Seq ID 98 & 99 
Seq ID 100 & 101 
Seq ID 102 & 103 
Seq ID 104 & 105 
Seq ID 106 & 107 
Seq ID 108 & 109 
Seq ID 110 & 111 
Seq ID 112 & 113 
Seq ID 114 & 115 
Seq ID 116 & 117 
Seq ID 118 & 119 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



101261 


D30857 


Hs.82353 


protein C receptor, endothelial (EPCR) 


Seq ID 120 & 121 


105729 


H46612 


Hs.293815 


Homo sapiens HSPC285 mRNA, partial cds 


Seq ID 122 & 123 


107216 


D51059 


Hs.211579 


melanoma cell adhesion molecule 


Seq ID 124 & 125 


131080 


NWL001955 


Hs.2271 


endothelin 1 


Seq ID 126 & 127 


131486 


F06972 


Hs.27372 


BMX non-receptor tyrosine kinase 


Seq ID 128 & 129 


134299 


AW580939 


Hs.97199 


complement component Clq receptor 


Seq ID 130 & 131 


134983 


D28235 


Hs.196384 


prostaglandin-endoperoxide synthase 2 (p 


Seq ID 132 & 133 


115827 


AA428000 


Hs.283072 


actin related protein 2/3 complex, subun 


Seq ID 134 & 135 


133614 


NM_003003 


Hs.75232 


SEC14 (S. cerevisiae)-like 1 


Seq ID 136 & 137 


116483 


A1346201 


Hs.76118 


ubiquitin carboxyl-terminal esterase LI 


Seq ID 138 & 139 


132546 


M242d3 


Hs.168383 


intercellular adhesion molecule 1 (CD54) 


Seq ID 140 & 141 


133678 


AW247252 


NA 


nucleoside phosphoryfase 


Seq ID 142 & 143 


130184 


H58306 


Hs.15165 


retinoic acid induced 14 


Seq ID 144 & 145 


134786 


T29618 


Hs.89640 


TEK tyrosine kinase, endothelial (venous 


Seq ID 146 & 147 


129371 


X06828 


Hs,110802 


von Willebrand factor 


Seq ID 148 & 149 


418506 


AA084248 


Hs,85339 


G protein-coupled receptor 39 


Seq ID 150 & 151 


322262 


AA632012 


Hs.188746 


ESTs 


Seq ID 152 & 153 


312173 


AI821409 


Hs.304471 


EST 


Seq ID 154 & 155 


319795 


AB037821 


Hs.146858 


protocadherin 10 


Seq ID 156 & 157 


313978 


AI870175 


Hs.13957 


ESTs 


Seq ID 158 & 159 


306840 


AI077477 


Hs.307912 


ESTs 


Seq ID 160 & 161 


310272 


AF216389 


Hs.148932 


seme domain, transmembrane domain (TM), 


Seq ID 162 & 163 


310272 


AF216389 


Hs.148932 


sema domain, transmembrane domain (TM), 


Seq ID 164 & 165 


315044 


BE547674 


Hs.20416g 


ESTs. Weakly similar to S65657 alpha-IC- 


Seq ID 166 & 167 


321325 


AB033100 


Hs.300646 


KIAA1274 protein (similar to mouse palad 


Seq ID 168 & 169 


321325 


AB033100 


Hs.300646 


KIAA1274 protein (similar to mpuse.palad 


Seq ID 170 & 171 


303251 


AF240635 


Hs.1 15897 


protocadherin 12 


Seq ID 172 & 173 


302378 


AL109712 


Hs.296506 


Homo sapiens mRNA full length insert cDN 


Seq ID 174 & 175 


319267 


F11802 


Hs.6818 


ESTs 


Seq ID 176 & 177 


310442 


AW072215 


Hs.208470 


ESTs 


Seq ID 178 & 179 


300469 


BE301708 


Hs.233955 


hypothetical protein FU20401 


Seq ID 180 & 181 


331237 


W87874 


Hs.25277 


Homo sapiens cDNA FU10717 fis; clone NT 


Seq ID 182 & 183 


330968 


R44557 


Hs.23748 


ESTs 


Seq ID 184 & 165 


301822 


X17033 


Hs.271986 


integrin. alpha 2 (CD49B, alpha 2 subuni 


Seq ID 186 & 187 


422573 


AW297985 


Hs.295726 


integrin. alpha V (vitronectin receptor 


Seq ID 188 & 189 


133061 


AI186431 


Hs.296638 


prostate differentiation factor 


Seq ID 190 & 191 


135235 


AW298244 


Hs.266195 


ESTs 


Seq (D 192 & 193 


101192 


BE247295 


Hs.78452 


solute carrier family 20 (phosphate tran 


Seq ID 194 & 195 


113195 


H83265 


Hs.8881 


ESTs, Weakly similar to S41044 chromosom 


Seq ID 196 & 197 


101741 


NM_003199 


Hs.326198 


transcription factor 4 


Seq ID 198 & 199 


321911 


AF026944 


Hs.293797 


ESTs 


Seq ID 200 & 201 


320635 


N50617 


Hs.80506 


small nuclear ribonudeoprotein poiypept 


Seq ID 202 & 203 


326230 






NI\/L017643:Homo sapiens hypothetteal prot 


Seq ID 204 & 205 


132968 


AF234532 


Hs.61638 


myosin X 


Seq ID 206 & 207 


135073 


W55956 


Hs.94030 


Homo sapiens mRNA; cDNA DKFZp586E1624 (f 


SeqlD208&209 


108937 


AL050107 


Hs.24341 


tnanscriptional co-adrvator with PDZ-bi 


Seq ID 210 & 211 


116430 


AK001531 


Hs.66048 


hypothetical protein FU10669 


Seq ID 212 & 213 


104877 


AI138635 


Hs.22966 


Homo sapiens ctone IMAGE:451939, mRNA se 


Seq ID 214 & 215 


122697 


AA420683 


Hs.98321 


hypolhetkal protein FLJ14103 


Seq ID 216 & 217 


112522 


R68857 


Hs.265499 


ESTs 


Seq ID 218 & 219 


304782 


AA582081 




gb:nn32h06^1 NCLCGAP.GasI Homosaptens 


SeqID220 &221 


312802 


AA644669 


Hs.193042 


ESTs 


Seq ID 222 & 223 


302680 


AW192334 


Hs,38218 


ESTs 


Seq ID 224 & 225 


326198 






Phase 2 & 3 Exons 


Seq ID 226 & 227 


331019 


NM_006033 


Hs.65370 


lipase; endothelial 


SeqlD228 &229 
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TABLES 



Seq ID NO: 1 DNA seguence 
Nucleic Acid Accession #: 
Coding seguence: 244- 



NM_001400 

220B (underlined sequences correspond to start and stop codons) ) 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



GTCGG6GGCA 
CTTCGCCCTG 
CACAAAAAGC 
CGCCCTCTAG 
ACCJ^GGGC 
GTCAACTATG 
GACAAGGAGA 
ATCCTG6AGA 
ATGTACTATT 
GCTAACCTGC 
CGGGAAGGGA 
ATTGAGCGCT 
CTCTTCCTGC 
ATGGGCTGGA 
AAGCACTATA 
CTGTACTGCA 
AACATTTCCA 
ATCGTCCTGA 
GTGGGCT6CA 
GCTGTGCTCA 
CG6GCCTTCA 
TTCAAGCGAC 
CACCCCCAGA 
TCTTCTTCCT 
CCACCCCAGT 
CAAGCCAGAG 
TAGAGTTAGT 
TATATATTCT 
AGCTCCTAAA 
TCTTTGTCTG 
GTGTGCACTT 
TTCATACCCC 
CTGGGGTTGT 
TGGGAAGATG 
CATGTAAGCG 
CATCTTTTCA 
AA6CCCACTT 
A6CAAAACAA 
AAATGAGTCT 
TCTTGTGTGA 
CTTGATTTTT 
GTTAACTTTT 
CGCCAGAACT 
ACAAAGAATA 
AGATGTCTTQ 
TTTGCACATA 



11 
I 

GCAGCAAGAT 
CTTGAGCGAG 
CTGGATCACT 
CGTTCGTCTG 
CCACCAGCGT 
ATATCATCGT 
ACAGCATTAA 
ACATCTTTGT 
TTATTGGCAA 
TCTTGTCTGG 
6TATGTTTGT 
ATATCACAAT 
TAATCAGCGC 
ACTGCATCAG 
TCCTCTTCT6 
GAATCTACTC 
AGGCCAGCCG 
GCGTCTTCAT 
AGGTGAAGAC 
ACTCCGGCAC 
TCC6GATCAT 
CCATCATCGC 
AAGACGAAGG 
AGAACTGGAA 
GTTTGGAAAA 
GGAGGAAGGG 
TCCTGTGAAC 
ACCCCCCTGG 
GGGTTCATTT 
GAGCTTTGAG 
CTGCTTCTTT 
TCCTCAACGT 
GGAATGATCG 
AAGATGGTTT 
GGATCCGTTT 
ATGAAATGTG 
TATCTAAATG 
AGTGAAAACC 
AACAAATAT6 
TTCATTTCAA 
GAATGTATTT 
CTAGAATCCA 
TTTAAGTCCA 
AJU^TATATT 
TTTTTTTAAA 
GCTTTATCAA 



21 
1 

GCGAA6CGAG 
GCTGCGGTTT 
CATCGAACCA 
GAGTAGCGCC 
CCCGCTGGTC 
CCGGCATTAC 
ACTGACCTCG 
CTTGCTGACC 
TCTGGCCCTC 
GGCCACCACC 
GGCCCTGTCA 
GCTGAAAATG 
CZTGCTGGGTC 
TGCGCTGTCC 
CACCACGGTC 
CTTGGTCAGG 
CA6CTCTGAG 
CGCCTGCTGG 
CTGTGACATC 
CAACCCCATC 
GTCCTGCT6C 
CGGCATGGAA 
GGACAACCCA 
GCTGTCCACC 
AAATCTCTGG 
GGAGAATACG 
AATGCACTGG 
AGCTTTGATT 
GGCCCCTCCT 
GAGATGTTTT 
AGGGATGCCC 
TCTTTTACTT 
ATCATCTATA 
G6AGGTGTAA 
TTTGGAATTT 
TTACCATTTC 
ATATTAGCCA 
GAATGGATTA 
ACATCCGTCT 
GCAACAACAT 
GTTTCAGGAA 



GCTATTCATT 
ACTGTCTCTT 
AAGAATAGTA 
CTTTTAAACA 



31 

1 

CCGTACAGAT 
CCGAGGCCCT 
CCCCTGAAGC 
ACCCCGGCTT 
AAGGCCCACC 
AACTACACGG 
GTGGTGTTCA 
ATTTGGAATU^ 
TCAGACCTGT 
TACAAGCTCA 
GCCTCCGTGT 
AAACTCCACA 
ATCTCCCTCA 
AGCTGCTCCA 
TTCACTCTGC 
ACTCGGAGCC 
AAGTCGCTGG 
GCACCGCTCT 
CTCTTCAGAG 
ATTTACACTC 
AAGTGCCCGA 
TTCAGCCGCA 
GAGACCATTA 
CACCGGAAGC 
GCTTCGACTG 
AACAGCCTGG 
GAAGGGTGGA 
TTGCACTGAG 
CAAAGACTAA 
CCTTCACTTT 
TGTAC7VTCCC 
TATACTTTAA 
GCAAATAGGC 
AACAATGTCC 
GGTT6AAGTC 
ATATCCATTG 
GGATCCTTG6 
ACTTTTGCAA 
TTCCCACTTT 
GTTGTATTTT 
GAAGTCATTT 
CCCTTAAGCA 
AGATAGTAAT 
TAGTATGGTT 
TTTAATAGGT 
TTAATAAACT 



41 
I 

CCCGGGCTCT 

CTCCAGCCAA 

CAGTGAAGGC 

CCTGGGGACA 

GCAGCTCGGT 

GAAAGCTGAA 

TTCTCATCTG 

CCAAGAAATT 

TGGCAGGAGT 

CTCCCGCCCA 

TCAGTCTCCT 

ACGGGAGCAA 

TCCTGGGTGG 

CCGTGCTGCC 

TTCTGCTCTC 

GCCGCCT6AC 

CGCTGCTCAA 

TCATCCTGCT 

CGGAGTACTT 

TGACCAACAA 

GCGGAGACTC 

GCAAATCGGA 

TGTCTTCTGG 

GCTCTTTACT 

CTGCCAGGGA 

TGGTGTCGGG 

GATCAGGTCC 

CCA7VAGGTCT 

TGTCCCCATG 

AGTTTCAAAC 

ACACCCCACC 

CTACCTGAGA 

TATGTTGAGT 

TTCGCTGAGG 

ACTTT6ATTT 

AAGCCGAAAT 

TGTCCTAGGA 

ACCAAGGGAG 

TGTTGATGTT 

6TTGTGTTAA 

TATGGATTTT ' 

TTACTTTAAC 

TGAAGATATG 

TTCAGTGCAA 

TTCTGACTTT 

GATTTTTTTA 



51 
I 

CCGAACGCAA 
GGAAAAGCTA 
TCTCTCGCCT 
CAGGGTTGGC 
CTCTGACTAC 
TATCAGCGCG 
CTGCTTTATC 
CCACCGACCC 
AGCCTACACA 
GTGGTTTCTG 
CGCCATCGCC 
TAACTTCCGC 
CCTGCCTATC 
GCTCTACCAC 
CATCGTCATT 
GTTCCX5CAAG 
GACCGTAATT 
CCTGCTGGAT 
CCTGGTGTTA 
GGAGATGCGT 
TGCTGGCAAA 
CAATTCCTCC 
AAACGTCAAC 
TGGTCGCTGG 
GGAGCTGCTG 
TGTTGGTGGG 
CGGCCTGGAA 
AGCATTGTCA 
TGAAAGCGTC 
CCT^GTGAGT 
CTCCCTTCCC 
GTTATCAGAG 
ACGTAGGCTG 
CCAAAGTTTC 
CTTTAAAAAA 
CTGCATAAGG 
GAAACAGACA 
ATTTCTTAGC 
TATTTCAGAA 
AAGTACTTTT 
TCTAACCCGT 
TGGTAGG6AA 
TATAAATATT 
TTAAACCGAG 
TGTGGATCAT 
AAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 



60 



65 



70 



75 



Seq ID NO: 2 Protein sequence; 
protein Accession #: NP 001391 



MGPTSVPLVK 
LENIPVLLTI 
E6SMFVALSA 
GWNCISAIiSS 
ISKASRSSEK 
VLNSGTNPII 
PQKDEGDNPE 



11 
I 

AHRSSVSDYV 
WKTKKFHRPM 
SVFSLLAIAI 
CSTVLPLYHK 
SLALLKTVII 
YTLTNKEMRR 
TIMSSGNVNS 



21 
1 

NYDIIVRHYN 
VYFIGNIiAIiS 
ERYITMIiKMK 
HYILPCTTVF 
VI.SVFIACWA 
AFIRINSCCK 
S8 



31 41 51 

I I I 

YTGKIiNISAD KENSIKLTSV VFIIiXCCFII 60 

DLIjAGVAYTA NliliXiSGATTY KLTPAQWFUl 120 

IiHNGSNNFRL FLLISACWVI SIiILGGIiPIM 180 

TIiLlJiSlVXIi YCRIYSLVRT RSRRIiTFRKN 240 

PI,FII«IiIiI<DV GCKVKTCDIIj FRAEYFLVLA 300 

CPSGDSAGKF KRPIIAGMEF SRSKSDNSSH 360 
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5 



Seq ID NO: 3 Nucleoti de aeouence; 
10 Nucleic Acid Accession #: NM_016242 

Coding sequence: 79 > 864 (underlined sequences correspond to start and stop codons) } 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 



35 

Seq ID NO: 4 Protein sequence ; 
Protein Accession NP 057326 



60 
120 
180 
240 



Seq ID NO: 5 Nucleotide sequence; 
Nucleic Acid Accession #; NM_002205 , 

50 Coding sequence: 24.. 3173 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

^0 CA6GACAGG6 AAGAGC6G6C 6CTATG666A GCC6GAC6CC AGACSTCCCCT CTCCACGCCG 60 

TGCAGCTGCG CTGGGGCCCC CGGCGCCGAC CCCCGCTCGT GCCGCTGCTG TTGCTGCTCG 120 

TGCCGCCGCC ACCCAGGGTC GGGGGCTTCA ACTTAGACGC GGAGGCCCCA GCAGTACTCT 180 

CGGGGCCCCC GGGCTCCTTC TTCGGATTCT CAGTGGAGTT TTACCGGCCG GGAACAGACG 24 0 

GGGTCAGTGT GCTGGTGGGA GCACCCAAGG CTAATACCAG CCAGCCAGGA GTGCTGCAGG 300 

60 GTGGTGCTGT CTACCTCTGT CCTTGGGGTG CCAGCCCCAC ACAGTGCACC CCCATTGAAT 360 

TTGACAGCAA AGGCTCTCGG CTCCTGGAGT CCTCACTGTC CAGCTCAGAG GGAGAGGAGC 420 

CTGTGGAGTA CAAGTCCTTG CAGTGGTTCG GGGCAACAGT TCGAGCCCAT GGCTCCTCCA 480 

TCTTGGCATG CGCTCCACTG TACAGCTGGC GCACAGAGAA GGAGCCACTG AGCGACCCCG 540 

TGGGCACCTG CTACCTCTCC ACAGATAACT TCACCCGAAT TCTGGAGTAT GCACCCTGCC 600 

65 GCTCAGATTT CAGCTGGGCA GCAGGACAGG GTTACTGCCA AGGAGGCTTC AGTGCCGAGT 660 

TCACCAAGAC TGGCCGTGTG GTTTTAGGTG GACCAGGAAG CTATTTCTGG CAAGGCCAGA 720 

TCCTGTCTGC CACTCAGGAG CAGATTGCAG AATCTTATTA CCCCGAGTAC CTGATCAACC 780 

TGGTTCAGGG GCAGCTGCAG ACTCGCCAGG CCAGTTCCAT CTATGATGAC A6CTACCTAG 840 

GATACTCTGT GGCTGTTGGT GAATTCAGTG GTGATGACAC AGAAGACTTT GTTGCTGGTG 900 

70 TGCCCAAAGG GAACCTCACT TACGGCTATG TCACCATCCT TAATGGCTCA GACATTCQAT 960 

CCCTCTACAA CrTTCTCAGGG GAACAGATGG CCTCCTACTT T6GCTATGCA GTGGCCGCCA 1020 

CAGAC6TCAA TGGGGACGGG CTG6ATGACT TGCTCSGTGGG (3GCACCCCTG CTCATGGATC 1080 

GGACCCCTGA CGGGCGGCCT CAGGAGGTGG GCAGGGTCTA CGTCTACCT6 CAGCACCCAG 1140 

CCGGCATAGA GCCCACGCCC ACCCTTACCC TCACTGGCCA TGAT6AGTTT GGCCGATTTG 1200 

75 GCA6CTCCTT GACCCCCCTG GGGGACCTGG ACCAGGATGG CTACAATGAT GTGGCCATCG 1260 

GGGCTCCCTT TGGTGGGGAG ACCCAGCA66 GAGTAGTGTT TGTATTTCCT G(3GGGCCCAG 1320 



180 
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1 11 21 31 41 51 

15 I I I I I I 

AAGGCCCTGC CAGCTTGGGA GGGAATTGTC CCTGCCTGCT TCTGGAGAAA GAAGATATTG 
ACACCATCTA CGGGCACCAT GGAACTGCTT CAAGTGACCA TTCTTTTTCT TCTGCCCAGT 
ATTTGCAGCA GTAACAGCAC AGGTGTTTTA GAGGCAGCTA ATAATTCACT TGTTGTTACT 
ACAACAAAAC CATCTATAAC AACACCAAAC ACAGAATCAT TACAGAAAAA TGTTGTCACA 

20 CCAACAACTG GAACAACTCC TAAAGGAACA ATCACCAATG AATTACTTAA AATGTCTCTG 
ATGTCAACAG CTACTTTTTT AACAAGTAAA GATGAAGGAT TGAAAGCCAC AACCACTGAT 
GTCAGGAAGA ATGACTCCAT CATTTCAAAC GTAACAGTAA CAAGTGTTAC ACTTCCCAAT 
GCTGTTTCAA CATTACAAAG TTCCAAACCC AAGACTGAAA CTCAGAGTTC AATTAAAACA 
ACAGAAATAC CAGGTAGTGT TCTACAACCA GATGCATCAC CTTCTAAAAC TGGTACATTA 

25 ACCTCAATAC CAGTTACAAT TCCAGAAAAC ACCTCACAGT CTCAAGTAAT AGACACTGAG 
GGTGGAAAAA ATGCAAGCAC TTCAGCAACC AGCCGGTCTT ATTCCAGTAT TATTTTGCCG 
GT6GTTATTG CTTTGATTGT AATAACACTT TCAGTATTT6 TTCTGGTGGG TTTGTACCGA 
ATGTGCTGGA AG6CAGATCC GGGCACACGA GAAAAT66AA ATGATCAACC TCAGTCTGAT 
AAAGAGAGCG T6AAGCTTCT TACCGTTAAG ACAATTTCTC ATGAGTCTGG TGAGCACTCT 

30 GCACAA6GAA AAACCAAGAA CT6ACAGCTT GAGGAATTCT CTCCACACCT AGGCAATAAT 
TACGCTTAAT CnTCAGCTTC TATGCACCAA GCGTGGAAAA GGAGAAAGTC CTGCAGT^TC 
AATCCCGACT TCCATACCTG CTGCTG6 



1 11 21 31 41 51 

40 I I 1 I I I 

MELIiQVTIIiF LLPSICSSNS TGVLEAANNS LWTTTKPSI TTPNTESLQK NWTPTTGTT 

PKGTITNELL KMSLMSTATF IiTSKDEGIiKA TTTDVRKNDS IISNVTVTSV TLPNAVSTLQ 

SSKPKTETQS SIKTTEIPGS VliQPDASPSK TGTLTSIPVT IPENTSQSQV IDTEGGKNAS 

TSATSRSYSS IILPWIALI VITIiSVFVLV GLYRMCWKAD PGTPBN6NDQ PQSDKESVKI* 
45 liTVKTISHES GEHSAQGKTK N 
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GAGGGCTGGG CTCTAAGCCT TCCCAGGTTC TGCAGCCCCT GTGGGCAGCC AGCCACACCC 138 0 

CAGACTTCTT TGGCTCTGCC CTTCGAGGAG GCCGAGACCT GGATGGCAAT GGATATCCTG 1440 

ATCTGATTGT GGG6TCCTTT GGTGT6GACA AGGCTGTGGT ATACAGGGGC CGCCCCATCG 1500 

TGTCCGCTAG TGCCTCCCTC ACCATCTTCC CCGCCATGTT CAACCCAGAG GAGCGGAGCT 1560 

5 GCAGCTTAGA GGGGAACCCT GTGGCCTGCA TCAACCTTAG CTTCTGCCTC AATGCTTCTG 1620 

GAAAACACGT TGCTGACTCC ATTGGTTTCA CAGTGGAACT TCAGCTGGAC TGGCAGAAGC 1680 

AGAAGG6AG6 G6TAC66CGG 6CACT6TTCC TGGCCTCCAG GCAGGCAACC CTGACCCAGA 1740 

CCCTGCTCAT CCAGAATGGG GCTCGAGAGG ATTGCAGA6A GATGAAGATC TACCTCAG6A 1800 

ACGAGTCAGA ATTTCGAGAC AAACTCTCGC C6ATTCACAT CGCTCTCAAC TTCTCCTTGG 1860 

10 ACCCCCAAGC CCCAGTGGAC AGCCACGGCC TCAGGCCAGC CCTACATTAT CAGAGCAAGA 1920 

GCC6GATAGA GGACAAGGCT CAGATCTTGC TGGACTGTGG AGAAGACAAC ATCTGTGTGC 1980 

CTGACCTGCA GCTGGAAGTG TTTGGGGAGC AGAACCATGT GTACCTGGGT GACAAGAATG 2040 

CCCTGAACCT CACTTTCCAT GCCCAGAATG TGGGTGAGQG TGGCGCCTAT GAGGCTGAGC 2100 

TTCGGGTCAC CGCCCCTCCA GAGGCTGAGT ACTCAGGACT CGTCAGACAC CCAGGGAACT 2160 

15 TCTCCAGCCT GAGCTGTGAC TACTTTGCCG TGAACCAGAG CCGCCTGCTG GTGTGTGACC 2220 

TGGGCAACCC CATGAAGGCA GGAGCCAGTC TGTGGGGTGG CCTTCGGTTT ACAGTCCCTC 2280 

ATCTCCGGGA CACTAAGAAA ACCATCCAGT TTGACTTCCA GATCCTCAGC AAGAATCTCA 234 0 

ACAACTCGCA AAGCGACGTG GTTTCCTTTC GGCTCTCCGT GGAGGCTCAG GCCCAGGTCA 2400 

CCCTGAACGG TGTCTCCAAG CCTGAGGCAG TGCTATTCCC AGTAAGCGAC TGGCATCCCC 2460 

20 GAGACCAGCC TCAGAAGGAG GAGGACCTGG GACCTGCTGT CCACCATGTC TATGAGCTCA 2520 

TCAACCAAGG CCCCAGCTCC ATTAGCCAGG GTGTGCTGGA ACTCAGCTGT CCCCAGGCTC 2580 

TGGAAGGTCA GCAGCTCCTA TATGTGACCA GAGTTACGGG ACTCAACTGC ACCACCAATC 2640 

ACCCCATTAA CCCAAAGGGC CTGGAGTTGG ATCCCGAGGG TTCCCTGCAC CACCAGCAAA 2700 

AACGGGAAGC TCCAAGCCGC AGCTCTGCTT CCTCGGGACC TCAGATCCTG AAATGCCCGG 2760 

25 AGGCTGAGTG TTTCAGGCTG CGCTGTGAGC TCGGGCCCCT GCACCAACAA GAGAGCCAAA 2820 ' 

GTCTGCAGTT GCATTTCCGA GTCTGGGCCA AGACTTTCTT GCAGCGGGAG CACCAGCCAT 2880 

TTAGCCTGCA GTGTGAGGCT GTGTACAAAG CCCT6AAGAT GCCCTACCGA ATCCTGCCTC 2940 

GGCAGCTGCC CCAAAAAGAG CGTCAG6TGG CCACAGCTGT GCAATGGACC AAG6CAGAAG 3000 

GCAGCTATGG CGTCCCACTG TGGATCATCA TCCTAGCCAT CCTGTTTGGC CTCCTGCTCC 3060 

30 TAG6TCTACT CATCTACATC CTCTACAAGC TTGGATTCTT CAAACGCTCC CTCCCATATG 3120 

GCACCGCCAT GGAAAAAGCT CAGCTCAAGC CTCCAGCCAC CTCT6ATGCC TGAG TCCTCC 3180 

CAATTTCAGA CTCCCATTCC TGAAGAACCA GTCCCCCCAC CCTCATTCTA CTGAAAA6GA 3240 

GGGGTCTGGG TACTTCTTGA AGGTGCTGAC GGCCAGGGA6 AAGCTCCTCT CCCCAGCCCA 3300 

GAGACATACT TGAAGGGCCA GAGCCAGGGG GGTGAGGAGC TGGGGATCCC TCCCCCCCAT 3360 

35 GCACTGTGAA GGACCCTTGT TTACACATAC CCTCTTCATG GATGGGGGAA CTCAGATCCA 3420 

GGGACAGAGG CCCAGCCTCC CTGAAGCCTT TGCATTTTGG AGAGTTTCCT GAAACAACTG 34 80 

GAMGATAAC TAGGAAATCC ATTCACAGTT CTTTGGGCCA GACATGCCAC /WVGGACTTCC 354 0 

TGTCCAGCTC CAACCTGCAA AGATCTGTCC TCAGCCTTGC CAGAGATCCA AAAGAAGCCC 3600 

CCAGTAAGAA CCTGGAACTT GGGGAGTTAA GACCTGGCAG CTCTGGACAG CCCCACCCTG 3660 

40 GTGGGCCAAC AAAGAACACT AACTATGCAT GGTGCCCCAG GACCAGCTCA GGACAGATGC 3720 

CACAAGGATA GATGCTGGCC CAGGGCCAGA GCCCAGCTCC AAGGGGAATC AGAACTCAAA 3780 

TGGGGCCAGA TCCAGCCTGG GGTCTGGAGT TGATCTGGAA CCCAGACTCA GACATTGGCA 3 840 

CCAATCCAGG CAGATCCAGG ACTATATTTG GGCCTGCTCC AGACCTGATC CTGGAGGCCC 3900 

AGTTCACCCT GATTTAGGAG AAGCCAGGAA TTTCCCA6GA CCTGAAGGGG CCATGATGGC 3960 

45 AACAGATCTG GAACCTCAGC CTGGCCAGAC ACAGGCCCTC CCTGTTCCCC AGAGAAAGGG 4020 

GAGCCCACTG TCCTGGGCCT GCAGAATTTG GGTTCTGCCT GCCAGCTGCA CTGATGCTGC 4080 

CCCTCATCTC TCTGCCCAAC CCTTCCCTCA CCTTGGCACC AGACACCCAG GACTTATTTA 4140 

AACTCTGTTG CAAGTGCAAT AAATCTGACC CA6TGCCCCC ACTGACCAGA ACTAGAAAAA 4200 
AAAA 

50 

Seq ID NO; 6 Protein sequence; 
protein Accession NP_002196.1 

55 1 11 21 31 41 51 

I I I I I I 

MGSRTPESPli HAVQLRWGPR RRPPLVPLLL LLVPPPPRVG GPNLDAEAPA VLSGPPGSFF 60 

GFSVEFYRPG TDGVSVLVGA PKANTSQPGV LQGGAVYIiCP WGASPTQCTP lEFDSKGSRL 120 

liESSLSSSEG EEPVEYKSLQ WPGATVRAHG SSILACAPLY SWRTEKEPLS DPVGTCYIjST 180 

60 DNFTRIIiEYA PCRSDFSWAA GQGYCQGGFS AEFTiCTGRW LGGPGSYFWQ GQILSATQEQ 240 

lAESYYPEYIi INLVQGQLQT RQASSIYDDS YliGYSVAVGE FSGDDTEDFV AGVPKGNXjTY 300 

GYVTILNGSD IRSLYNFSGE QMASYFGYAV AATDVNGDGL DDLLVGAPLL MDRTPDGRPQ 360 

BVGRVYVYLQ HPAGIEPTPT LTLTGHDEFG RFGSSLTPLG DIiDQDGYNDV AIGAPFGGET 420 

QQGWFVFPG GPGGIiGSKPS QVLQPLWAAS HTPDFFGSAI, RGGRDLDGNG YPDIilVGSFG 480 

65 VDKAWYRGR PIVSASASLT IFPAMFNPEE RSCSLBGNPV ACINLSFCIiN ASGKHV7UDSI 540 

GFTVEIiQIiDW QKQKGGVRRA LFIiASRQATU TQTLIilQNGA REDCREMKIY IiRNESEFRDK 600 

liSPIHIAIiNP SIiDPQAPVDS HGIiRPAIiHYQ SKSRIEDKAQ ILLDCGEDNI CVPDLQIiEVP 660 

GEQNHVYLGD KNALNLTFHA QNVGEGGAYE AELRVTAPPE AEYSGLVRHP GNFSSLSCDY 720 

FAVNQSRLLV CDLGNPMKAG ASLWGGLRFT VPHIiRDTKKT ZQFDFQILSK NIiNNSQSDW 780 

70 SFRLSVEAQA QVTLNGVSKP EAVLFPVSDW MPRDQPQKEB DLGPAVHHVY EliINQGPSSI 840 

SC2GVI«EIiSCP QALEGQQLLY VTRVTGLKCT TNHPXNPKGL BIiDPEGSIiHH QQKREAPSRS 900 

SASS6PQILK CPEAECFRLR CELGPLHQQH SQSLQIiHFRV HAKTFIiQREH QPFSLQCEAV 960 

YKAIjKMPYRZ XiPRQLPQKER QVATAVQWTK AEGSYGVPLW ZIIIiAIIiFGIi LIjLGLLIYIIi 1020 
YKLGFFKRSL PYGTAMEKAQ IiKPPATSDA 



181 
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Seq ID NO: 7 Nucleotide seoruence : 
Nucleic Acid Accession #: NM_002211 

Coding sequence: 104.. 2500 (underlined sequences correspond to start and stop codons) 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



GTCCGCCAAA 
ACAAGCCTAA 
TTTCTGGATT 
ATGTTTAAAA 
GT6GTGCACA 
TTTAGAAGCC 
CAAAGATATA 
CAAGCCAGAG 
GGAGCCACAG 
CTACCTTATO 
AACAGATCTG 
ATTTGTGGAA 
TTGCACAAGT 
TAATAAAGGA 
TTCTCCAGAA 
CTGGAGGAAT 
AGATGGGAAA 
TATGTACACA 
GAGTGAT^AAT 
GGAGCTGAAA 
TGTAATTCAG 
CGGCAAATTG 
TGGAAC^GGG 
TGAAATTAGC 
GCCTCTGGGC 
CCAAAGCGAA 
TG6CGCGTGC 
AGTTAACAGT 
TAACAATG6A 
AATTTATTCT 
AATTTGTGGA 
TGGCAGTGCA 
CTGCAATGGC 
AGGGCAAACG 
TGTTCAGTGC 
CTATTTTAAC 
TCCTGTGTCC 
AGTGAATGGG 
TCCAGACATC 
ATTACTGCTG 
TGAAAAGGAG 
CGTAACAACT 
ACAACACTGA 
TTGCCATGGT 
TGTTTTATTA 
ACAGGATGGT 
GACTATTGAA 
AGTTAAAGTA 
CTTTACATAT 
AAGGATTGTT 
CTGAAAGACA 
AAAGGCCATG 
GTGCCATTTT 
AGCCTTAGAT 
ACATACAGGC 
TTTATTATTT 
TGTATTTGTT 
AGCGGTCAAT 
TCACCTACGC 
CACCATTTGA 
GTTTTAACAG 



11 
I 

ACCTGCGCGG 
CGTCCGCTGG 
GGACTGATCA 
GCAAAT6CCA 
AATTCAACAT 
TTAAAAAAGA 
AAGAAAAATA 
GATATTACTC 
ACATTTACAT 
GACCTGTCTT 
ATGAATGAAA 
AAGACTGTGA 
GAACAGAACT 
GAAGTATTTA 
GGTGGTTTCG 
GTTACACGGC 
CTTGGTGGCA 
ATGAGCCATT 
AATATTCAGA 
AACTTGATCC 
TTGATCATTG 
TCAGAAGGAG 
GAAAATGGAA 
ATAACTTCAA 
TTTACGGAGG 
GGCATCCCTG 
AGGTGCAATG 
GAAGACATGG 
GAGTGCGTCT 
6GCAAATTCT 
GGAAATGGTG 
TGTGACTGTT 
CGGGGCATCT 
TGTGAGATGT 
AGAGCCTTCA 
ATTACCAAGG 
CATTGTAAGG 
AACAACGAGG 
ATTCCAATTG 
ATATGGAAGC 
AAAATGAATG 
GTGGTCAATC 
ATGCAAAGTA 
TTTACTCATG 
TTTTGAAAAT 
TATTCTTGTC 
ATCAAGCTTA 
ATGAGCAT6A 
GTCAGTTTGC 
TTAAATCTGT 
AGTATGTTGA 
GGAAAAATTC 
AAGAGTTACT 
AAAAGAACCG 
CATACTTTAC 
TTATTTTGTT 
TGCAATTTTG 
TTGCCTTTTT 
GAGTCTTACT 
GTTGCCCATC 
TTCA 



21 
I 

ATAGGGAAGA 
GCCCCGGACG 
GTTCAGTTTG 
AATCATGTGG 
TTTTACAG6A 
AGGGTTGCCC 
AAAATGTAAC 
AGATCCAACC 
TAAAATTCAA 
ATTCAATGAA 
TGAG6AGGAT 
TGCCTTACAT 
GCACCACCCC 
ATGAACTTGT 
ATGCCATCAT 
TGCTGGTGTT 
TTGTTTTACC 
ATTATGATTA 
CAATTTTTGC 
CTAAGTCAGC 
ATGCATACAA 
TAACAATAAG 
GAAAATGTTC 
ATAAGTGTCC 
AAGTAGAGGT 
AAA6TCCCAA 
AAGGGCGTGT 
ATGCTTACTG 
6CGGACAGTG 
6CGAGT6TGA 
TTTGCAAGTG 
CTTTGGATAC 
GCGAGTGTGG 
GTCAGACCTG 
ATAAAGGAGA 
TAGAAAGTCG 
AGAAGGATGT 
TCATGGTTCA 
TAGCTGGTGT 
TTTTAATGAT 
CCAAATGGGA 
CGAAGTATGA 
GCAATTTCCA 
TGCAG6TTTT 
AAT6TTGTAA 
AGCTAAG6TC 
TTG6ATTAAG 
TGAGAGTTTC 
AGTTAT6CAG 
TATTTTGCTA 
GAGTTGCTGG 
AGAGAGTTAG 
TAATGTTTGG 
AGCAATTTTC 
AAAGTATTTG 
TAATGTCTGG 
GGGTAAGACT 
AATGAACATG 
TTGAGTTAGT 
TT6TTTCACA 



31 
I 

ACAGCACCCC 
CCGCGCGGAA 
CTGTGTGTTT 
AGAATGTATA 
AGGAATGCCT 
TCCAGATGAC 
CAACCGTA6C 
ACAGCAGTTG 
GAGAGCTGAA 
AGACGATTTG 
TACTTCGGAC 
TAGCACAACA 
ATTTAGCTAC 
TGGAAAACAG 
GCAAGTTGCA 
TTCCACAGAT 
AAATGATGGA 
TCCTTCTATT 
AGTTACTGAA 
AGTAGGAACA 
TTCCCTTTCC 
TTACAAATCT 
CAATATTTCC 
AAAAAAGGAT 
TATTCTTCAG 
GTGTCATGAA 
TGGTAGACAT 
CAGGAAAGAA 
TGTTTGTAGG 
TAATTTCAAC 
TCGTGTGTGT 
TAGTACTTGT 
TGTCTGTAAG 
CCTTGGTGTC 
AAAGAAAGAC 
GGACAAATTA 
TGACGACTGT 
TGTTGTGGAG 
GGTTGCTGGA 
AATTCATGAC 
CACGGGTGAA 
GGGAAAAT6A 
TA6TCACAGT 
GAAAAT6TAC 
TTCATGCCAG 
ACATTGTGCC 
TGATATTTCT 
TGTTAATCAT 
AATCCAAAGT 
TTTGCCTGTT 
TGTAAAATAC 
GAAGGAAAAA 
TAACTTTTAT 
TGCTAAAAAG 
CTGAATGGGG 
TGCTTTCTAT 
TTTTTATGAG 
TGAAGTTATA 
GCCATAACAG 
CTAGTCACAT 



41 
I 

GGCGCCGATT 
AA GATGA ATT 
GCTCAAACAG 
CAAGCA6GGC 
ACTTCTGCAC 
ATA6AAAATC 
AAAGGAACAG 
GTTTTGCGAT 
GACTATCCCA 
GAGAATGTAA 
TTCAGAATTG 
CCAGCTAAGC 
AAAAATGTGC 
CGCATATCTG 
GTTTGTGGAT 
GCCGGGTTTC 
CAATGTCACC 
GCTCACCTTG 
GAATTTCAGC 
TTATCTGCAA 
TCAGAAGTCA 
TACTGCAAGA 
ATTGGAGATG 
TCTGACAGCT 
TACATCTGTG 
GGAAATGGGA 
TGTGAATGCA 
AACAGTTCAG 
AAGAGGGATA 
TGTGATAGAT 
GAGTGCAACC 
GAAGCCAGCA 
TGTACAGATC 
TGTGCTGAGC 
ACATGCACAC 
CCCCAGCCGG 
TGGTTCTATT 
AATCCAGAGT 
ATTGTTCTTA 
AGAAGGGAGT 
AATCCTATTT 
GTACTGCCCG 
TAGGTAGCTT 
AATATGTATA 
GGACTGACAA 
TTTTTGACCT 
ATAGCGATTG 
6TATTAAAAC 
AAATGTCCT6 
AGACATGACT 
GTTTGAAATA 
CCAATAGCTT 
GCCTTCACTT 
TCCTTGATTT 
ACCTTTTGAG 
CACCTCTTCT 
TACTTTTTCT 
CTGTGGCTAT 
ACCACTGTAT 
TCTTGTTTTA 



51 
I 

GCCGTACCAA 
TACAACCTUVT 
ATGAAAATAG 
CAAATTGT6G 
GATGTGATGA 
CCAGAGGCTC 
CAGAGAAGCT 
TjUVGATCAGG 
TTGACCTCTA 
AMGTCTT66 
GATTTGGCTC 
TCAGGAACCC 
TCAGTCTTAC 
GAAATTTGGA 
CACTGATTGG 
ACTTTGCTGG 
TGGAAAATAA 
TCCAGAAACT 
CTGTTTACAA 
ATTCTAGCAA 
TTTTGGAAAA 
ACGGGGTGAA 
AGGTTCAATT 
TTAATUVTTAG 
AATGTGAATG 
CATTTGAGTG 
GCACAGATGA 
AAATCTGCAG 
ATAGAAATGA 
CCAATGGCTT 
CCAACTACAC 
ACGGACAGAT 
CGAAGTTTCA 
ATAAAGAATG 
AGGAATGTTC 
TCCAACCTGA 
TTACGTATTC 
GTCCCACTGG 
TTGGCCTTGC 
TTGCTAAATT 
ATAAGAGTGC 
TGCAAATCCC 
TAGGGCAATA 
ATTTTTAAAA 
AA6ACTTGAG 
TTTCTTCCTG 
AAAGGGCAAT 
TGATTTTTAG 
CTAGCTAGTT 
GAT6ACATAT 
GTTGATCTAC 
TAAAACCTGT 
TACAAATTCA 
AGCACTATTT 
TTGAATTTAT 
AATCTTTTAA 
TTGT^GTTTT 
GCAACAGCTC 
GTTTACTTCT 
AGTGCCTTTA 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
640 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1600 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 



Seq ID NO: 8 Protein sequence: 
Protein Accession #: NP 002202 



75 I 



11 



21 



31 
I 



41 



51 



MNIiQPIFWIG LISSVCCVFA QTDENRCLKA NAKSCGECIQ AGPNCGWCTN STFLQEGMPT 60 



182 
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10 



15 



SARCDDLEAL 
LRIiRSGBPQT 
RIGFGSFVEK 
ISGNLDSPEG 
CHLENNMYTM 
SANSSNVIQI* 
GDEVQFBISI 
NGTFECGACR 
RDMTNEIYSG 
ASNGQICNGR 
CTQECSYFNI 
PBCPTGPDIZ 
PIYKSAVTTV 



KKKGCPPDDI 
FTLKFKRAED 
TVMPYISTTP 
GFDAIMQVAV 
SHYYDYPSIA 
IIDAYMSLSS 
TSNKCPKKDS 
CKEGRVGRHC 
KFCBCDNFNC 
6ICECGVCKC 
TKVESRDKIiP 
PIVAGWAGI 
VKPKYBGK 



ENPRGSKDZK 
YPIDLYYLMD 
AKIiRNPCTSE 
CGSIiIGWRNV 
HLVQKLSENN 
EVILBNGKLS 
DSPKIRPLGF 
ECSTDEVNSE 
DRSNGLICGG 
TDPKFQGQTC 
QPVQPDPVSH 
VLXGIiAIiliLI 



KNKNVTHRSK 
IiSYSMKDDLB 
QNCTSPPSYK 
TRIJiVPSTDA 
XQTZFAVTEB 
EGVTISYKSY 
TEEVEVILQY 
DMDAYCRKEN 
NGVCKCRVCE 
BMCQTCLGVC 
CKEKDVDDCW 
WKLLMIIHDR 



GTAEKIjKPED 
NVKSLGTDLM 
NVLSLTNKGE 
GFHFAGDGKI. 
FQPVYKELKN 
CKN6VNGT6B 
ICECECQSE6 
SSEICSNNGE 
CNPNYTGSAC 
AEHKBCVQCR 
FYFTYSVNGN 
REFAKFEKEK 



ITQIQPQQIiV 
NEMRRITSDF 
VFNELVGKQR 
GGIVIiPNDGQ 
LIPKSAVGTL 
NGRKCSNXSI 
IPESPKCHEG 
CVCGQCVCRK 
DCSLDTSTCE 
APNKGEiCKDT 
NEVMVHWEN 
MNAKWDTGEN 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



Seq ID NO: 9 Nucleotide sequence; ' 
Nucleic Acid Accession #:NM_002425 

Coding sequence: 23 .'.1453 (underlined sequ^ces correspond to start and stop codons) 
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25 
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40 
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50 



AAAGAAGGTA 
AGTCTGCTCT 
TGCCCAGCAA 
AAAGGACAGT 
GGTGACAGGG 
TCCTGACGTT 
TACATACAGG 
TGAGAAAGCT 
AGGAGAGGCT 
TGATGGCX:CA 
TATTCACTTT 
CGTTGCTGCT 
TTTGATGTAC 
TGATGTGAAT 
GGTGCCCACA 
GTCCTTCGAT 
TTGGCGAAGA 
CTCTCTTCCA 
TTTTAAAGGA 
AGGCATCCAT 
CAAGGAAAAG 
TAGCCAGTCC 
GCCTAAGGTT 
ACAGTTTGAG 
GTTACATTGC 
ATTATTCATC 
GAAGAAGAT6 
ACTTGCTTTT 
ATGTATTTTC 
CTT 



AGGGCAGTGA 
GCCTATCCTC 
TACCTAGAAA 
AATCTCATTG 
AAGCTAGACA 
G6TCACTTCA 
ATTGTGAATT 
CT6AAAGTCT 
GATATAATGA 
GGACACAGTT 
GAT6ATGATG 
CATGAACTTG 
CCACTCTACA 
GGCATTCAGT 
AAATCT6TTC 
GCCATCAGCA 
TCCCACTGGA 
TCATATTTGG 
AATGAGTTCT 
ACCCTGGGTT 
AAGAAAACAT 
ATGGAGCAAG 
GAT6CTGTAT 
TTTGACCCCA 
TAGGCGAGAT 
TAATGTATTA 
A6CCTTGCAG 
GAATTGCACT 
ATAGATGTGT 



GAATGATGCA 
TGAGTGGGGC 
AGTACTACAA 
TTAAAAAAAT 
CTGACACTCT 
GCTCCTTTCC 
ATACACCAGA 
66GAAGAGGT 
TCTCTTTCGC 
TGGCTCATGC 
AAAAATGGAC 
GCCACTCCCT 
ACTCATTCAC 
CTCTCTACGG 
CTTCGGGATC 
CTCTGAGG6G 
ACCCTGAACC 
ATGCTGCATA 
GGGCCATCAG 
TTCCTCCAAC 
ACTTCTTTGC 
GCTTCCCTAG 
TACAGGCATT 
AT6CCAGGAT 
AG6GGGAAGA 
TGAGCCAAAA 
ATATCTGCAT 
GAACA6AATT 
TATTACTTCC 



TCTTGCATTC 
AGCAAAAGAG 
CCTCGAAAA6 
CCAAGGAAT6 
G6AGGTGATG 
TGGCATGCCG 
TTTGCCAAGA 
GACTCCACTC 
A6TTAAAGAA 
CTACCCACCT 
AGAAGATGCA 
GGGGCTCTTT 
AGAGCTCGCC 
ACCTCCCCCT 
TGAGATGCCA 
AGAATATCTG 
TGAATTTCAT 
TGAAGTTAAC 
AGGAAATGAG 
CATAAGG7UU\ 
AGCX3GACAAA 
ACTAATAGCT 
TGGATTTTTC 
GGTGACACAC 
CA6ATATGG6 
TGGTTAATTT 
GTGTCATGAA 
AAGAAATACT 
TCAATAAAAA 



CTT6TGCTGT 
GAGGACTCCA 
GATGTGAAAC 
CAGAAGTTCC 
CGCAAGCCCA 
AA6T6GAGGA 
GATGCTGTTG 
ACATTCTCCA 
CAT6GAGACT 
GGACCTGGGC 
TCAGGCACCA 
CACTCAGCCA 
CAGTTCCGCC 
GCCTCTACTG 
GCCAAGTGTG 
TTCTTTAAAG 
TTGATTTCTG 
AGCAGGGACA 
GTACAAGCAG 
ATTGATGCAG 
TACTGGAGAT 
GATGACTTTC 
TACTTCTTCA 
ATATTAMGA 
TGTTTTTAAT 
TTCCTGCATQ 
GAATGTTTCT 
CATGTGCAAT 
GTTTTATTTT 



TGT6TCTGCC 
ACAA6GATCT 
AGTTTAGAAG 
TTG6GTT6GA 
GGTGTG6AGT 
AAACCCACCT 
ATTCTGCCAT 
GGCTGTATGA 
TTTACTCTTT 
TTTATGGAGA 
ATTTATTCCT 
ACACTGAAGC 
TTTCGCAAGA 
AGGAACCCCT 
ATCCTGCTTT 
ACAGATATTT 
CATTTTGGCC 
CCGTTTTTAT 
GTTATCCAAG 
CTGTTTCTGA 
TTGATGAAAA 
CAGGAGTT6A 
GTGGATCATC 
GTAACAGCTG 
AAATCTAATA 
TTCTGTGACT 
GGAATTCTTC 
AGGTGAGAGA 
GG6CCTGTTC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 



55 



Seq ID NO: 10 Protein sequence: 
Protein Accession #: NP 002416 



60 



65 



1 
I 

MMHIiAFLVLL 
KKIQGMQKFL 
TPDLPRDAVD 
AHAYPPGPGL 
SFTEIiAQFRIj 
LRGEYLPFKD 
AIRGNEVQAG 
FPRLIADDFP 



11 
I 

CLPVCSAYPIi 
GLEVTGKLDT 
SAIEKALKVW 
YGDIHFDDDE 
SQDDVNGIQS 
RYFWRRSHWN 
YPRGIHTIiGF 
GVEPKVDAVIj 



21 
I 

SGAAKEEDSN 
DTIiEVMRKPR 
EEVTPIiTFSR 
KWTEDASGTN 
LYGPPPASTB 
PEPBFHX*ISA 
PPTIRKZDAA 
QAFGPPYPFS 



31 
I 

KDLAQQYLEK 
CGVPDVGHFS 
LYEGEADIMI 
LFIiVAAHELG 
EPLVPTKSVP 
EWPSLPSYLD 
VSDKEiCKKTY 
GSSQFBFDPN 



41 

I 

YYNIjEKDVKQ 
SFPGMPKWRK 
SFAVKEHGDF 
HSLGIiFHSAN 
S6SEMPAKCD 
AAYEVNSRDT 
FFAADKYWRF 
ARMVTHZLKS 



51 

I 

FRRKDSNLIV 
THLTYRIVNY 
YSFDGPGHSL 
TEAIl^aYPIJYN 
PAIiSFDAIST 
VFIPKGNBFW 
DENSQSMBQ6 
NSWLHC 



60 
120 
180 
240 
300 
360 
420 



70 



75 



Seq ID NO: 11 Nucleotide sequence; 
Nucleic Acid Accession #: XM_058189 

Coding sequence: 16 9.. 774 (underlined sequences correspond to start and stop codons) 
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GAAGACCAGC TCAGCTCTTC AGTTGTTGAT CATTGTCTAT TGTTCTCCAA ACAGTAAACC 60 



183 



BNSDOCia <WO 02079A92A2Jj> 



wo 02/079492 



PCT/IIS02/04915 



AGTATTTCAC ACTGAGATTG TCGGCTGCGG GTATATTCCA ATTCCCCGTC TCCTCATGAA 120 

TATGAAGTGA AGGGCTCTGA CCCTGGAAGT GGTTCTJtVAGC AGGGCAA AAT GG GGTCTCGG 180 

AAGTGTGGAG GCTGCCTAAG TTGTTTGCTG ATTCCGCTTG CACTTTGGAG TATAATCGTG 240 

AACATATTAT TGTATTTCCC GAATGGGCAA ACTTCCTATG CATCCAGCAA TAAACTCACC 300 

AACTACGTGT GGTATTTTGA AG6AATCTGT TTCTCAGGCA TCATGATGCT TATAGT7UVCA 360 

ACAGTTCTTC TGGTACTG6A GAATAATAAC AACTATAAAT GTTGCCAGAG TGAAAACTGC 4 20 

AGCAAAAAAT ATGTGACACT GCTGTCAATT ATCTTTTCTT CCCTCGGAAT TGCTTTTTCT 4 80 

GGATACT6CC TGGTCATCTC TGCCTTGGGT CTTGTCCAAG GGCCATATTG CCGCACCCTT 540 

GATGGCTGGG AGTATGCTTT TGAAGGCACT GCTGGACGTT TCCTTACA6A TTCTAGCATA 600 

TGGATTCAGT GCCTGGAACC TGCACATGTT GTGGAGTG6A ACATCATTTT ATTTTCCATT 660 

CTCATAACCC TCAGTGGGCT TCAAGTGATC ATCTGCCTCA TCAGAGTAGT CATGCAACTA 720 

TCCAAGATAC TGTGTGGAAG CTATTCAGTG ATCTTCCAGC CTGGAATCAT TTGAA TAAGG 780 

ACAAAATGTT TTCCATTATC AAGACATGGC CATCTATCTA AATATTATAT CAACTGTGTA 840 

GACTTGAGGG CAATATTGAA ATGATGGTGC TTTCTGCATT TGGTGTTTAT TTGTAAAAAA 900 

TTTGCAGTCC TCACT6CACA T6CAAGTATA CCACCCTTCC ATTTAGTATG TTTTTTAAGT 960 

AATATGCATC AGA7VACTTCA GAAATACTTC TGCCCTTTGA TCAAACAAAT CCATTTCCAA 1020 

GAATCTGTAC TAGGGAAGTA AATAAGAATA TGAGAGA/U^C CTTTATGCAA ATATGTATAT 1080 

TGCAACATTA TTTAATATTC TGGAAAATTG GAAACACCCC AAAATTCTAA ACTCAGAGGA 1140 

AGGATTAAGT AAAGAGTGGT ACATACTGTA AATGTTTTCT GATATTA7VAA AAAAAATTAA 1200 
ATAAAAAATA AAGAGTACTA CATGGTTGTA AAA 

Seq ID NO: 12 Protein sequence: 
Protein Accession XP 058189 



1 11 21 31 41 51 

I I I I i I 

MGSRKCGGCIi SCI*LIPIAI»W SIIVNIIiLYF PNGQTSYASS NKLTNYVWYF EGICPSGIMM 60 
IjIVTTVLLVIi ENNNNYKCCQ SENCSKKYVT LLSllFSSIiG lAFSGYCLVI SALGLVQGPY 120 
CRTLDGWBYA PEGTAGRFLT DSSIWIQCLE PAHWEWNIl IiFSILITLSG LQVIICIilRV 180 
VMQLSKIIiCG SYSVIFQPGI I 



Seq ID NO: 13 Nucleotide secmeneer 
Nucleic Acid Accession #: NM_005397 

Coding sequence: 251.. 1837 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

AAACGCCGCC CAGGACGCAG CCGCCGCCGC CGCCGCTCCT CTGCCACTGG CTCTGCGCCC 60 

CAGCCCGGCT CTGCTGCAGC GGCAGGGAGG AAGAGCCGCC GCAGCGCGAC TCGGGAGCCC 120 

CGGGCCACAG CCTGGCCTCC GGAGCCACCC ACAGGCCTCC CCGGGCGGCG CCCACGCTCC 180 

TACCGCCCGG ACGCGCGGAT CCTCCGCCGG CACCGCAGCC ACCTGCTCCC GGCCCAGAGG 24 0 

CGACGACACG ^TGCGCTGCG CGCTGGCGCT CTCGGCGCTG CTGCTACTGT TGTCAACGCC 300 

GCCGCTGCTG CCGTCGTCGC CGTCGCCGTC GCCGTCGCCG TCX5CCCTCCC AGAATGCAAC 360 

CCAGACTACT ACGGACTCAT CTAACAAAAC AGCACCGACT CCAGCATCCA GTGTCACCAT 420 

CATGGCTACA GATACAGCCC AGCAGAGCAC AGTCCCCACT TCCAAGGCCA ACGAAATCTT 4 80 

GGCCTCGGTC AAGGCGACCA CCCTTGGT6T ATCCAGTGAC TCACC66GGA CTACAACCCT 54 0 

GGCTCAGCAA GTCTCAGGCC CAGTCAACAC TACCGTGGCT AGAGGAGGCG GCTCAGGC7UV 600 

CCCTACTACC ACCATCGAGA GCCCCAAGAG CACAAAAAGT GCAGACACCA CTACAGTTGC 660 

AACCTCCACA GCXTACAGCTA AACCTAACAC CACAAGCAGC CAGAATGGAG CAGAAGATAC 720 

AACAAACTCT GGGGGGAAAA GCAGCCACAG TGTGACCACA GACCTCACAT CCACTAAGGC 780 

AGAACATCTG ACGACCCCTC ACCCTACAAG TCCACTTAGC CCCCGACAAC CCACTTTGAC 840 

GCATCCTGTG GCCACCCCAA CAAGCTCGG6 ACATGACCAT CTTATGAAAA TTTCAAGCAG 900 

TTCAAGCACT GTGGCTATCC CTGGCTACa^C CTTCACAAGC CCGGGGAT6A CCACCACCCT 960 

ACCGTCATCG GTTATCTCGC AAAGAACTCA ACAGACCTCC AGTCAGATGC CAGCCAGCTC 1020 

TACGGCCCCT TCCTCCCAGG AGACAGTGCA GCCCACGAGC CCGGCAACGG CATTGAGAAC 1080 

ACCTACCCTG CCAGAGACCA TGAGCTCCAG CCCCACAGCA GCATCAACTA CCCACCGATA 1140 

CCCCAAAACA CCTTCTCCCA CTGTGGCTCA TGAGAGTAAC TGGGCAAAGT GTGAGGATCT 1200 

TGAGACACAG ACACAGAGTG AGAAGCAGCT CGTCCTGAAC CTCACAGGAA ACACCCTCTG 1260 

TGCAGGGGGC GCTTCGGATG AGAAATTGAT CTCACTGATA TGCCGAGCAG TCAAAGCCAC 1320 

CTTC7WVCCCG GCCCAAGATA AGTGCGGCAT ACGGCTGGCA TCTGTTCCAG GAAGTCAGAC 1380 

CGTGGTCGTC AAAGAAATCA CTATTCACAC TAAGCTCCCT GCCAAGGATG TGTACGAGCG 1440 

GCTGAAGGAC AAATGGGATG AACTAAAGGA GGCAGGGGTC AGTGACATGA AGCTAGGGGA 1500 

CCAG6GGCCA CCGGAGGAGG CCGAGGACCG CTTCAGCATG CCCCTCATCA TCACCATCGT 1560 

CTGCATGGCG TCATTCCTGC TCCTCGTGGC GGCCCTCTAT GGCTGCTGCC ACCAGCGCCT 1620 

CTCCCAGAGG AAGGACCAGC AGCX36CTAAC A6AGGAGCTG CAGACAGTGG AGAATGGTTA 1680 

CCATGACAAC CCAACACTGG AAGT6AT6GA GACXTTCTTCT GAGATGCAGG AGAAGAAGGT 1740 

GGTCAGCCTC AACGGGGAGC TGGGGGACAG CTGGATCGTC CCTCTGGACA ACCTGACCAA 1800 

GGACXJACCTG GATGAGGAGQ AAGACACACA CCT CTAGT CC GGTCTGCCGG TGGCCTCCAG 1860 

CAGCACCACA GAGCTCCAGA CCAACCACCC CAAGTGCCGT TT66ATGGGG AAGGGAAA6A 1920 

CTGGGGAGGG AGAGTGAACT CC6A6GGGTG TCCCCTCCCA ATCCCCCCA6 GGCCTTAATT 1980 

TTTCCCTTTT CAACCTGAAC AAATCACATT CTGTCCAGAT TCCTCTT6TA AAATAACCCA 2040 

CT AGTG CCTG AGCTCAGTGC TGCTGGATGA TQAGGGAGAT CAAGAATU^G CCACGTAAGG 2100 

GACTTTATAG ATGAACTAGT GGAATCCCTT CATTCTGCAG TGAGATTGCC GAGACCT6AA 2160 

6AGGGTAAGT GACTTGCCCA AGGTCAGAGC CACTTGGTGA CAGAGCCAGG ATGAGAACAA 2220 
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AGATTCCATT 
CCC666CAGG 
CTG6CTCGGT 
CAGCCTCGTG 
TGTTCCCACT 
ACCCCTTGTT 
CACAGGTCTC 
AGGGGACACT 
CTGGCCCATT 
TTTCCACCAG 
CACTGCTCAG 
GAGAGGCCTT 
GCCAGGAAGG 
TGAGAGTTTT 
AGACAGTTGG 
GAAAAGAGGG 
GACTGTTGTG 
CTGTGAGCCT 
CACAAAAGCT 
TGTAGAA6GC 
CTGTGTGGAA 
ATCAAGTAGG 
AC6CCACACC 
GCTATCCACT 
GTCCTGCACC 
GAAAAGGAAG 
GCGAGGCTAA 
6ATGGAGAAC 
TTTGCTTCCT 
CCGTGGGAGA 
GCCAGGGCTT 
CTATGGCCTC 
AGGTCGGTCC 
CTGGGCTGAT 
GACAGCTTCA 
GCCAAACAGC 
TGCTCTCTGA 
TCAGCATGAC 
ACTGTGGTTT 
CTACTGTCAT 
TTTGCGTTGT 
TTGATAAGGA 
ACAATATACT 
TTGATAATGT 
TTTGAAAAAG 
AATGTCTTAC 
TATAACAAGA 
TTTTTAATTT 
AAAG6TTGGA 
TTCAAGATCT 
AGTGACTTCC 
GACTGTGCGC 
ATGACACAAA 
GAACACAAAC 
TAAGTGAACA 
GAACTTACTC 
AAAGAGTAAA 
GCAAACACAG 
' ATGTTCAACA 
AAGTACACTG 
TTTTCTTACA 



TGCACCATGC 
GGTGAAACTC 
GGGATCT6AC 
TACCCATCTT 
TACTCTGTCC 
CTGGATTCGG 
CGTGCGTGAA 
CGAGTCCAGG 
GCCCCTGGCPi 
GGACTAAAAT 
CTGATGGGAG 
CCCGGGGCCC 
ACAGGCTTTC 
GTTATAGTTG 
CAGACGTCCA 
CCTGG7UUVGC 
GGAACGTCCC 
AGTGGGGCTG 
ATGTCTATTC 
TCCTAA6CA6 
TGTTGAACTT 
AAAATGG6CA 
TCCAGGGTCT 
CGG6ATGGCT 
ATCTGCCTAA 
GCCAGGGGTT 
CAAGTTCCAA 
AGTGCCTGTT 
TCTTGCCCAG 
GGCCAGACTG 
TTGACGGAGA 
TTGGAGACAG 
TGACTCTCCT 
TCAGAGGTTA 
GAATCTGACA 
CATGCATCTA 
TAATTCAGAG 
TTAAACAGGT 
CCCCAAGCTG 
GA6AGTGGGG 
AA6ACAOAAT 
AAGCTAGCAG 
G6AGAAACTT 
AAATACAGTG 
CATTATGTTA 
TGGAAATGAC 
CAAACTTATG 
TAAAATGCAA 
GTCAATATGC 
AGTCCAATCT 
TCAAAATCAC 
CTCAGAAGGA 
GGACAGAATT 
ATAAGAACTG 
ATTCTTTCTT 
CAACAGGACT 
ACTGTAGCAT 
CATGGAGGAC 
GTTTGCCCAG 
GAATTTACTG 
AAAATATATT 



CACACTGCTG 
CAGCAGGTGG 
GACCTGAAAG 
CCCACCCTCT 
TGGGGCCTCT 
ATTTTCTCAC 
CCAGGTCTGC 
CTGGTATCTC 
CTCCTTGCAG 
CTGCTGTAGC 
TGATTCCCTG 
CCCATCAGCG 
CCCTGTATAT 
TTTTCTAACC 
GTTAGCTCAT 
GCAGTGCATG 
ACTGTCCTGG 
CTGCGGTTCT 
CCCAATATAG 
TGG6CAT6GT 
TCGCCATGGT 
GAACTGCTTA 
TAAGAGTCAG 
TACTTTTTAA 
TTCCTTCCTC 
CACATA6GGC 
AAACATCTGC 
TGGGGGAAAA 
CGCTGGGTTC 
GATTATTCCT 
CAGCAAATAG 
ATGATGACTG 
GACTCTCCAT 
CTGCTTTATA 
CCTTGCCTTG 
TAAAGGTCAT 
AGCATTGGGT 
TGTAGCAAAG 
CACAGGAGGC 
AGACA6GCA6 
ACGGGTTTAA 
AAAGTTTATT 
TGAAGAACAA 
ACCATGTTAA 
GCT6AGTGAT 
AAGTTTTTGC 
ATAAAGTATT 
CCCTGCCCCC 
TCTGGTTGGC 
TTTTCTAGAG 
ATGGTTCAGG 
ATAATCGGTA 
CCTTTCCCAG 
GTCTTCTCAC 
TCTGCCAAGA 
GAGGGACCAA 
AGCTTTTGTC 
ACAGATGACT 
GAACTGGGGG 
AGAAACTTGT 
TTGGAAAATT 



TGTTCACATG 
CTGGGCTGGA 
TCCAGCTCCC 
ATGTTCTTAC 
GGGATTAGCA 
ATTTGCTTCG 
TTAGGGGACC 
AGGGCAGCTG 
AGGCCACCCA 
TAGTGAGAGC 
AGACCCAGTA 
ATGTGTCTTC 
CATAGGAAAC 
CAGCCCTCCA 
CTCACTCACT 
CTGGGTGCAT 
CCTG6AGCTA 
CTTGCAGTTT 
GACTTTTATG 
GAGGTTTCAT 
CTCCATCGTT 
GGGGAATGAA 
GCTCCG6CTG 
AAGGGTAGGG 
ACAGTCTGTA 
CCCAGCGAGT 
CCCGATGCTC 
CAGGAAATCT 
TCTCCACCCA 
CCTTTGCTGA 
GCCTCTGCAA 
GCAAGGACTA 
CGCTCTGTCC 
TTCGTCCAAA 
CTCTTGCCAC 
CATCTTCTGC 
CTGGGAAGAG 
ACAGTTTATC 
CAGAAACCAC 
CAAAGCTTAT 
TCTAGTCTAQ 
TAAACCACTT 
GTTCAAACTG 
CCTACCCTGC 
GGCCAAGTTT 
TTGATTTTTT 
TGTCTTGTAG 
TCCCCAGCAA 
AGGCAACCCT 
AAAAAGATAA 
ACAGAAACAA 
AATTAAGAAT 
TTGTTACCCT 
ACTTTCTCTG 
AACAAAGTTT 
GGAAACATGA 
ACGGTCACTA 
CTTTGGTGTT 
ATCATATATG 
TTGTAAAAAC 
GTATACTGTC 



TGCCTTCCGT 
AAGGAGGGCA 
AAGTTTTCCT 
CCCTCCCTAC 
CAGGTTATTC 
TGAGATGGGG 
TGCGTGCAGG 
ATGAGGGGTC 
CGATCTTCTT 
AGCGTGTTCC 
TGAAAGAGCA 
AGAGACAATC 
TCAGGGACAT 
CTGCCAAAGG 
CTGATTCTCC 
GAAGGGCAGC 
GGCCTTGCTG 
CTGGTGGCAT 
GGCTCGGCAG 
CT6ATTGA6A 
CT6G6C6TAA 
ATTGCCATTT 
TAGTAGCTCT 
GGAGGG6CTG 
GCCATCTGAT 
TTCCCAGGAG 
TAGTGTTTGG 
TGTTAGGCTT 
GTAGGTTTTC 
TCCTGGGTCA 
ATCAATCAAA 
GAGAGCAGGA 
AAGGAGAACC 
CTGTGTTAGT 
CAGGACACCT 
CACCTTTACT 
GTAAGAGGAA 
ATCAACTCTT 
AAGTATGATG 
6AAGGA6GTA 
GCRCCAGATT 
CTTGAGCJTTT 
ATACATATAC 
ACTGCTTTAA 
TTTCTCTGGA 
TTTTTAAACA 
ATCAGGTGTT 
AGTCACAGCT 
GTAGTCATGG 
TCTGAAGCTC 
GATTAAAACC 
TGCTACTCGA 
AGCAAGGCTA 
AATCATTTAG 
TGGATGAGCT 
TGGGGGAGGC 
GCTGATCCCT 
GGTCTTTTTG 
TCTTAGTGGA 
TATAGTTAAT 
AATTAAAGT 



CCAGAGCAGT 
GGGCTACATC 
TCTCCTACCC 
ACTCAGTGTT 
ATAACCTTGA 
GCTTAACCCA 
TGAGGAGAGA 
AGCAGGAACA 
TGGGCTTCCA 
TTTTGTTGTT 
GTGGCTGCAG 
CATTAAAGCA 
TTCAAGTTGC 
CCAAAAGCTC 
TGTGCCACAG 
CTGGGGGACA 
TTCCTCTTCT 
CTCAGGGGAA 
TTAGCT6CCA 
AGGGGGAATC 
ATTCCCTGGG 
TTCGGGTGAA 
GAT6AAATAG 
GGGAAGATCT 
ATCCTAGGGG 
TTAGAGGGAT 
AGGTGGGCAG 
GAGTGAGGTG 
TGTTGTGGTC 
CACTTCACCA 
GGCTGCAACC 
GTGCCTGGCC 
CGGAGAGGCT 
CTAGGCTTAG 
ATGTCAACAG 
GGGTTCTAAA 
CACTAGAAGC 
TCAGTGGTAA 
ACTAG6AA6C 
CAGAATATTC 
TTTTTCCCGC 
ATCTTTTTTQ 
ACATATTTTT 
6T6AACATAC 
CA6GAATGTA 
AAAAATGAAA 
TTGTTTTGTT 
CCATTTCAGT 
AGAAAGGTAT 
ACAAAGATGA 
TGGATCCACA 
AGGTGCCAGA 
GGGAGGGCAT 
GTTTAAGATG 
TTTATATATG 
AA6AGAGGGC 
CAGGTCTGCT 
TCTGCAGTGA 
CAGGGGTCTG 
AATTATTGCA 
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Seq ID NO: 14 Protein sequence; 
Protein Accession #: NP 0053B8 



MRCALALSAIi 

dtaqqstvpt 

TIESPKSTKS 
TTPHPTSPLS 
VISQRTQQTS 
PSPTVMESN 
AQDKCGIRIiA 
PEEAEDRFSM 
PTLEVMETSS 



11 
I 

laLLLSTPPLIt 
SKANEILASV 
ADTTTVATST 
PRQPTLTHPV 
SQMPASSTAP 
WAKCEDLETQ 
SVPGSQTVW 
PIiIITIVCMA 
EMQEKKWSL 



21 

1 

PSSPSPSPSP 
KATTLGViSSD 
ATAKPNTTSS 
ATPTSSGHDH 
SSQETVQPTS 
TQSEKQIiVLN 
KEITIHTKIjP 
SFLIiLVAAXiY 
NGELGDSHIV 



31 
I 

SPSQNATQTT 
SPGTTTIiAQQ 
QNGAEDTTNS 
LMKISSSSST 
PATALRTPTIj 
LTGNTLCAGG 
AKDVYERLKD 
GCCHQRIiSQR 
PLDNLTKDDIi 



41 
I 

TDSSNKTAPT 
VSGPVNTTVA 
GGKSSHSVTT 
VAIPGYTFTS 
PETMSSSPTA 
ASDEKLISLI 
KWDELKEAGV 
KDQQRLTEELi 
DEEEDTHIj 



51 
I 

PASSVTIMAT 
RGGGSGNPTT 
DLTSTKAEHL 
PGMTTTLPSS 
ASTTHRYPKT 
CRAVKATFNP 
SDMKI.GDQGP 
QTVENGYHDN 



60 
120 
180 
240 
300 
360 
420 
480 



185 



BNSOOCIO: <WO ^02079492A2_L> 



wo 02/079492 



PCT/US02/04915 



Seq ID NO: 15 Nucleotide gecruence; 
Nucleic Acid Accession #: NM_004105 

Coding sequence: 150.. 1631 (underlined sequences correspond to start and stop codons) 
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1 
I 

CTAGTATTCT 
AAAATAAAAA 
CAAGGTAACT 
CTCTGGCGCT 
ACGGATAT6A 
TCCCAGACGC 
TTCCGAAAAC 
CAGAAGGAAC 
GAGTGTTGCC 
AGACTGGCCG 
CCAACCCTTC 
GCCAAGACAT 
TCAATTTACG 
AGTGCGTAGA 
CACCAGGCTC 
CCTGCGTAGA 
TTCTTGGTTC 
ACTGTGAAGA 
ATGAACCTGG 
CATGTCAAGA 
GGAATTATCA 
TAACACCAGA 
A6TCAATAGT 
TCCAGATACA 
GAAAT6AAAA 
TCGTGAAGTC 
GCAGTATAGG 
TTTCATTTTA 
TTACCTTAAA 
ACACTCACCC 
TTCAAAGTTT 
TTCAAGAGAG 
ACCAAGCAAT 
CTCATAAGGA 
AACTGCTTTG 
•TCTGCCATAT 
ATAGTAATAA 
AGAAATTTTA 
AAAATGAACC 
AGAAAATAAA 
ATAATAACAT 
TTCACTGGAG 
AACTAAAATT 
TATTCCTGGC 
GGAATATTTT 
TTTAAATAAA 



11 
I 

ACTAGAACTG 
6GTTGAGCAG 
CTGCTAGCTA 
GGTCAAGTCA 
GTGG6ATCCT 
TTGTAAAGGT 
AGCCCAGATT 
CTCAGGGGCA 
CGGGGGTGGT 
AAATAACTTT 
CCACCGTATC 
AGACGAGTGC 
GGGATCCTTT 
CATAGATGAA 
ATTTTATTGC 
TATAAATGAA 
ATTCATCTGT 
CATTGATGAA 
GAAATTCTCA 
TATAAATGAG 
TGGCGGCTTC 
GAACCX3ATGT 
CTACAAATAC 
GGCCACAACT 
TGGAGAGTTC 
ATTATCA6GA 
GACCTTCCGC 
GTCTTTTCTA 
GCACTATTTT 
ATAACAAACA 
GTCTTTATTA 
CTAAGTATAC 
GATGATCTTC 
GGCAGCCATC 
TAAGAAAATG 
TTGTGTTGGT 
ATCATTGCTG 
GATTGTGAAT 
AGATACCCCC 
TTCCTTCTAA 
CCCTGAATTC 
AAGTTTTAAT 
TCTCATCTTC 
ATCCAGAGTG 
ATACGTCTCC 
AATAAATATT 



21 
I 

GAAGATTGCT 
CAATTCATAT 
AGATTCACAA 
CAGGACACCG 
GTGAGACAGC 
GGAATGAAGT 
ATTGTCAATA 
ACCACCGGGG 
TTTGTGGCCA 
GTCATCCGGC 
CAGTGTGCAG 
ACTGCAGGGA 
GCATGTCAGT 
TGTACCATCC 
CAGTGCAGTC 
TGTGATGCCA 
CAGTGCAATC 
TGCAGAACCT 
TGTATGTGCC 
TGTGAGACCA 
CGTTGTTATC 
GTTTGCCCAG 
ATGAGCATCC 
ATTTATGCCA 
TACCTACX3AC 
CCAAGAGAAC 
ACAAGCTCTG 
AGAGTCAACC 
ATTTATAGAT 
ATTACACCAT 
CTATATGTAA 
ACTATCTGGT 
TGTGGTGCTT 
ATAACCATTG 
GAAAAGGTCA 
TTTTATTTTC 
TACAACATGC 
ATTTTGTAAA 
TAGAAAATTA 
ACCACATTGG 
AGGCATTCAC 
TTCTAAGTAA 
GTACTTGATG 
ACAGT6AACT 
TTGTTTAAAA 
CCTTTAGAAG 



31 
I 

CTCCGAGTTT 
TACTGTCACA 
TGTTGAAAGC 
AA6AAACCAT 
AATGCAAAGA 
GTGTCAACCA 
ATGAACAGCC 
TTGTAGCTGC 
GTGCTGCTGC 
GGAACCCAGC 
CAGGCTACGA 
CGCAC7VACTG 
GCCCTCCTGG 
CTCCATATTG 
CTGGGTTTCA 
GCAATCAATG 
AAGGATATGA 
CAAGCTACCT 
CCCAGGGATA 
CAAATGAATG 
CACGAAATCC 
TCTCAAATGC 
GATCTGATA6 
ACACCATCAA 
AAACAAGTCC 
ATATCGTGGA 
TGTTAAGATT 
ACAGGCATTT 
ATATCTAGTG 
GGTATAAAGT 
ATTAGACATT 
GAAACTTGGA 
AAGGAAACTT 
AATAGCATGC 
ATAAAGATAT 
ATATCCAGCC 
TGGTTTCTGT 
AAACAGTAAG 
TACTATTGAG 
AACTGACCTG 
AA6ATGCAGA 
AATTTAAATC 
CTCACAGAGG 
TAAGCAAATT 
TCTGACTGCT 
ATCACTCTAA 



41 



GGTATTTTTG 
CCTTTTCCTA 
CACGTACACG 
TATTGATGAA 
CTATGGAGGA 
TCAGCAGGAA 
CAGCAGCATG 
AGTCGCAGGC 
TGACCCTCAG 
GCAAAGTGAA 
TAGAGCAGAC 
ATATCAGAAG 
CCACCAAAGA 
ATTGGCAGCA 
TGCTCAGCAG 
GCTAAGCAGT 
GTGTCAATAT 
CCAAGTGGTG 
CCGGGAGGAT 
TTGTCAAGAT 
CATGTGCCGA 
GTCTGTGCCA 
TACTTTTCGG 
TGTAA6TGCA 
CCTGGAGATG 
GACAATAATA 
AAGTCAGCCA 
CATCTACATC 
GGGCATTTAA 
AATCCACTAA 
TTCTTTCCTA 
ACTAGAGCTC 
AAGGGTAAGA 
ATTTCTTTAG 
TAAAGGTGGT 
AGGGTATTTT 
CAAAATTTTC 
7UUITCTATG6 
AAGAAGCAAA 
ACAT^TGGA 
CTAACACTTC 
AAG7JWUVTGA 
ACX:CTCCTAC 
TTACTTTGAT 
AA 



51 
I 

ATTTTGTTAA 
CTGTGCTGTG 
ACTATGCTGA 
CT^TGCACTG 
TGTGACATTG 
TACCTCTGCC 
ACACAACCAG 
GCAACCAGTG 
CCTGAAATGC 
CGCATTCCCT 
CACAACGTGT 
CAAGTGTGCA 
CGAGGGGAGC 
TGCGTGAATA 
AACAACTATA 
TGCTACAACA 
GACAGGCTCA 
CAATGTGTCA 
AGAAGTAGAA 
GAAATGTGTT 
CCCTACATTC 
6AACTGCCCC 
TCA6ACATCT 
ATTAAATCTG 
ATGCTTGTGC 
CTGACAGTCA 
GTG6GGCCAT 
AAGAATATTG 
TCTATACTGT 
TATGTAAAGA 
ACTGGTCTTC 
TAAAAGTGGG 
CACTAACAGT 
ATGAGTTTTT 
AAAATGGGGA 
TGTTTATTAT 
TAATTTTGTC 
CAGAATTCCC 
GGAGGATATG 
CTCGGAAAAT 
TAAAAGGTAT 
ACTAATTTAT 
TGATGGTTTT 
CCAATTCTAT 
GTATCATATT 
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Seq ID NO: 16 Protein sequence; 
Protein Accession Ut NP 004096 
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MLKALFLTML 
CVNHYGGYLC 
SAAAVA6PEM 
THNCRADQVC 
PGFQIiAANNY 
SSYIiCQYQCV 
PRNPCQDPYI 
NTINTFRIKS 
VIiRLTIIVCP 



11 

I 

TLALVKSQDT 
LPKTAQIIVN 
QTGRNNFVIR 
INI.RGSFACQ 
TCVDINECDA 
NEPGKFSCMC 
LTPENRCVCP 
GNENGEFYLR 
FSF 



21 
I 

EETITYTQCT 
NEQPQQETQP 
RNPADPQRXP 
CPPGYQKRGE 
SNQCAQQCYN 
PQGYQWRSR 
VSNAMCRBIiP 
QTSPVSAMIiV 



31 

I 

DGYEWDPVRQ 
AEGTSGATTG 
SNPSHRIQCA 
QCVDIDECTI 
ILGSFICQCN 
TCQDINECET 
QSIVYKYMSI 
LVKSIiSGPRE 



41 

i 

QCKDIDECDI 
WAASSMATS 
AGYEQSEHNV 
PPYCHQRCVN 
Q6YEI»SSDRL 
TNECREDEMC 
RSDRSVPSDI 
HIVDLEMLTV 



51 

t 

VPDACKGGMK 
GVLPGGGPVA 
CQDIDECTAG 
TPGSFYCQCS 
NCEDIDECRT 
WNYHGGPRCY 
FQIQATTIYA 
SSI6TFRTSS 
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Seq ID NO: 17 Nucleotide sequence : 
Nucleic Acid Accession #: NM_018894 

Coding sequence: 27.. 1967 (underlined sequences correspond to start and stop codons) 
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AAAACATTCA 
AGATAAAACC 
AAAGACACAT 
TGAGTGG6AT 
CGCTTGTAAA 
AACA6CCCA6 
AACCTCA6GG 
GCCCGGG6GT 
CCGAAATAAC 
TTCCCACCX3T 
CATAGACGAG 
ACGGGGATCC 
AGACATAGAT 
CTCATTTTAT 
AGATATAAAT 
TTCATTCATC 
AGACATTGAT 
TGGGAAATTC 
AGATATAAAT 
TCATGGCX5GC 
AGAGAACCGA 
AGTCTACAAA 
ACAGGCCACA 
AAATGGAGAG 
GTCATTATCA 
AGG6ACCTTC 
TTAGTCTTTT 
AAAGCACTAT 
CCCATAACAA 
TTTGTCTTTA 
GAGCTAAGTA 
AATGATGATC 
G6AG6CAGCC 
TTGTAAGAAA 
TATTTGTGTT 
TAAATCATTG 
TTAGATTGTG 
ACCAGATACC 
AAATTCCTTC 
CATCCCTGAA 
6AGAAGTTTT 
ATTTCTCATC 
GGCATCCAGA 
TTTATACGTC 
AAAAATAAAT 



11 
I 

ACAAATTAAT 
AATAGAGTGC 
GGCCTTCTTT 
CCTGTGAGAC 
G6TGGAAT6A 
ATTATTGTCA 
GCAACCACCG 
GGTTTTGTGG 
TTTGTCATCC 
ATCCAGTGTG 
TGCACTGCAG 
TTTGCATGTC 
GAATGTACCA 
TGCCAGTGCA 
GAATGTGATG 
TGTCAGTGCA 
GAATGCAGAA 
TCATGTATGT 
GAGTGTGAGA 
TTCCGTTGTT 
TGTGTTTGCC 
TACATGAGCA 
ACTATTTATG 
TTCTACCTAC 
GGACCAAGA6 
C6CACAAGCT 
CTAA6AGTCA 
TTTATTTATA 
ACAATTACAC 
TTACTATATG 
TACACTATCT 
TTCTGTGGTG 
ATCAT7VACCA 
ATGGAAAAGG 
GGTTTTTATT 
CTGTACAACA 
AATATTTTGT 
CCCTAGAAAA 
TAAACCACAT 
TTCAGGCATT 
AATTTCTAAG 
TTCGTACTTG 
GT6ACAGTGA 
TCCTTGTTTA 
ATTCCTTTAG 



21 



31 



GG6TGTAAGG 
AGAATAAGAC 
GTGTACATGA 
AGCAATGCAA 
AGTGT6TCAA 
ATAATGAACA 
GGGTTGTAGC 
CCAGTGCTGC 
GGCGGAACCC 
CAGCAGGCTA 
GGACGCAC7VA 
AGTGCCCTCC 
TCCCTCCATA 
GTCCTGGGTT 
CCAGCAATCA 
ATCAAGGATA 
CCTCAAGCTA 
GCCCCCAGGG 
CCACAAATGA 
ATCCACGAAA 
CAGTCTCAAA 
TCCGATCTGA 
CCAACACCAT 
6ACAAACAAG 
AACATATCGT 
CTGTGTTAAG 
ACCACAGGCA 
GATATATCTA 
CATGGTATAA 
TAAATTAGAC 
GGTGAAACTT 
CTTAAGGAAA 
TTGAATAGCA 
TCAATAAAGA 
TTCATATCCA 
TGCTGGTTTC 
AAAAAACAGT 
TTATACTATT 
TGGAACTGAC 
CACAAGATGC 
TAAAATTTTU^ 
ATGCTCACAG 
ACTTAAGCAA 
AAATCTGACT 
AAGATCACTC 



AACTGGAAT^ 
TCAAGTCAAG 
CATGCATTCT 
AGATATTGAT 
CCACTATGGA 
GCCTCAGCAO 
TGCCAGCAGC 
TGCAGTCGCA 
AGCTGACCCT 
CGAGCAAAGT 
CTGTAGAGCA 
TGGATATCAG 
TTGCCACCAA 
TCAATTGGCA 
ATGTGCTCAG 
TGAGCTAAGC 
CCTGTGTCAA 
ATACCAAGTG 
ATGCCGGGAG 
TCCTTGTCAA 
TGCCATGTGC 
TAGGTCTGTG 
CAATACTTTT 
TCCTGTAAGT 
G6ACCTGGAG 
ATTGACAATA 
TTTAAGTCAG 
GTGCATCTAC 
AGTGGGCATT 
ATTAATCCAC 
GGATTCTTTC 
CTTACTAGAG 
TGCAAGGGTA 
TATATTTCTT 
GCCTAAAGGT 
TGTAGGGTAT 
AAGCAAAATT 

gagaaatcta 
ctgaagaagc 
agaacaaaat 
atcctaacac 
aggaag^Caaa 

ATTACC CTCC 
GCTTTACrrTT 
TAAAA 



41 
I 

CCTGGACTCC 
TAAGTAAGGT 
CAACAATGCA 
GAATGTGACA 
GGATACCTCT 
GAAACACAAC 
ATGGCAACCA 
GGCCCTGAAA 
CAGCGCATTC 
GAACACAACG 
GACCAAGTGT 
AAGCGAGGGG 
AGATGCGTGA 
GCAAACAACT 
CAGTGCTACA 
AGTGACAGGC 
TATCAATGTG 
GTGAGAAGTA 
GATGAAAT6T 
GATCCCTACA 
CGAGAACTGC 
CCATCAGACA 
CG6ATTAAAT 
GCAATGCTTG 
ATGCTGACAG 
ATAGTG6GGC 
CCAAA6AATA 
ATCTCTATAC 
TAATATGTAA 
TAAACTGGTC 
CTATAAAAGT 
CTCCACTAAC 
AGAATGAGTT 
TAGAAAATGG 
GGTTGTTTAT 
TTTTTiATTTT 
TTCCAGAATT 
TGGGGAGGAT 
AAACTCGGAA 
6GATAAAAGG 
TTCACTAATT 
T6ATGAT66T 
TACCCAATTC 
6ATGTATCAT 



51 
I 

TACCACATGC 
TAAACACCAT 
CTGACGGATA 
TTGTCCCAGA 
GCCTTCCGAA 
CAGCAGAAGG 
GTGGAGTGTT 
TGCAGACTGG 
CCTCCAACCC 
TGTGCCAAGA 
GCATCAATTT 
AGCAGTGCGT 
ATACACCAGG 
ATACCTGCGT 
ACATTCTTGG 
TCAACTGTGA 
TCAATGAACC 
GAACATGTCA 
GTTGGAATTA 
TTCTAACAGC 
CCCAGTCAAT 
TCTTCCAGAT 
CTGGAAATGA 
TGCTCGTGAA 
TCAGCAGTAT 
CATTTTCATT 
TTGTTACCTT 
TGTACACTCA 
AGATTCAAAG 
TTCTTCAAGA 
GGGACCAAGC 
AGTCTCATAA 
TTTAACTGCT 
GGATCTGCCA 
TATATAGTAA 
GTCAGAAATT 
CCCAAAAT6A 
ATGAGAAAAT 
AATATAATAA 
TATTTCACTG 
TATAACTAAA 
TTTTATTCCT 
TATGGAATAT 
ATTTTTAAAT 



Seq ID NO: 18 Protein s equence; 
Protein Accession #: NP_061489.1 



MHSQQCTDGY 
PQQETQPAEG 
ADPQRIPSNP 
GYQKRGEQCV 
CAQQCYNIliG 
YQWRSRTCQ 
AMCREliPQSI 
PVSAMIiVIiVK 



11 
I 

BWDPVRQQCK 
TSGATTGWA 
SHRIQCAAGY 
DIDECTIPPY 
SFICQCNQGY 
DINECETTNE 
VYKYMSIRSD 
SLSGPREHIV 



21 

I 

DIDECDIVPD 
ASSMATSGVL 
EQSEHNVCQD 
CHQRCVNTPG 
BLSSDRLNCB 
CREDEMCWNY 
RSVPSDIFQI 
DLEMLTVSSI 



31 

I 

ACKGGMKCVN 
PGGGFVASAA 
IDECTAGTHN 
SPYCQCSPGF 
DIDECRTSSY 
HGGFRCYPRN 
QATTIYANTI 
GTPRTSSVTiR 



41 



51 



HYGGYLCLPK 
AVAGPEMQTG 
CRADQVCINIj 
QliAANNYTCV 
LCQYQCVNEP 
PCQDPYILTP 
NTFRIKSGNE 
LTIIVGPPSP 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 



TAQIIVNNEQ 
RNNFVIRKNP 
RGSFACQCPP 
DINECDASNQ 
GKFSCMCPQG 
ENRCVCPVSN 
NGBFYLRQTS 



60 
120 
180 
240 
300 
360 
420 



Seq ID NO: 19 Nucleotide sequence:. 
65 Nucleic Acid Accession #: NM_006S00 

Coding sequence; 27.. 1967 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

70 I I I I I I 

ACTTGCGTCT CGCCCTCCGG CCAAQ CATGG GGCTTCCCAG GCTGGTCTGC GCCTTCTTGC 60 

TCGCC6CCTG CTGCTGCTGT CCTCX5CGTCG CGGGT6TGCC CGGAGAGGCT GAGCAGCCTG 120 

CGCCTGAGCT GGTGGAGGTG GAAGTGGGCA GCACAGCCCT TCTGAAGTGC GGCCTCTCCC 180 

AGTCCCAAGG CAACCTCAGC CATGTCGACT GGTTTTCTGT CCACAAGGAG AAGCGGACGC 240 

75 TCATCTTCCG TGTGCGCCAG GGCCAGGGCC AGAGCGAACC TGGGGAGTAC GAGCAGCGGC 300 

TCA6CCTCCA GGACAGAGGG GCTACTCTGG CCCTGACTCA AGTCACCCCC CAAGACGAGC 360 



187 
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GCATCTTCTT GTGCCAGGGC AAGCGCCCTC GGTCCCAGGA GTACCGCATC CAGCTCCGCG 420 

TCTACAAAGC TCCGGAGGAG CCAAACATCC AGGTCAACCC CCTGGGCATC CCTGTGAACA 4 80 

GTAAGGAGCC TGAGGAGGTC GCTACCTGTG TAGGGAGGAA CGGGTACCCC ATTCCTCAAG 54 0 

TCATCTG6TA CAAGAATGGC CGGCCTCTGA AGGAGGAGAA GAACCGGGTC CACATTCAGT 600 

5 CGTCCCAGAC TGTGGAGTCG AGTGGTTTGT ACACCTTGCA GAGTATTCTG AAGGCACAGC 660 

TGGTTAAAGA AGACAAAGAT GCCCAGTTTT ACTGTGAGCT CAACTACCGG CTGCCCAGTG 720 

GGAACCACAT GAAGGAGTCC AGGGAAGTCA CCGTCCCTGT TTTCTACCCG ACAGAAAAAG 780 

TQT6GCTGGA AGTGGAGCCC GTGGGAATGC TGAAG6AAGG GGACCGCGTG 6AAATCAGGT 840 

GTTTGGCTGA TGGCAACCCT CCACCACACT TCAGCATCAG CAAGCAGAAC CCCAGCACCA 900 

10 GGGAGGCT^GA GGAAGAGACA ACCAACX3ACA ACGGGGTCCT GGTGCTGGAG CCTGCCCGGA 960 

AGGAACACAG TGGGCGCTAT GAATGTCAGG CCTGGAACTT GGACACCATG ATATCGCTGC 1020 

TGAGTGAACC ACAGGAACTA CTGGTGAACT ATGTGTCTGA CGTCCGAGTG AGTCCCGCAG 1080 

CCCCTGAGAG ACAGGAAGGC AGCAGCCTCA CCCTGACCTG TGA6GCAGAG AGTA6CCAGG 1140 

ACCTCGAGTT CCAGTGGCTG AGAGAAGAGA CAGACCAGGT GCTGGAAAGG GGGCCTGTGC 1200 

15 TTCAGTTGCA TGACCTGAAA CGGGAGGCAG GAGGCGGCTA TCGCTGCGTG GCGTCTGTGC 1260 

CCAGCATACC CGGCCTGAAC CGCACACAGC TGGTCAAGCT GGCCATTTTT GGCCCCCCTT 1320 

GGATGGCATT CAAGGAGAGG TIAGGTGTGGG TGAAAGAGAA TATGGTGTTG AATCTGTCTT 1380 

GTGAAGCGTC AGGGCACCCC CGGCCCACCA TCTCCTGGAA CGTCAACGGC ACGGCAAGTG 1440 

AACAAGACCA AGATCCACAG CGAGTCCTGA GCACCCTGAA TGTCCTCGTG ACCCCGGAGC 1500 

20 TGTTGGAGAC AGGTGTTGAA TGCACGGCCT CCAACGACCT GGGCAAAAAC ACCAGCATCC 1560 

TCTTCCTGGA GCTGGTCAAT TTAACCACCC TCACACCAGA CTCCAACACA ACCACTGGCC 1620 

TCAGCACTTC CACTGCCAGT CCTCATACCA GAGCCAACAG CACCTCCACA GAGAGAAAGC 1680 

TGCCGGAGCC GGAGAGCCGG GGCGTGG7CA TCGTGGCTGT GATTGTGTGC ATCCTGGTCC 174 0 

TGGCGGTGCT GGGCGCTGTC CTCTATTTCC TCTATAAGAA GGGCAAGCTG CCGTGCAGGC 1800 

25 GCTCAGGGAA GCAGGAGATC ACGCTGCCCC CGTCTCGTAA GACCGAACTT GTAGTTGAAG 1860 

TTAAGTCAGA TAAGCTCCCA GAAGAGATGG GCCTCCTGCA GG6CAGCAGC GGTGACAAGA 1920 

GGGCTCCX3GG AGACCAGGGA GAGAAATACA TCGATCTGAG GCAT TAG CCC CGAATCACTT 1980 

CAGCTCCCTT CCCTGCCTGG ACCATTCCCA GCTCCCTGCT CACTCTTCTC TCAGCCAAAG 204 0 

CCTCCAAAGG GACTAGAGAG AAGCCTCCTG CTCCCCTCAC CTGCACACCC CCTTTCAGAG 2100 

30 GGCCACTGGG TTAGGACCTG AGGACCTCAC TTGGCCCTGC AAGCCGCTTT TCAGGGACCA 2160 

GTCCACCACC ATCTCCTCCA CGTTGAGTGA AGCTCATCCC AAGCAAGGAG CCCCAGTCTC 2220 

CCGAGCGGGT AGGAGAGTTT CTTGCAGAAC GTGTTTTTTC TTTACACACA TTAT6GCTGT 2280 

AAATACCTGG CTCCTGCCAG CAGCTGAGCT GGGTAGCCTC TCTGAGCTGG TTTCCTGCCC 2340 

CAAAGGCTGG CTTCCACCAT CCAGGTGCAC CACT6AAGTG AGGACACACC GGAGCCAGGC 24 00 

35 GCCTGCTCAT GTTGAAGTGC GCTGTTCACA CCCGCTCCGG A6AGCACCCC AGCGGCATCC -2460 

AGAAGCAGCT GCAGTGTTGC TGCCACCACC CTCCTGCTCG CCTCTTCAAA 6TCTCCTGTG 2520 

ACATTTTTTC TTTGGTCAGA AGCCAGGAAC TGGTGTCATT CCTTAAAAGA TACGTGCCGG 2580 

GGCCAGGTGT GGTGGCTCAC GCCTGTAATC CCAGCACTTT GGGAGGCCGA GGCGGGCGGA 2640 

TCACAAAGTC AGGACGAGAC CATCCTGGCT AACACGGTGA AACCCTGTCT CTACTAAAAA 2700 

40 TACAAAAAAA AATTAGCTAG GCGTAGTGGT TGGCACCTAT AGTCCCAGCT ACTCGGAAGG 2760 

CTGAAGCAGG AGAATGGTAT GAATCCAGGA GGTGGAGCTT GCAGTGAGCC GAGACCGTGC 2820 

CACTGCACTC CAGCCTGGGC ' AACACAGCGA GACTCCGTCT CGAGGAAAAA AAAAGAAAAG 2880 

ACGCGTACCT GCGGTGAGGA AGCTGGGCGC TGTTTTCGAG TTCAGGTGAA TTAGCCTCAA 2940 

TCCCCGTGTT CACTTGCTCC CATAGCCCTC TTGATGGATC ACGTAAAACT GAAAGGCAGC 3000 

45 GGGGAGCAGA CAAAGATGAG GTCTACACTG TCCTTCATGG GGATTAAAGC TATGGTTATA 3060 

TTAGCACCAA ACTTCTACAA ACCAAGCTCA GGGCCCCAAC CCTAGAAGGG CCCAAATGAG 3120 

AGAATGGTAC TTAGGGATGG AAAACGGGGC CTGGCTAGAG CTTCGG6TGT GTGTGTCTGT 318 0 

CTGTGTGTAT GCATACATAT GTGTGTATAT ATGGTTTTGT CAGGTGTGTA AATTTGCAAA 324 0 

TTGTTTCCTT TATATATGTA TGTATATATA TATATGAAAA TATATATATA TATGAAAAAT 3300 

50 AAAGCTTAAT TGTCCCAGAA AATCATACAT TGCTTTTTTA TTCTACATGG GTACCACAGG 3360 

AACCTGGGGG CCTGTGAMC TACAACCAAA AGGCACACAA AACCGTTTCC AGTTGGCAGC 3420 

AGAGATCAG6 GGTTACCTCT GCTTCTGAGC AAATGGCTCA A6CTCTACCA GA6CAGACAG 3460 

CTACCCTACT TTTCAGCAGC AAAACGTCCC 6TATGACGCA 6CACGAAGG6 CCTGGCAGGC 3540 
TGTTAGCAGG AGCTATGTCC CTTCCTATCG TTTCCGTCCA CTT 



55 



60 



Seq ID NO: 20 Protein sequence; 
Protein Accession #: NP 006491 



1 11 21 31 41 51 

I I I I I I 

MGLPRLVCAF LLAACCCCPR VAGVPGEAEQ PAPELVEVEV GSTALLKCGL SQSQGNLSHV 60 

DWFSVHKEKR TlilFRVRQGQ GQSEPGEYEQ RliSLQDRGAT LALTQVTPQD ERIPLCQGKR 120 

65 PRSQEYRIQIj RVYKAPEEPN IQVNPLGIPV NSKEPEEVAT CVGRNGYPIP QVIWYKNGRP 180 

LKEEKNRVHI QSSQTVESSG LYTLQSIIiKA QIiVKEDKDAQ FYCELNYRLP SGNHMKESRE 240 

VTVPVFYPTE KVWIjEVEPVG MUKEGDRVEI RCIADGNPPP HFSISKQNPS TREAEEETTN 300 

DNGVLVLEPA RKEHSGRYEC QAWNLDTMIS LLSEPQELLV NYVSDVRVSP AAPERQEGSS 360 

LTLTCEAESS QDLEPQWIiRE BTDQVLERGP VLQLHDLKRE AGGGYRCVAS VPSIPGIiNRT 420 

70 QLVKIjAZFGP PWMAFKERKV WVKENMVIiNL SCBAS6HPRP TIS)9NVNGTA SEQDQDPQRV 480 

LSTIjNVLVTP EIiIiETGVECT ASNDLGKNTS ILPLBIiVNIiT TLTPDSNTTT GLSTSTASPH 540 

TRANSTSTER KLPEPESRGV VIVAVIVCIL VXAVIiGAVLY FLYKKGKLPC RRSGKQEITIi 600 
PPSRKTELW EVKSDKIiPEE MGLLQ6SS60 KRAPGDQGEK YIDLRH 

75 
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Seg ID NO: 21 Nucleotide sequence t 
Nucleic Acid Accession #: NM_002421 

Coding sequence: 72.. 1481 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

GGGATATTGG AGTAGCAAGA GGCTGGGAAG CCATCACTTA CCTTGCACTG AGAAAGAAGA 60 

CAAAGGCCAG T ATG CACAGC TTTCCTCCAC TGCTGCTGCT GCTGTTCTGG GGTGTGGTGT 120 

CTCACAGCTT CCCAGCGACT CTAGAAACAC AAGAGCAAGA TGTGGACTTA GTCCAGAAAT 180 

ACCTGGAAAA ATACTACAAC CTGAAGAATG ATGGGAGGCA AGTTGAAAAG CGGAGAAATA 240 

GTGGCCCAGT GGTTGAAAAA TTGAAGCAAA TGCAGGAATT CTTTGGGCTG AAAGTGACTG 300 

GGAAACCA6A TGCTGAAACC CTGAAGGTGA TGAAGCAGCC CAGATGTGGA GTGCCTGATG 360 

TGGCTCAGTT TGTCCTCACT GAGGGGAACC CTCGCTGGGA GCAAACACAT CTGACCTACA 420 

GGATTGAAAA TTACACGCCA GATTTGCCAA GAGCAGATGT GGACCATGCC ATTGAGAAAG 480 

CCTTCCAACT CTGGAGTAAT GTCACACCTC TGACATTCAC CAAGGTCTCT GAGGGTCAAG 540 

CAGACATCAT GATATCTTTT GTCAGGGGAG ATCATCGGGA CAACTCTCCT TTTGATGGAC 600 

CTGGAGGAAA TCTTGCTCAT GCTTTTCAAC CAGGCCCAGG TATTGGAGGG GATGCTCATT 660 

TTGATGAAGA TGAAAGGTGG ACCAACAATT TCAGAGAGTA CAACTTACAT CGTGTTGCGG 720 

CTCATGAACT C6GCCATTCT CTTGGACTCT CCCATTCTAC TGATATCGGG GCTTTGAT6T 780 

ACCCTAGCTA CACCTTCAGT GGTGATGTTC AGCTAGCTCA GGATGACATT GATGGCATCC 840 

AAGCCATATA TGGACGTTCC CAAAATCCTG TCCAGCCCAT CGGCCCACAA ACCCC3UUkAG 900 

CGTGTGACAG TAAGCTAACC TTTGATGCTA TAACTAOGAT TCGGGGAGAA OTGATCTTCT 960 

TTAAA6ACAG ATTCTACATG CGCAC3\AATC CCTTCTACCC GGAAGTTGAG CTCAATTTCA 1020 

TTTCTGTTTT CTG6CCACAA CTGCCAAATG 6GCTTGAAGC TGCTTACGAA TTTGCCGACA 1080 

GAGAT6AA6T CCGGTTTTTC AAAGGGAATA AGTACTGGGC TGTTCA6GGA CAGAATGTGC 1140 

TACACGGATA CCCCAAG6AC ATCTACA6CT CCTTTGGCTT CCCTAGAACT GT6AAGCATA 1200 

TCGATGCTGC TCTTTCTGAG 6AAAACACTG GAAAAACCTA CTTCTTTGTT GCTAACAAAT 1260 

ACTGGAGGTA TGATGAATAT AAACGATCTA TGGATCCAGG TTATCCCAAA ATGATAGCAC 1320 

ATGACTTTCC TGGAATTGGC CACAAAGTTG ATGCAGTTTT CATGAAAGAT GGATTTTTCT 1380 

ATTTCTTTCA TGGAACAAGA CAATACAAAT TTGATCCTAA AACGAAGAGA ATTTTGACTC 1440 

TCCAGAAAGC TAATAGCTGG TTCAACTGCA GGAAAAATJG AACATTACTA ATTTGAATGG 1500 

AAAACACATG GTGTGAGTCC AAAGAAGGTG TTTTCCTGAA GAACTGTCTA TTTTCTCAGT 1560 

CATTTTTAAC CTCTAGAGTC ACTGATACAC AGAATATAAT CTTATTTATA CCTCAGTTTG 1620 

CATATTTTTT TACTATTTAG AATGTAGCCC TTTTTGTACT GATATAATTT AGTTCCACAA 1680 

ATGGTGGGTA CAAAAAGTCA AGTTTGTGGC TTATGGATTC ATATAGGCCA GAGTTGCAAA 1740 

GATCTTTTCC AGAGTATGCA ACTCTGACGT TGATCCCAGA GAGCAGCTTC AGTGACAAAC 1800 

ATATCCTTTC AAGACAGAAA GAGACAGGAG ACATGAGTCT TTGCCGGAGG AAAAGCAGCT 1860 

CAAGAACACA TGTGCAGTCA CTGGTGTCAC CCTGGATAGG CAAGGGATAA CTCTTCTAAC 1920 
ACAAAATAAG TGTTTTATGT TTGGAATAAA GTCAACCTTG TTTCTACTGT TTT 



Seq ID NO: 22 Protein sequence; 
Protein Accession #: NP 002412 



1 11 21 31 41 51 

I I I I I I 

MHSFPPLIiIiL LFWGWSHSP PATIiBTQEQD VDLVQKVIiBK YYNLKNDGRQ VBKRRNSGPV 60 

VEKLKQMQEF FGUCVTGKPD ABTLKVMKQP RCGVPDVAQF VLTEGNPRWE QTHLTYRIEN 120 

YTPDLPRADV DHAIEKAFQI* WSNVTPLTFT" KVSEGQADIM ISFVRGDHRD NSPFDGPGGN 180 

IiAHAFQPGPG IGGDAHFDED ERWTNNFREY NLHRVAAHEL GHSLGLSHST DIGAIiMYPSY 240 

TFSGDVQIiAQ DDIDGIQAIY GRSQNPVQPI GPQTPKACDS KLTFDAITTI RGEVMFFKDR 300 

FYMRTNPFYP EVELNFISVP WPQLPNGLEA AYEFADRDEV RFFKGNKYWA VQGQNVLHGY 360 

PKDIYSSFGF PRTVKHIDAA LSEENTGKTY FFVANKYWRY DEYKRSMDPG YPKMIAHDFP 420 
GIGHKVDAVF MKDGFFYFFH GTRQYKFDPK TKRILTLQKA NSWFNCRKN 



Seq ID NO; 23 Nucleotide sequence; 

Nucleic Acid Accession #: FGENESH predicted ORF 

Coding sequence: 141-1580 (underlined sequences correspond to start and stop codons) 



1 11 21 

I I I 

TCTGCGTGTG CCGGG6CTAG 6G6CT6GAA6 

AGGCAAACAG AGGAG66AAG GCGTCTTAG6 

CTCTACAGGC CTGTGTCGCT ATGGGTTCCC 

TCC6CGAGTT CACTCGCCAC TCCTCCGACG 

GCGGGATCCT CACTGACGTC ACGCTGCTGG 

CAGTTCTCAT C6CCTGCAGT GGCTTCTTCT 

6GGTGGACGT GCrTCTCTCTG CCCGGGGGTC 

ACTTCATGTA CACTTCGCGC CTGCGCCTCT 

CCGCCACCTA TTTGCAGATG GAGCACGTGG 

GCTATGAACC TCTGGGCATC TCCCTGCGCC 

CGGCCCCTCC ACCAGGTAGT CCCaGGCGCT 



31 41 51 

III 

TCCT6GCTCT AGTTGCACCT CGGAA66AAA 60 

ACTGCCT6GA TCCAGAGCAC TTTCCTCG6C 120 

CCGCCGCCCC GGAGG6AGCG CTGGGCTACG 180 

T6CTGGGCAA CCTCAACGAG CT6CGCCTGC 240 

TTGGC6GGCA ACCCCTCAGA GCACACAAG6 300 

ATTCAATTTT CCGGGGCCGT GCGGGAGTCG 360 

CCGAAGCGAG AGGCTTCGCC CCTCTATTGG 420 

CTCCAGCCAC TGCACCAGCA GTCCTAGCGG 4 80 

TCCAGGCATG CCACCGCTTC ATCCAGGCCA 540 

CCCTGGAAGC A6AACCCCCA ACACCCCCAA 600 

CCGAAGGACA CCCAGACCCA CCTACTGAAT 660 
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CTCGAAGCTG CAGTCAAGGC CCCCCCAGTC CAGCCAGCCC TGACCCCAAG GCCTGCAACT 720 

GGTU^AAAGTA CAAGTACATC GTGCTAAACT CTCAGGCCTC CCAAGCAGGG AGCCTGGTCG 780 

GGGAGAGAAG TTCTGGTCAA CCTTGCCCCC AAGCCAGGCT CCCCAGTGGA GACGAGGCCT 840 

CCAGCAGCAG CAGCAGCAGC AGCAGCAGCA GTGAAGAAGG ACCCATTCCT GGTCCCCAGA 900 

5 GCAGGCTCTC TCCAACTGCT GCCACTGTGC AGTTCAAATG TGGGGCTCCA GCCAGTACCC 960 

CCTACCTCCT CACATCCCAG GCTCAAGACA CCTCTGGATC ACCCTCTGAA CGGGCTCGTC 1020 

CACTACCGGG AAGTGAATTT TTCAGCTGCC AGAACTGTGA GGCTGTGGCA GGGTGCTCAT 1080 

CG6GGCTGGA CTCCTTGGTT CCTGGGGACG AA6ACAAACC CTATAAGTGT CAGCTGTGCC 114 0 

GGTCTTCGTT CCGCTACAAG GGCAACCTTG CCAGTCATCG TACAGTGCAC ACAGGGGAAA 1200 

10 AGCCTTACCA CTGCTCAATC TGCX3GAGCCC GTTTTAACCG GCCAGCAAAC CTGJ\AAACGC 1260 

ACAGCCGCAT CCATTCGGGA GAGAAGCCGT ATAAGTGTGA GACGTGC66C TCGCGCTTTG 1320 

TACAGGTGGC ACATCTGCGG GCGC»CGT6C TGATCCACAC CGGGGAGAAG CCCTACCCTT 1380 

GCCCTACCTG CGGAACCCGC TTCCGCCACC TGCAGACCCT CAAGAGCCAC GTTCGCATCC 1440 

ACACCGGAGA GAAGCCTTAC CACTGCX5ACC CCTGTGGCCT GCATTTCCGG CACAAGAGTC 1500 

15 AACTGCGGCT GCATCTGCGC CAGAAACACG GAGCTGCTAC CAACACCAAA GTGCACTACC 1560 

ACATTCTCGG GGGGCCC^AG CTGA6C6CAG GCCCAGGCCC CACTTGCTTC CTGCGGGTGG 1620 

GAAAGCTGCA GGCCCAGGCC TTGCTTCCCT ATCS^GGCTTG GGCATAGGGG TGTGCCAGGC 1680 

CACTTTGGTA TCAGAAATTG CCACCCTCTT AATTTCTCAC TGGGGAGAGC AGGGGTGGCA 1740 

GATCCTGGCT AGATCTGCCT CTGTTTTGCT GGTCAAAACC TCTTCCCCAC AAGCCAGATT 1800 

20 GTTTCTGAGG AGAGAGCTAG CTAGGGGCTG GGAAAGGGGA GAGATTGGAG TCCTGGTCTC 1860 

CCTAAGGGAA TAGCCCTCCA CCTGTGGCCC CCATTGCATT CAGTTTATCT GTAAATATAA 192 0 

TTTATTGAGG CCTTTGGGTG GCACCGGGGC CTTCATTCGA TTGCATTTCC CACTCCCCTC 1980 

TTCCACAAGT GTGATTAAAA GTGACCAGAA ACACAGAAGG TGAGATCACA GCTCTGCTGG 2040 

CAGAGATTAC TAGCCCTTGG CTCTCTCGTT TGGCTTGGGT ATTTTATATT ATTTCTGTCA 2100 

25 TAACTTTTAT CTTTAGAATT GTTCTTTCTC CTGTTTGTTT GCTTGTTAGT TTGTTTAAAA 2160 

TGGAAAAAGG GGTTCTCTGT GTTCTGCCCC TGTAATTCTA GGTCTGGAAC CTTTATTTGT 2220 

TCTAGGGCAG CTCTGG6AAC ATGCGGGATT GTGGAATTGG GTCAGGAACC CTCTCTGGTA 2280 

TTCTGGATGT TGTAGGTTCT CTAGCAGTCT AGAAATGGAT ACAGACATTT CTCTGTTCTT 2340 

CAAGGGTGAT AGGAACCATT ATGTTGAGCC CAAAATGGAA GTAAT7U\TAA ATGCCTCCTG 2400 

30 GAGGCTGTGG GTGTGGGGGA TTCTGTATCT GGATTCCGTA TCACTCCAAC TGGAGGCTGT 2460 

GG6TGTGGGG GATTCTGTAT CT6GATTCC!G TATCACTCCA AGTG6AG6CT GGCAGGTTTT 2520 

TCTGCAAGAT GGTCCAGAAT CTAAAATGTC CCATTAATCT 6GTCACTTGG GTTTGGCTCT 2580 

GCTGTATCCA TCTATAGTGG TAGA6ACCCA CCAGGGCTCA AGTGGAGTCC ATCATCCTCC 2640 

CACGGGGGCC T6TTCTTAGC ACTGAGTTGA TCGCTCCATG GGGGAGAGAT CAGACATTCC 2700 

35 TTATCAGAGA TGATGTGACC TTTTCTGACT CTGCCCAGTC TCTATGAATG TTATGGCCTA 2760 

GGGAAGAATC ATGAAACTCT TTAGCTTGAT TAGAT6GTAA ACAGTGTTAA CCCATCCTTT 2820 

ACTACAGAGG CATATGGGTT TGAATGTTAC CTGGG6TTCT CTCTATTGAG TTGA6CCCCT 2880 

TCTTCCTTTA GTGGGTTTTG GACATCTTCT GGCAAGT6TC CAGATGCCAG AACCTTCTTT 2940 

TCCTCTAGAA GGGATGGTGC TTGGTAACCT TACCTTTTAA AAGCTGGGTC TGTGACCTGG 3000 

40 TCTTCCCATC CCTGCATTCC TGTCTGGAAC CAGTGAATGC ATTAGAACCT TCCATAGGAA 3060 

AAGAAAAGGG GCTGAGTTCC ATTCTGGGTT TGCTGTAGTT TGGTTGGGAT TATTGTTGGC 3120 

ATTACAGATG TAAAAGATTG ACTAGCCCAT AGGCCAAAGG CCTGTTCTAG TTGACCAAGT 3180 

TTCAAGTAGG ATT7UVGAGGT TGGTTGAGGG GTGCAGTTTC TGGTGTAGGC CAGGTAGGTA 324 0 

GAAAGTGAGG AACAGGGTTG CCTCTTGGCT GGGTGGAGTC TCTGAAATGT TAGAAGAAGC 3300 

45 GCTGAAGCCT TGATTGATAG TTCTGCCCCT TGTTGCCCTG GGGCTTATCT GATTATGGGA 3360 

CGAGGGTAGA 7VAGTAAGAAG CACTTTTGAA TTTGTGGGGT AGAACTTCAA CAATAAGTCA 3420 

GTTCTAGTGG CTGTCGCCTG GGGACTAGTG AGAAAGCTAC TCTTCTCCCT CTTCCCTCTT 34 80 

TCTCCCCATG GCCCCACTGC AGAATTAAAG AAGGAAGAAG GGAAGGCGGA GGAGTCTATA 3 54 0 

AGAAGGAATC ATGATTTCTA TTTAGCAGAT TGGATGGGCA GGTGGAGAAT GCCTGGGGGT 3600 

50 AGAAATGTTA GATCTT6CAA CATCAGATCC TT66AATAAA GAAGCCTCTC TGYGCWRAAA 3660 
AAAAAAAAAA AAAAAA 

Seg ID NO: 24 Protein Beouence! 
55 Protein Accession #: FGENESH predicted 

1 11 21 31 41 51 

An ' I ' ' ' ' 

50 MGSPAAPEGA LGYVREFTRH SSDVIiGNLNE LRLRGILTDV TLLVGGQPLR AHKAVLIACS 60 

GFFYSIFRGR AGVGVDVLSIj PGGPEARGPA PLLDFMYTSR LRLSPATAPA VLAAATYIiQM 120 

EHWQACHRF IQASYEPLGI SLRPLEAEPP TPPTAPPPGS PRRSEGHPDP PTESRSCSQG 180 

PPSPASPDPK ACNWKKYKYI VLNSQASQAG SLVGERSSGQ PCPQARLPSG DEASSSSSSS 24 0 

SSSSEEGPIP GPQSRIjSPTA ATVQPKCGAP ASTPYLLTSQ AQDTSGSPSE RARPLPGSEF 3 00 

65 FSCQNCEAVA GCSSGIiDSLV PGDEDKPYKC QLCRSSFRYK GNLASHRTVH TGEKPYHCSI 360 

CGARFNRPAN IiKTHSRIHSG EKPYKCETCG SRFVQVAHIiR AHVLIHTGEK PYPCPTCGTR 420 
FRHIiQTIiKSH VRXHT6EKPY HCDPCGLHFR HKSQZiRIiHIiR QKH6AATNTK VHYHXLGGP 



70 Seq ID NO: 25 Nucleotide sequence; 
Nucleic Acid Accession #: U21551 

Coding sequence: 1..1155 (underlined sequences correspond to start and stop codons) 



75 



1 - 11 21 31 41 51 

I 1 I I t i 

ATGG ATTGCA GTAACGGATC GGCAGAGTGT ACC6GA6AAG GAGGATCAAA A6A6GT6GTG 60 
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GGGACTTTTA AGGCTAAAGA CCTAATAGTC ACACCAGCTA CCATTTTAAA GGAAAAACCA 120 

GACCCCAATA ATCT6GTTTT TGGAACTGTG TTCACGGATC ATATGCTGAC GGTGGAGTGG 180 

TCCTCAGAGT TTGGATGGGA GAAACCTCAT ATCAAGCCTC TTCAGAACCT GTCATTGCAC 240 

CCTGGCTCAT CAGCTTTGCA CTATGCAGTG 6AATTATTTG AAGGATTGAA GGCATTTCGA 300 

5 GGAGTAGATA ATAAAATTCG ACTGTTTCAG CCAAACCTCA ACATGGATAG AAT6TATCGC 360 

TCTGCTGTGA GGGCAACTCT GCCGGTATTT GACAAAGAAG AGCTCTTAGA 6TGTATTCAA 420 

CAGCTTGTGA AATTGGATCA AGAATGGGTC CCATATTCAA CATCTGCTAG TCTGTATATT 480 

CGTCCTGCAT TCATTG6AAC TGAGCCTTCT CTTGGAGTCA AGAAGCCTAC CAAAGCCCTG 540 

CTCTTTGTAC TCTTGAGCCC AGTGGGACCT TATTTTTCAA GTGGAACCTT TAATCCAGTG 600 

10 TCCCTGTGGG CCAATCCCAA GTATGTAAGA GCCTGGAAAG GTGGAACTGG GGACTGCAAG 660 

ATGGGAGGGA ATTACGGCTC ATCTCTTTTT GCCCAATGTG AAGACGTAGA TAATGGGTGT 720 

CAGCAGGTCC TGTGGCTCTA TGGCAGAGAC CATCAGATCA CTGAAGTGGG AACTATGAAT 780 

CTTTTTCTTT ACTGGATAAA TGAAGATGGA GAAGAAGAAC TGGCAACTCC TCCACTAGAT 840 

GGCATCATTC TTCCAGGAGT GACAAGGCGG TGCATTCTGG ACCTGGCACA TCAGTGGGGT 900 

15 GAATTTAAGG TGTCAGAGAG ATACCTCACC ATGGATGACT TGACAACAGC CCTGGAGGGG 960 

AACAGAGTGA GAGAGATGTT TAGCTCTGGT ACAGCCTGTG TTGTTTGCCC AGTTTCTGAT 1020 

ATACTGTACA AAGGCGAGAC AATACACATT CCAACTATGG AGAATGGTCC TAAGCTGGCA 1080 

AGCCGCATCT TGAGCAAATT AACTGATATC CAGTATGGAA GAGAAGAGAG CGACTGGACA 1140 
ATT6TGCTAT C CTGA 

20 

Seq ID NO: 26 Protein sequence : 
Protein Accession #: AAB08528 

25 1 11 21 31 41 51 

I I 11 11 

MDCSN6SAEC TGEGGSKBW GTPKAKDMV TPATZI<K£KP DPNNLVFGTV PTDHMIiTVEW 60 

SSEF6WEKPH IKPLQNIiSIiH PGSSAIiHYAV BLFEGIiKAFR GVDNKIRLFQ PNIiNMDRNYR 120 

SAVRATIiPVF DKEELLECIQ QLVKLDQEWV PYSTSASIiYI RPAFIGTEPS IiGVKKPTKAI* 180 

30 IjFVIjIiSPVGP YFSSGTFNPV SLWANPKYVR AWKGGTGDCK MGGNYGSSI*F AQCEDVDNGC 240 

QQVIiWIiYGRD HQITBVGTMN LFIiYWINEDG BBBIiATPPLD GIILPGVTRR CIIjDLAHQWG 300 

EPKVSERYIiT MDDLTTAIiEG NRVREMPSSG TACWCPVSD ILYKGETIHI PTMENGPKIiA 360 
SRZIiSKLTDI QYGREESDWT IVLS 

35 

Seq ID NO: 27 Nucleotide sequence : 
Nucleic Acid Accession #: XM_039209 

Coding sequence: 6 56.. 2758 (underlined sequences correspond to start and stop codons) 

40 

1 11 21 31 41 51 

I I I I t I 

TCGCGCGGGG GCCGCCCCCT CCCCTTCCCT CCACCCTGGG CGGGGGCGCG CGAGAAGCGG 60 

TGACGTCAAG GGGCGCGCTG TGGCAGCACC TCCCCGCGCG CTAGTTAAAA AGAAGAAGAA 120 

45 AAGAGGGAAC GAAACATGAG AGGCTGTGTG AGAAGCTGCA GCCGCC6GCA GAGGAGACCT 180 

CAGCATCATC TAGAGCCCAG C6CTGGCCCT GCCTCCGCCT GCCCCGCC6C CGCCGTCGCC 240 

GTTTCTGTTC CTGCTACTGT CCCACCTAAA CAACTCCCGT TACACG6ACA A6TGAACATC 300 

TGTGGCTGTC CTCTCCTTTT CTTCCTCCTC TTCCAACTCC TTCTCCTCCT CCCACTTCCX: 360 

AGCCGCAGCA GAAAGCCCCC AACCCAACTG ACACTGGCAC AACT6CAAAC GGTGTC ATCC 420 

50 GCACAACTTT ATCTCGCTCC TCGGGCTCCC CTAAG6CATT 6GACCCATCG CC6CGTCTTT 480 

TATTTTTT6C AAAGTTGCAT CGCTGTACAT ATTTTTGTCC CCGCCACCTC CCTCTGTCTC 540 

TGGAGTGCCC TACAGCCCCG CAAACTCCTC CTGGA6CTGC GCCCTAGTGC CCCTGCTGGG 600 

CAGTGGCGTT CCCCCCCATC CTCCCGC6CC CAGCCCCTGC TGCTCTG6GC AGACGATGCT 660 

GAAGATGCTC TCCTTTAAGC T6CTGCTGCT GGCCGTGGCT CTGGGCTTCT TTGAAGGAGA 720 

55 TGCTAA6TTT GGGGAAAGAA ACGAAG6GAG CGGAGCAAGG AGGAGAAGGT GCCTGAATGG 780 

GAACCCCCCG AAGCGCCTGA AAAGGAGAGA CAGGAGGATG ATGTCCCAGC TGGAGCTGCT 84 0 

GAGTGGGGGA GAGATGCTGT GC6GTGGCTT CTACCCTCGG CTGTCCTGCT GCCTGCGGAG 900 

TGACAGCCCG GGGCTAGGGC GCCTGGAGAA TAAGATATTT TCTGTTACCA ACAACACAGA 960 

ATGTGGGAAG TTACTGGAGG AAATCAAATG TGCACTTTGC TCTCCACATT CTCAAAGCCT 1020 

60 GTTCCACTCA CCTGAGAGAG AAGTCTTGGA AAGAGACCTA GTACTTCCTC TGCTCTGCAA 1080 

AGACTATTGC AAAG/UVTTCT TTTACACTTG CCGAGGCCAT ATTCCAGGTT TCCTTCAAAC 114 0 

J^ACTGCGGAT GAGTTTTGCT TTTACTATGC AAGAAAAGAT GGTGGGTTGT GCTTTCCAGA 1200 

TTTTCCAAGA AAACAAGTCA GAGGACCAGC ATCTAACTAC TTGGACCAGA TGGAAGAATA 1260 

TGACAAAGTG GAAGAGATCA GCAGAAAGCA CAAACACAAC TGCTTCTGTA TTCAGGAGGT 1320 

65 TGTGAGTGGG CTGCGGCAGC CCGTTGGTGC CCTGCATAGT GGGGATGGCT CGCAACGTCT 1380 

CTTCATTCTG GAAAAAGAAG GTTATGTGAA GATACTTACC CCTGAAGGAG AAATTTTCAA 1440 

GGAGCCTTAT TTGGACATTC ACAAACTTGT TCAAAGTGGA ATAAA6GGAG GAGATGAAAG 1500 

AGGACTGCTA AGCCTCGCAT TCCATCCCAA TTACAAGAAA AATGGAAAGT T6TATGTGTC 1560 

CTATACCACC AACCAAGAAC GGTGGGCTAT CGGGCCTCAT GACCACATTC TTA6GGTTGT 1620 

70 GGAATACACA GTATCCA6AA AAAATCCACA CCAAGTTGAT TTQAGAACAG CCAGAGTCTT 1680 

TCTTGAAGTT 6CAGAACTCC ACAGAAAGCA TCTGGGA6GA Ca^CTGCTCT TTGGCCCTGA 1740 

CGGCTTTTTG TACATCATTC TTGGTGATGG GATGATTACA CTGGATGATA TGGAAGAAAT 1800 

GGATGGGTTA AGT6ATTTCA CAGGCTCAGT GCTACGGCTG GATGTGGACA CAGACATGTG 1860 

CAACGTGCCT TATTCCa^TAC CAAGQAGCAA CCCACACTTC AACAGCACCA ACCAGCCCCC 1920 

75 CGAAGTGTTT GCTCATGGGC TCCACGATCC AGGCAGATGT GCTGTGGATA GACATCCCAC 1980 

TGATATAAAC ATCAATTTAA CGATACTGTG TTCAGACTCC AAT6GAAAAA ACAGATCATC 2040 
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AGCCAGAATT CTACAGATAA TAAAGGGGAA AGATTATGAA AGTGAGCCAT CACTTTTAGA 2100 

ATTCAAGCCA TTCAGTAATG GTCCTTTGGT TGGTGGATTT GTATACCGGG GCTGCCAGTC 2160 

AGAAAGATTG TATGGAAGCT ACGTGTTTGG AGATCGTAAT GGGAATTTCC TAACTCTCCA 2220 

GCAAAGTCCT GTGACAAAGC AGTGGCAAGA AAAACCACTC TGTCTCGGCA CTAGTGGGTC 2280 

5 CTGTAGAGGC TACTTTTCCG GTCACATCTT GGGATTTGGA GAAGATGAAC TAGGTGAAGT 234 0 

TTACATTTTA TCAAGCAGTA AAAGTATGAC CCAGACTCAC AATGGAAAAC TCTACAAAAT 2400 

TGTAGATCCC AA7UVGACCTT TAATGCCTGA GGAATGCAGA GCCACGGTAC AACCTGCACA 2460 

GACACTGACT TCAGAGTGCT CCAGGCTCTG TCGAAACGGC TACTGCACCC CCACGGGAAA 2520 

GTGCTGCTGC AGTCCAGGCT GGGAGGGGGA CTTCTGCAGA ACTGCAAAAT GTGAGCCAGC 2580 

10 ATGTCGTCAT GGAGGTGTCT 6TGTTAGACC GAACAAGTGC CTCTGTAAAA AAGGATATCT 2640 

TGGTCCTCAA TGTGAACAAG TGGACAGAAA CATCCGCAGA GTGACCAGGG CAGGTATTCT 2700 

TGATCAGATC ATTGACATGA CATCTTACTT GCTGGATCTA ACAAGTTACA TTGT ATAGT T 2760 

TCTGGGACTG TTTGAATATT CTATTCCAAT GGGCATTTAT TTTTTATCCT GTCATTAAAA 2820 

AAAT^AAGACT GTTATCCTGC TACACACTCC TGTGATTTCA TTCTCTTTTA TTAATTTAAA 2880 

15 AATAATTTCC AGAAATGTGC AGATCCTCTG TGTGTATGTC AGCATGTTTG TTCACATATG 2940 

CACATACACA TACTCATAAC CCCTATATGC GTTGTTGCAT AACAGATGAT TTTTTAAAAT 3000 

ATATACTTCC TTATGCAAAG TAATTTACAC AGAAATTCXIA TTGTAAATTG ATAATGGATT 3060 

TTTTATGTTA CTAGAAGAGA TTATTTGACT TCCCAGGAAT TTTCTGTCTG TAATCACTAA 3120 

AGTCAACTTT AATAGAGTTT TGAAACAGTA CT6T6CAATC CGAT6GATCT AATTAAAAAA 3180 

20 AAGGCAATAT TTTTATATTA AAGTACTATA CTA6GAGAGA ATGTTTCAGA ACTCCCT6AT 3240 

GAATTTCTAA GTGAGCAACT TGATATAAAA TTGTAATCTT CATTTTTGTC AGTGTATCCA 3300 

GTTACAGAAT GCTACACACT TACCTTTTTA TTGGCTGAGA AATCTGGTTA TTTCATCTTA 3360 

ATCTCAAGAT TGTTTTCAAG TGTTTTATAA TTAAATCATA ATAGCATATT TTAAAATCAA 3420 

TCTTCCTAAA AGGTCTGCTT TTATTGTATA TTTTATTTAA CAATAGGCAC TGGGTTTGTG 3480 

25 TTACATATTT ATATATTTTA TTTTATTTTT ATAATATAGA CATCACCTAG 

Seq ID NO: 28 Protein sequence ; 
Protein Accession #: XP_039209 

30 

1 11 21 31 41 51 

I I I I I I . 

MLKMItSFKIiIj liliAVAIiGFFB GDAKFGERNE G5GARRRRCL NGNPPKRLKR RDRRMMSQLE 60 

IiIiSGGEMLCG GFYPRIaSCCL RSDSPGLGRIi EHKIFSVTNN TEC6KLLEEI KCALCSPHSQ 120 

35 SIiFHSPEREV LERDIiVIiPIO. CKDYCKBFFY TCRGHIPGFIi QTTADEFCFY YARKDGGIiCP 180 

PDFPRKQVRG PASNYIiDQME EYDKVEEZSR KHKHNCFCZQ EWSGLRQPV GALHSGDGSQ 240 

RLFILEKEGY VKILTPEGEI FKEPYLDZHK LVQSGIKGGD ERGLLSIiAFH PNYKKNGKIiY 300 

VSYTTNQERH AIGPHDHIIiR WEYTVSRKN PHQVDLRTAR VFLEVAEItHR KHLGGQLIiFG 360 

POOFLYZIIiG D6MZTLDOMB EMDGLSDFTG SVZiRIiDVDTD MCNVPYSZPR SUPHFKTSTNQ 420 

40 PPEVFAHGLH DP6RCAVDRH PTDINZNLTZ IjCSDSNGKNR SSARILQZZK GKDYESBPSI* 480 

IiEFKPFSNGP LVGGFVYRGC QSERLYGSYV FGDRNGNFLT LQQSPVTKQW QEKPLCLGTS 540 

GSCRGYFSGH ILGFGEDELG EVYILSSSKS MTQTHNGKLY KIVDPKRPLM PEECRATVQP 600 

AQTIiTSECSR LCRNGYCTPT GKCCCSPGWE GDFCRTAKCE PACRHGGVCV RPNKCLCKKG 660 
YLGPQCEQVD RNIRRVTRAG ZLDQIZDMTS YIiLDXiTSYZV 

45 

Seq ID NO: 29 Nucleotide sequence: 
Nucleic Acid Accession #: NM_024756 

Coding sequence: 75.. 2924 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I - I I I I 

AA6ACAACGT .CACTAGCAGT TTCTGGAGCT ACTTGCCAAG 6CT6A6TGT6 A6CTGAGCCT 60 

6CCCCACCAC CAA GATGA TC CTGAGCTTGC T6TTCAGCCT TGG6GGCCCC CTG6GCTGGG 120 

55 GGCTGCTGGG GGCATGGGCC CAGGCTTCCA GTACTAGCCT CTCTGATCTG CAGAGCTCCA 180 

GGACACCTG6 6GTCTGGAA6 GCAGAGGCTG AGGACACXTAG CAAGGACCCC GTT6GACGTA 240 

ACTGGTGCCC CTACCCAATG TCCAAGCTGG TCACCTTACT AGCTCTTTGC AAAACAGAGA 300 

AATTCCTCAT CCACTCGCAG CA6CC6T6TC CX3CAGGGAGC TCCAGACTTGC CAGAAAGTCA 360 

AAGTCATGTA CCGCATGGCC CACAAGCCAG TGTACCAG6T CAA6CAGAAG GTGCTGACCT 420 

60 CTTTGGCCTG GAGGTGCTGC CCTGGCTACA CGGGCCCCAA CTGCGAGCAC CACGATTCCA 480 

TGGCAATCCC TGAGCCTGCA GATCCTGGTG ACAGCCACCA GGAACCTCAG GATGGACCAG 540 

TCAGCTTCAA ACCTGGCCAC CTTGCTGCAG TGATCAATGA GGTTGAGGTG CAACAGGAAC 600 

AGCAGGAACA TCTGCTGGGA GATCTCCAGA ATGATGTGCA CCGGGTGGCA GACAGCCTGC 660 

CAGGCCTGTG GAAAGCCCTG CCTGGTAACC TCACAGCTGC AGTGATGGAA GCAAATCAAA 720 

65 CAGGGCACGA GTTCCCTGAT AGATCCTTGG AGCAGGTGCT GCTACCCCAC GTGGACACCT 780 

TCCTACAAGT GCATTTCAGC CCCATCTGGA GGAGCTTTJ\A CCAAAGCCTG CACAGCCTTA 84 0 

CCCAGGCCAT AAGAAACCTG TCTCTTGACG TGGAGGCCAA CCGCCAGGCC ATCTCCAGAG 900 

TCCAGGACAG TGCCGTGGCC AGGGCTGACT TCCAGGAGCT TGGTGCCAAA TTTGAGGCCA 960 

AGGTCCAGGA GAACACTC7VG AGAGTGGGTC AGCTGCGACA GGACGTGGAG GACCGCCTGC 1020 

70 ACGCCCAGCA CTTTACCCTG CACCGCTCGA TCTCAGAGCT CCAAGCCGAT GTGGACACCA 1080 

AATTGAAGAG GCTGCACAAG GCTCAGGAGG CCCCAGGGAC CAATGGCAGT CTGGTGTTGG 114 0 

CAACX3CCTGG GGCTGGGGCA AGGCCTGAGC CGGACAGCCT GCAGGCCAG6 CTGGGCCAGC 1200 

TGCAGAGGAA CCTCTCAGAG CTGCACATGA CCACGGCCCG CTAGGGAGGAG GA6TTGCAGT 126 0 

ACACCCTGGA GGACATGAGG GCCACCCTGA CCCGGCACGT GGATGAGATC AA6GAACTGT 1320 

75 ACTCCGAATC GGACGAGACT TTCGATCAGA TTA6CAAGGT GGAGC6GCA6 GTGGAG6AGC 1380 

T6CAGGTGAA CCACAC6GCX3 CTCCGTGAGC TGCGCX3TGAT CCTGATGGAG AAGTCTCTGA 1440 
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TCATGGAGGA GAACAAGGAG GAGGTG6AGC 
AGCACCTGCA GGGTGGCCAT GCCGACCTCA 
AGCTCTATTT AGACCTGGAC GTCATCCGG6 
AGGAGACCCA 6GTGAGCCTG GACGAGC6GC 
5 TGCAGAACGC CGTGGACGCC GTGTCGCTG6 
GG6CGCGGGC G6CCACGTCG CGGCTCCGGA 
GCGCGCT6AA GGCGGCCGCX3 ^GCCGAGGCCC 
TCGCCGCCCT GCTGGAGGAC GC6CT6CGGC 
AGGAGGTGCT GGAG6AGAT6 TCTGAGCAGA 

10 A6ATCC6CGT 6GCCCT6CAG GAC6CCGCTA 
ACGAGCTGGC CGCCCGAGTG ACGGCCCTGG 
AGCACCT6GA GCCCAGCCAC GACGCGGGCC 
GGCTGGCGCG GGAGCTCCAG AGCCTGAGCA 
AGGCCGAGGC CGGGGCCGGG GCCGCCTCCC 

15 CACTCTTCGC CACTCAGCGC AGCTTGGAGC 
GG7UVCTTCCA AGGGCTCATG GT^GCCAACG 
TGCTGAGCAG GAAAGGGAAG AAGCAGCAGA 
AGAAGGAAGC GGAGCCTTTG GTGGACATAC 
GCGCGGCGCT CTGGGAGGCA GGATCCCCTG 

20 CGGCTGCCCT GCAGACAGTG AAGTTCAACA 
TCCCTGAACA TGGCTACTTC CGAGCCCCTG 
TTGAATTTGG CCCAGGGCCA GGCACCGGGC 
CAGTCTGTAC CACTGbGCAG GGGAGTGGAA 
TGCAGAAGGG TGAGCGAGTA TGGTTTGAGT 

25 TGTCGGGCAC TGCATTTGGG GGCTTCCTGA 
GATCAGACAT CATGGACTCG CCCAGCTCTC 
TGGAGGTCAT TCAGTTGGTC TGTCTCTTCC 
TACGTGGCTT CCCTGTAACC ACATG6G6CT 
AATGCTTCTC CGGGCAGGAC ATG6TCCTAG 

30 TCAGACAGCA CX3GCCTGGGC TCCAACTCTT 
GT6TTTTGCT CCTCCTGTGG TTGGAAACTT 
TCCTCTTCTC TTCTCCCTTA TCGTATGATA 
AAAATGGAG6 CAACATTTTG GCCAACATTG 
AACCTGCTTG GTCTCTATTA GTOVGTAATG 

35 GAAATTAGTA TCTTTAGTTT CAGTCATTCC 
CTAAAATATC ATCTTGAATT GTAATCCCTA 
GAGGTAATTG GATCTTGGGG GCGGTTCCCC 
TCTGATGATT TTATAAGTTT GATAGTTCCT 
TGAAGATGCC TTGGTTCCTC TTCACTGTCT 

40 CAGCCATGTG GAACAGTGAG TCAATTAAAC 



GGCAGCTCCT GGAGCTCAAC CTCACGCTGC 1500 

TCAAGTAC6T GAAGGACTGC AATTGCCAGA 1560 

A66GCCA6AG GGACX3CCACG CGTGCCCTGG 1620 

GGCAGCTGGA C6GCTCCTCC CTGCAGGCCC 1680 

CCGTGGACGC GCACAAAGCG 6AG66CGAGC 1740 

GCCAAGT6CA GGCGCTGGAT GACGAG6TGG 1800 

GCCACGAG6T GCGCCA6CTG CACAGCGCCT 1860 

ACGAGGCX3GT GCTGGCCGCG CTCTTCGG6G 1920 

CGCCGGGACC GCTGCCCCTG AGCTACGAGC 1980 

6CX36GCTGCA 6GAGCAGGCG CTC66CTG6G 2040 

AGCAGGCCTC GGAGCCCCCG C66CCGGCAG 2100 

GCGAGGAGGC CGCCACCACC GCCCTGGCCG 2160 

ACGACGTCAA GAATGTCGGG CGGTGCTGCG 2220 

TCAACGCCTC CCTTGACGGC CTCCACAACG 2280 

AGCACCAGCG GCTCTTCCAC AGCCTCTTTG 234 0 

TCAGCCTGGA CCTGGGGAAG CTGCAGACCA 24 00 

AAGACCTGGA AGCTCCCCGG AAGAGGGACA 2460 

GGGTCACAGG GCCTGTGCCA GGTGCCTTGG 2520 

TGGCCTTCTA TGCCA6CTTT TCAGAAGGGA 2580 

CCACATACAT CAACATTGGC AGCAGCTACT 2640 

AGCGTGGTGT CTACCTGTTT GCAGTGAGCG 2700 

AGCTGGTGTT TGGAGGTCAC CATCGGACTC 2760 

GCACAGCAAC GGTCTTTGCC ATGGCTGAGC 2820 

TAACCCAGGG ATCAATAACA AAGAGAAGCC 2880 

TGTTTAAGAC CTGAACCCCA GCCCCAATCT 2940 

CTCGGCCTGG GGCTCTGGCC AAGGATGGGC 3000 

CTGGAAACCT TCTGCAAAGA TGGTGTGGTG 3060 

T66CCATTTC TCCAT6ATGA GAAGGACTGG 3120 

GAAGCCTGAA CCTTGGCTTG GCATGCCTTC 3180 

CACCACACCC TGTATTCTAC AACTTCTTTG 3240 

CTGTACAACA CTTTAAACTT TTCTCTT6CT 3300 

GAAAGACATT CTTCCCCAGG AGGAATGTTT 3360 

G7VAAGCACTA 6AG6GCAATG 6GATTAAACC 3420 

T^GACGACAG CCTG6CCAAC CAAGG6AAA6 3480 

TTGTAGGATA TGGTTTAGCT GTGCCCCCAC 3540 

TAATCCCCAC ATCAAGGGAG AGATCAGGTG 3600 

CATGCTGTTC TTGTGATAGT TCTCACGAGA 3660 

CCTGTGTTCA TTCTCCTTCC TGCCACCTTG 3720 

GCCATGATTG TAAGTTTCCT GAGGCCTCCC 3780 
CTCTTTCCTT TATAMTT 
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Seq ID NO: 30 Protein sequence; 
Protein Accession #: NP 079032 



1 11 21 31 41 51 

I I I I I I 

MIIiSLLFSLG GPLGWGLLGA WAQASSTSLS DLQSSRTPGV WKABABDTSK DPVGRNWCPY 60 

PMSKLVTLIiA LCKTEKPLIH SQQPCPQGAP DCQKVKVMYR MAHKPVYQVK QKVLTSIiAWR 120 

50 CCPGYTGPNC EHHDSMAIPE PADPGDSHQE PQDGPVSFKP GHIiAAVINEV BVQQEQQEHIj 180 

LGDLQNDVHR VADSIiPGLWK AI»P6NIiTAAV HEANQTGHEF PDRSIiEQVIjXi PHVDTFIiQVH 240 

FSPIWRSFNQ SLHSIiTQAIR NLSLDVEANR QAISRVQDSA VARADFQEIiG AKFEAKVQEN 300 

TQRVGQIiRQD VEDRLHAQHF TU1RSISELQ ADVDTKI>KRL HKAQEAPGTN GSIiVIiATPQA 360 

GARPEPDSLQ ARIiGQLQRNlj SEUOWTARR EEELQYTLBD MRATLTRHVD EIKELYSESD 420 

55 ETFDQISKVE RQVEELQVNH TALREIiRVHi MEKSLIPIBBN KEEVERQLLE LNLTLQHLQG 480 

GHADLIKYVK DCNCQKLYIiD LDVIREGQR0 ATRAIiEETQV SLDERRQLDG SSLQALQNAV 540 

DAVSliAVDAH KAEGERARAA TSRIiRSQVQA LDDEVGAI*KA AAAEARHEVR QLHSAFAALL 600 

EDALRHEAVI* AALFGEEVIiE EMSEQTPGPL PLSYEQIRVA LQDAASGLQE QALGVHDELAA 660 

RVTALEQASE PPRPAEHLEP SHDAGREEAA TTAIiAGIiARE LQSIiSNDVKN VGRCCEAEAG 720 

60 AGAASLNASb DGLHNALFAT QRSLEQHQRL FHSLFGNFQG IjMEANVSIiDL GKLQTMIiSRK 780 

GKKQQKDLEA PRKRDKKEAE PtiVDIRVTGP VPGAIjGAALW EAGSPVAFYA SFSEGTAALQ 840 

TVKFNTTYIN IGSSYFPEHG YFRAPERGVY LFAVSVEFGP GPGTGQIiVFG GHHRTPVCTT 900 
GQGSGSTATV FAMAEIiQKGE RVWFELTQGS ITKRSIjSGTA FGGFLMFKT 

65 Seq ID NO: 31 Nucleotide secruence; 

Nucleic Acid Accession #: AB037715 

Coding sequence: 370.. 34 8 9 (xinderlined sequences correspond to start and stop codons) 



70 1 11 21 31 41 51 

I I I I I I 

GAACGCTCAC AGAACAGGCA GTGCMTTCC ATGTTCCTCT TAAGTATGTT AGCCCTACCG 60 

GGAGCTGAGC TGGCCAGTCT ACTTGGAGAG GAAAAGTAGA TCTGGGGAAG GTGGAAGGGT 120 

CAGTTCCTAA GTGACTTCCT CCTCGGGGAT GGTAAGGGCA TTTGCTGATC TCCAGTGACT 180 

75 GCCTGGTGCC TCATGGTCAG ACTCGGCTGT CTCACTCCCA GATATCTGAT TTTCCAAAAA 240 

GGGACACACC TATCTGCAGC AAAGAAGACA CTGACCAGAT T6C6AGCGGT GCTTTTGGAT 300 
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GCTCTGTAGC CACCCGGGGC CCAGGA6GAC 
TCGGAGACCa 3SGCAGTGCA 6CTGGTGCCC 
GAGGGCCGCC GATGTCAAGT ACATCTTCTT 
CCCAAGCTGT TGGCCAA6GA GCTTCTTGAC 
5 AAGGAGTACT TTGGAATAGC ATTCACAGAT 
GATCGAAGAG TATTG6AACA TGACTTCCCT 
TGTGTCAGGT TCTATATAGA AAGCATTTCA 
TTCTTTCTGA ACGCGAAGTC CTGCATCTAC 
GTGTTTGAAT TAGCTTCCTA TATTTTACAG 

10 GTTGTGAGGA GTGACTTGAA GAAGCTGCCA 
CCTTCCCTG6 CCTACTGTGA AGACAGAGTC 
ACAAGAGGTC AAGCAATCGT AAACTACATG 
GTTCACTATT ATGCAGTGAA GGACAAGCAG 
AAAGGGATCT TCCAGTATGA CTACCATGAT 

15 AGACAGTTGG AAAACCTGTA CTTCAGAGAA 
CGCAGGGCTT CAGTGAC7VAG GAGGACGTTT 
TATGCATGTC CGGCATTGAT CAAGTCCATC 
TATCTGGACA GAAAGCAGAG TAAGTCCAAA 
GCCATCGACC TGACCGAGAC GGGGACGCTG 

20 AAGGGGAAGA TCATCAGCGG CAGCAGCGGC 
GATAGCTCGC AGTCGGCCAA GAAGGACATG 
CTGGAGGAAA CCCTGCGTCA GAGGCTG6AG 
GAGCTCACGG GCAAGCTGCC AGTAGAATAT 
GTTCGGAGAA GAATAGGAAC AGCCTTCAAA 

25 GAG6AAGCTG AGCTGGAAC6 CCTGGAACGA 
GCCGCCCGCC GCCTAGCCAG TGACCCCAAC 
ACCTCGTATC TGAATGCACT GAAGAAACTG 
CXS'CATCAAGT CTGGGAAGAA ACCCACCCAG 
ATTGCCAGTG AAGACAGCTC CCTCTCAGAT 

30 GTTACCAGCA CAATATCCCC CCTACATTCT 
TCX3CACAACA GGCCTCCTCC TCCCCAGTCC 
CGCAACGACT ATGACAAGTC ACCCATCAAG 
GAACCCTATG AGAAGGTCAA GAAGCGCTCC 
TTCCCCAGCA CAGGAAGCTG TGCGGAAGCC 

35 CCCATCCGCG GCCTCCCGCA CTGGAACTCC 
CGGGTCCGGA GTCCCCACTA CGTCCATTCC 
CTGCACAGCC TCGCACTGCA CTTTAGGCAC 
CTCCTGGGCT CGGAAAACGA CACCGGGAGC 
AGCAACGGCT CAGACCCCAT GGACGACTGC 

40 CACTACTACC CGGCGCAGAT GAACGCCAAC 
AAGGCGCGCC AGAGGCAGAG GCAGCGGCAG 
TCGGGCAGCA TGCCCAACCT GGCGGCGCGC 
GGCGGTGTGT ACCTGCACAG CCAGAGCCAG 
CCGCT6TACA TCX3AGGGCGG ' CGCCACGCCC 

45 GAGTGCCACT ACAGGGTCAA GGCTCAGTTC 
CTGTTCAAGG AGAGCTGGCG CG6CGGCGGC 
CCGTCGCGAT CX3CAGATCCT GCGGACTCCG 
6GC6CGGGCC GTGCCGCCGT CTCAGACGAG 
TCGCACAAGG AGCACAGCCG CCTGTC6CAC 

50 CAGTACAGCA CCTCCTCCCA GAGCACCTTC 
CAGATGTGCA AGGCCACGTC AGCTGCCTTA 
A6TGAAATTG GAGCCACCCC CCCAAGCAGC 
GAAGCAACAG AAAACTCACC CATTCTGGAT 
GATGA ATAGA GGAGCTACAA TGATAGCTGT 

55 GCTGATGTCC AGTGGTACGG GCAGGAAAAA 
CCGGCCTAAT CTGACCGCCT CAACGCCATT 
TTACCCAGAC GCACCGTCAC CCTGCACCAG 
CTCCGCATTC CCTCCCCCTT GAAAACCTGA 
CACTGTGTGT CCCCTGGCGC TCTTGCCCAT 

60 • CTTGGTGGCT TCCCTCTGCC ATGACAGCCC 
GGCATCCAAT TCCTGCGGAT AAGTAGCGTT 
CAGGGTGACC CAGAAAGACG ATTCAGCTGT 
CAAGCACTTC ATGAAGAGGA GGCCTCGTGG 
GATGGGACAG CTTGTGGGGA TGGCTATGGG 

65 ACACCAGAAA TGCATC6GAG GACCACAATC 
TAAAAACATA AAAAATTTU^ AGGGGCCAAG 
TTTTAAATTC TGAACTGCTA CTACACACAA 
CTCTCTCTAG CCCTCTCCCT TACTGGCCCA 
GCCCCAATGC CACGGTAAAG GCGAG6AAGT 

70 ATCCATCTGG ACACAAAGAG AGACCTGTGG 
CATGCA6GGG GTTCAGCCGA GCXTCAAGACT 
AACGTAA6GT GATAATGGCC AAAAGTGGTT 
ATCCTATTTT TTTGCATAA6 GTGTTTCATT 
ACATTGCGAT CCATTCAGTG TTTAACTGTC 

75 GTGACAAAAG AGCTCAGATC CGACTTCTCC 
TGCCCTTAGG TAGAAAGATT T6ACTC6TGT 



TGACTCGGCA GCAGGATTCG TGCATGGGAA 360 

GACTCAGCTC TCGGCCTGCT GATGATGACG 420 

GAT6ACAGGA AGCTGGAACT CCTAGTACAG 480 

CTTGTGGCTT CTCACTTCAA TCTGAA6GAA 540 

GAAACGGGAC ACTTAAACTG 6CTTCA6CTA 600 

AAAAAGTCAG GACCCGTGGT TTTATACTTT 660 

TACCTGAAGG ATAATGCTAC CATTGAGCTT 720 

AAGGAGCTTA TTGACGTTGA CA6CGAAGTG 780 

GAGGCAAAGG GAGATTTTTC TAGCAAT6AA 840 

GCCCTTCCC3V CCCAAGCCCT 6AAG6AGCAC 900 

ATTQAGCACT ACAAGAAACT 6AACGGTCAG 960 

AGCATCGTGG AGTCTCTCCC AACCTACGGG 1020 

GGCATACCAT GGTGGCTGGG CCTGAGCTAC 1080 

AAAGTGAAGC CTU^GAAAGAT ATTCCAATGG 114 0 

AAGAAGTTTT CCGTGGAAGT TCATGACCCA 1200 

GGGCACAGCG GCATTGCAGT GCACACGTGG 1260 

TGGGCTATGG CCATAAGCCA ACACCAGTTC 1320 

ATCCATGCAG CACGCAGCCT GAGTGAGATC 1380 

AAGACCTCGA AGCTGGCCAA CATGGGTAGC 1440 

AGCCTGCTGT CTTCAGGTTC TCAGGAATCA 1500 

CTGGCTGCCT TGAAGTCCAG GCAGGAAGCT 1560 

GAACTGAAGA AGCTGTGTCT CCGAGAAGCT 1620 

CCCCTGGATC CAGGGGAGGA ACCACCCATT 1680 

CTGGATGAAC AGAAAATCCT GCCCAAAGGA 1740 

GA6TTTGCCA TTCAGTCCCA GATTACGGAG 1800 

GTCAGCTUU^ AACTGAAGAA ACAAAG6AAA 1860 

CAG6AGATTG AAAATGCAAT CAATGAGAAC 1920 

AGG6CTTCGC TGATCATAGA CGATGGAAAC 1980 

GCCCTTGTTC TTGAGGATGA AGACTCTCAG 2040 

CCTCACAAGG GACTCCCTCC TCGGCCACCG 2100 

CTGGAGGGAC TCCX3ACAGAT GCACTATCAC 2160 

CCCAAAATGT GGAGTGAGTC CTCTTTAGAT 2220 

TCTCACAGCC ATTCCAGCAG CCACAAGCGC 2280 

GGCGGAGGAA GCAACTCCTT GCAGAACAGC 2340 

CAGTCCAGCA TGCCGTCCAC GCCAGACCTG 2400 

ACGAGGTCGG TGGACATCAG CCCCACCCGA 2460 

CGGAGCTCCA GCCTGGAGTC CCAGGGCAAG 2520 

CCCGACTTCT ACACCCCGCG GACTCGTAGC 2580 

TCGTCGTGCA CCAGCCACTC GAGCTCGGAG 2640 

TACTCCACGC TGGCCGAGGA CTCGCCGTCC 2700 

CGGGCGGCGG GCGCACTGGG CTCAGCCAGC 2760 

GGGGGTGCGG GGGGCGCGGG GGGCGC6GGG 2820 

CCCAGCTCGC AGTACCGCAT CAAGGAGTAC 2880 

GTGGTGGTGC GCAGCCTGGA GAGCGACCAG 294 0 

AAGACGTCCA ACTCCTACAC GGCGGGCGGC 3000 

GGC6ACGAGG 6CGACACGGG CCGCCTGACG 3060 

TCGCTGGGCC GCGA6GGC6C CCACGACAA6 3120 

CT6C6CCAGT GGTACCA6CX3 TTCCACCGCC 3180 

ACCA6CTCCA CCTCCTCGGA CAGCGGCTCG 3240 

GTGGCGCTiCA GCAGGGTCAC CAGGATGCCC 3300 

CCTCAAAGCC AGAGAAGCTC GACACCGTCA 3360 

CCCCACCACA TCCTAACCTG GCAGACTGGA 3420 

GGGTCTGAGT CTCCACCTCA CCAAAGTACT 3480 

TTCCTGGATT CCTCCCTCTA TCCAGAACTA 3540 

GCCAAGCCCG GGACCCTCGT GTGAGCCAGC 3600 

CTGAGATCAC CTCACTGCCT CTCATTTGCC 3660 

CTTTGGCCCT CAGCACTTTT TTTCTCCTGT 3720 

CTGAGGAGAC ATTCTGGAAG GTTCCGGTCC 3780 

AGAGAGCCAG ACACCAATCC TCAATGGCAC 3840 

CTAGGCCAGG AACCATCAGG GGGGCCAGCC 3900 

GGGAGAGAAC GGGAAAGGGG ACTTGGGTTA 3960 

GTCCAGCCTG CCACCCATAC GTAGGCCAAC 4020 

CATATTCAGT TTACACCTGA AATATTCCTT 4080 

GGAAGGGGAG GTTGAGAAAG GAAGTTCTCG 414 0 

AGTTCTATGC TGCCAAAGAT TAAAAATAAA 4200 

AGGAAGACAT TCTTTCTGCA AGGAAATTTC 4260 

GTGAAAGTCA ACCCTATGTA AACTGGTGTC 4320 

CTTCTCTCTC CGTAGAGAGC CTGAAAAACT 4380 

CTTGGCTGGC GTT6CTGACT CACAGTCGCC 4440 

GAGTCATAGA GGGTACTGTT A6CCCCGGTC 4500 

CAAAGCTGCT TTCCTTTCAG GATTTGTA6T 4560 

CTCTCTCATT AAACCAACCA GTAAAA6CGT 4620 

TTCGTTTTTA TGG6AAACCA AGGGAA2UIGC 4660 

GTGGCTCATT TTCTGTTCGT TAGCACTTGT 4740 

TATGTGTCAC TTATTCCAAG AACCCAACTA 4800 

GTCTACTAGC . CAACAGGCAG AGCAGGGTT6 4860 
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AAAAAAATAT CAGCTCCCAA AGGGCCCATG TGTCTACATC ATCAGTTACT GTCATGCACC 4920 

ACATTTGTGT GCAGATACCA AAAGAGGAGG AAAGAAGA7UV AAAATTAATG TGTGGGAGCT 4 980 

GCACGTTTAC ATGTTTTGAG CTATGCTTCA AACACAACTG GAAAGCCATC AATCTTCAAA 504 0 

GGCCTCAAAA ATACTTTTAT AGTAACT^GT GCACGACTTT AGTTGGGTTA TTCAAGATGG 5100 

CACAAAAAGG TTTCC6CAGA GGTGGTATGC TGTGCTTTTG GCGCAAGTGG TGGGGGGATG 5160 

GGGGTGGGGG TGGAATTTTT TTCTCACTCT AATGACTTCC TATTGGAAAG GCATTGACAG 5220 

CCAGGGACAG 6AGCCAGGGT GGGGGTAGTT TTGTGGGAAA GCAGAACTGA AGTTA6CTTA 5280 

AGCATAAAAA CAAAGJ\AAAA TCTTCGCTTT TCATGTATGT 6GAATCCAAG AATT^CCATA 5340 

GGCTCTACCA GACCAGGAGG GTAAGGATGG ACACTT^TVAAT GAAACAAATA CCAAGGTATT 5400 

CCTTCTGCTG CAGCCTGGAG ACCACCGAGA GTCGAGCTGG GGCACACACA CACCTGGCCG 5460 

GGACCCGGCA GGGACAAGGC GGGCCGTGGC CTCCTCCACC AAGTCTCTCT AGACAATTCA 5520 

GGGCCTGCTT TCCCCAGCTC CATGCATGGC TGGACTGGTG ATTCCAGGGT GCAGAAGGGA 5580 

TTCATATTCC CAGAACGCTT TAAGTGTACA CCTGCAGGAT AAAGAGATAC CGGTTACATT 564 0 

ATTAAATGAT TCTAGGGATT CACTGGGGGA TATTTTTGTT GCTTTTACTT TCATGGTTAG 5700 

AGCTACAAAG AACAGTGATT TTTTTTTTTT CTCCCTTCCC CATTCAGAAA CATTATACAT 5760 

TGGGCCATTT TTCTTTCTCC CAAAGAAGAT TCATGGATAG TCAGACTGAA CTGTGTGCAA 5820 

CAGGAAAAGT CAAAAGGGAA AAGGCAGCTG ATGAGGTTAC ATGGTTACAT GTTCTACATC 5880 

ATGCAGAGTA GCTTGAAATC TAGTCTGGAG AAAACTGGAT CAAGATTCTA GCCCACTGGA 5940 

GTTGCAAGGA ATGAGAGGCA AAAATTCTAA AGATTTGG6T TATATTTTCS^ ACTTGGGG6A 6000 

CAGAGAGAAA TGGAGAGCAG GAATTACAGT TCCAACAAAC ATCATGATAG TCTGGTAGTC 6060 

AAGACAGAGA TTAAGTAAAA CAGGTTTTAC T6TTTAGCTG AGTTCAGTTA ATACAAAAT6 6120 

TACATAAAAC GTTAGTCCTT TGAGACTGAC ATGATTAATG ATCAGTGTG6 TGGGAAATGA 6180 

TGTAGTTATT GTACACAAGC ACTTGCAAAC TCTTTATCCC TATTTCTTTA AAACAAT^TA 6240 

AGGTGAAATA CX3AAGTCCTT GGTCTGATAT AAAGCCCCTA TTGGATTCTT CGGATGCGTA 6300 

AAAGAAATTG CCTGTTTCAG CCAGAAGACT GGTGAAAACA CATACS^TCAG ACTATGTTGT 6360 

GAGCCA6GTT GATTTTTTAT TTTATTATAT GCAGGTGAGT GTTGAAACTG TTAAAATTCC 6420 

AATTTGTTTT CATTCAGTAT TAGTTTAGTT CTAAATATAG CAAACCCCAT CCAGGTGCTA 6480 

TCAGATGACC AGTTACTGCT TAGTTAACTA GGTGTAAAGT TTTACATATA CATTAATTTC 6540 

AATAGTTTAT TACAAGTTGT GT7AAATGGA CTCTAGTTTA ATAATGGGGG AAAAAAGATT 6600 

AGGTTGCTCC TGAAACTGAC TGTAGAGCAT GTAAAATGAT TTTACTGGAT TCTGTTCAAC 6660 

TGTAATCAAT GAAAAAGATG TACGTTGTAG ACAAAGTTGC AGAATTATU^ AAAGAAATCT 6720 

GCTTTTAATT TATTCTTTTT GTATTAAGAA TTTGTATAGT ATCTTTACAT TTTGCAAAAC 6780 
AGT6TTGTCA ACACTTATTA AAGCATTTTC AAAATG 

Seq ID NO: 32 Protein secmence: 
Protein Accession #: BAA92532 



MAVQLVPDSA 
FGIAFTDETG 
NAKSCIYKBL 
AYCEDRVIBH 
FQYDYHDKVK 
PALIKSXWAM 
IISGSSGSLL 
GKLPVEYPLD 
RLASDPNVSK 
EDSSLSDAIiV 
YDKSPIKPKM 
GLPHWNSQSS 
SENDTGSPDP 
QRQRQRQRAA 
lEGGATPVW 
SQILRTPSLG 
TSSQSTFVAH 
ENSPILDGSE 



11 
I 

LGLIiMMTBGR 
HIiNwiiQIiDRR 
ZDVDSEWFB 
YKKIiNGQTRG 
PRKZFQWRQL 
AISQHQFYLD 
SSGSQESDSS 
PGEEPPIVRR 
KLKKQRKTSY 
IiEDEDSQVTS 
WSESSIiDEPY 
MPSTPDLRVR 
YTPRTRSSNG 
GALGSASSGS 
RSLESDQECH 
REGAHDKGAG 
SRVTRMPQMC 
SPPHQSTDE 



21 
I 

RCQVHLLDDR 
VIiEHDFPKKS 
LASYILQEAK 
QAIVNYMSIV 
BNLYFREKKF 
RKQSKSKIHA 
QSAKKDNLAA 
RIGTAFKLDE 
liNAIiKKLQEI 
TISPLHSPHK 
EKVKKRSSHS 
SPHYVHSTRS 
SDPMDDCSSC 
MPNIiAARGQA 
YSVKAQFKTS 
RAAVSDEIiRQ 
KATSAALPQS 



31 
I 

KIjEIjIjVQPKIj 
GPWIiYFCVR 
GDFSSNEWR 
ESLPTYGVHY 
SVEVHDPRRA 
ARSIiSEIAID 
LKSRQEAIjEE 
QKILPKGEEA 
ENAINENRIK 
GLPPRPPSHN 
HSSSHKRPPS 
VDISPTRIjHS 
TSHSSSEHYY 
GGAGGAGGGV 
NSYTAGGLPK 
WYQRSTASHK 
QRSSTPSSEI 



41 
I 

IiAKEIiIiDIiVA 
FYIESISYliK 
SDLKKLPALP 
YAVKDKQ6IP 
SVTRRTFGHS 
IjTETGTIjKTS 
TLRQRLEEIiK 
ELERLEREFA 
SGKKPTQRAS 
RPPPPQSLEG 
TGSCAEAGGG 
LALHFRHRSS 
PAQMNANYST 
YIiHSQSQPSS 
ESWRGGGGDE 
EHSRLSHTSS 
GATPPSSPHH 



51 
I 

SHFNLKEKEY 
DNATIBLFFZf 
TQAXiKEHPSIi 
WWLGIiSYKGI 
GIAVHTWYAC 
KLANMGSKGK 
KLCIiRET^LT 
IQSQITEAAR 
LIIDDGNIAS 
LRQMHYHRND 
SNSLQNSPIR 
SIjESQGKIjLG 
LAEDSPSKAR 
QYRIKEYPLY 
GDTGRI*TPSR 
TSSDSGSQYS 
IIiTWQTGEAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 



Seq ID NO: 33 Nucleotide sequence;. 
Nucleic Acid Accession #: NM_014331 

Coding sequence: 1..1506 (underlined sec[uences correspond to start and stop codons) 



'l 11 21 31 41 51 

I I I I I I 

ATGG TCAGAA AGCCTGTTGT 6TCCACCATC TCCATVAGGAG GTTACCTGCA GGGAAATGTT 60 

AACGGGAG6C TGCCTTCCCT GGGCAACAAG GAGCCACCTG GGCAGGAGAA AGTGCAGCTG 120 

AAGAGGAAAG TCACTTTACT GAGGGGAGTC TCCATTATCA TTGGCACCAT CATTGGAGCA 180 

GGAATCTTCA TCTCTCCTAA GGGCX3TGCTC CAGAACACGG GCAGCGTGGG CATGTCTCTG 24 0 

ACCATCTGGA CGGTGTGTGG GGTCCTGTCA CTATTTGGAG CTTTGTCTTA TGCTGAATTG 300 

GGAACAACTA TAAAGAAATC TGGAGGTCAT TACACATATA TTTTGGAAGT CTTTGGTCCA 360 

TTACCAGCTT TTGTACGAGT CTGGGTGGAA CTCCTCATAA TACGCCCTGC AGCTACTGCT 420 

GTGATATCCC T6GCATTTGG AC6CTACATT CTGGAACCAT TTTTTATTCA ATGTGAAATC 480 
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CCTGAACTTG CGATCAAGCT CATTACAGCT 
AGCATGAGTG TCAGCTGGAG CGCCCGGATC 
GCAATTCTGA TAATTATAGT CCCTGGAGTT 
TTTAAAGACG CGTTTTCAGG AAGAGATTCA 
5 TATG6AATGT ATGCATATGC TGGCTGGTTT 
AACCCTGT^ AAACCATTCC CCTTGCAATA 
TATGTGCTGA CAAAT6TGGC CTACTTTAC6 
AAT6CAGTGG CAGTGACCTT TTCTGA6C66 
ATCTTTGTTG CCCTCTCCTG CTTTGGCTCC 

10 TTATTCTATG TTGCGTCTCG A6AGGGTCAC 
CGCAAGCACA- CTCCTCTACC AGCTGTTATT 
TTCTCTGGAG ACCTC6ACAG TCTTTTGAAT 
G6GCTGG&A6 TTGCTGG6CT GATTTATCTT 
TTCAAGGTGC CACTGTTCAT CCCAGCTTTG 

15 CTTTCCCTCT ATTCX3GACCC ATTTA6TACA 
GTCCCTGCGT ATTATCTCTT TATTATATGG 
TCAGAGAAAA TAACCAGAAC ATTACAAATA 
TTATGAACTA ATGGACTTGA GATCTTGGCA 
TTTTTACTTC ATTTTCTGAA AGTCTAGAGA 

20 CAGTTATTTT TATTCATATA TTTTAGCATA 
AACTCTATGT AGTTATAGAA AGTGAATATG 
GTCTCTGATA CCTACCTATT GGGGTTAGGA 
TCTCTACAAC ATATGTTAGC ACGGCAAAGA 
TATATATGGG TTTTGTAAAG ATGGTTTTAC 

25 TTTTCAATTC TGAAAAAAAG CATACATCAT 
ATTTTACATT GACATTGCAT TGCTTCCCCT 
GCTTTAATGG ATTATACCCA GAGCACTTTG 
TTAAAGAAGA GTTTCTAGGG GCTACTGTTT 
AAAAATCCTT GAGAATTTAT TATGTCAGAT 

30 TTATCTGTCA 'iTY TT T T riT TCACATCAGT 
AGCAAGAGTT AGTTTGGTAT TAAATCCTCA 
TACCCCTGAT GAGTCTATCT AAACATATGC 
TGAGAGAAAT AACCAACAAA GAAGATGTTC 
CTACATGCAA TGTTAGTAAT TCTGAAGTTT 

35 TGAATTTT6A CAGTTTGT6C ATTTTCTTTA 
CTTCAGATGA AACTGTCCAG ATTAATTAGG 
AAGAAATGTC GCTGTAAATA AGATTTACAA 
ATCTAGGCTT TGTCAGTAAT TTCCACACCT 
CTGATAAGAA GAAAATTGAA ATGAGAATCT 

40 GTTTTGCCAG TATTAGAAAA TACTGTGAGC 
GCACTTTGGG AGGCTGAGGG GGTGGATCAC 
CAACATGGAG AAACCCCATC TCTACTAAAA 
GCTGGTAATC TCAGCTATTG AGGAGGCTGA 
GAGGTTGCAG TGAGCCAAGA TTGCACCACT 

45 CCATCTCCAA AAAAAT^AAAA AAAA 



GTGGGCATAA CTGTAGTGAT GGTCCTAAAT 540 

CAGATTTTCT TAACCTTTTG CAAGCTCACA 600 

ATGCAGCTAA TTAAAGGTCA AACGCAGAAC 660 

AGTATTACGC GGTTGCCACT GGCTTTTTAT 720 

TACCTCAACT TTGTTACTGA AGAAGTAGAA 780 

TGTATATCCA TGGCCATTGT CACCATTGGC 840 

ACCATTAATG CTGAG6AGCT GCTGCTTTCA 900 

CTACTGGGAA ATTTCTCATT AGCAGTTCCG 960 

ATGAACGGTG GTGTGTTTGC TGTCTCCA6G 1020 

CTTCCAGAAA TCCTCTCCAT GATTCATGTC 1080 

GTTTTGCACC CTTTGACAAT GATAATGCTC 1140 

TTCCTCAGTT TTGCCAGGTO GCTTTTTATT 1200 

C6ATACAAAT GCCCA6ATAT GCATC6TCCT 1260 

TTTTCCTTCA CAT6CCTCTT CATGGTTGCC 1320 

GGGATTGGCT TCGTCATCAC TCTGACTGGA 1380 

GACAAGAAAC CCAGGTGGTT TAGAATAATG 144 0 

ATACTGGAAG TTGTACCAGA AGAAGATAAG 1500 

ATCTGCCCAA GGGGAGACAC AAAATAGGGA 1560 

ATTACAACTT TGGTGATAAA CAAAAGGAGT 1620 

TTCGAACTAA TTTCTAAGAA ATTTAGTTAT 1680 

CAGTTATTCT ATGAGTCGCA CT^TTCTTGA 1740 

GAAAAGACTA GACAATTACT ATGTGGTCAT 1800 

ACCTTCAAAT TGAAGACTGA GATTTTTCTG 1860 

ACACTACAGA TGTCTATACT GTGAAAAGTG 1920 

GATTATGGCA AAGAGGAGAG AAAGAAATTT 1980 

TAGATACCAA TTTAGATAAC AAACACTCAT 2040 

AACAAAGGTC AGTGGGGATT GTTGAATACA 2100 

ATGAGACACA TCCAGGAGTT ATGTTTAAGT 2160 

GTTTTTTCAT TCATTATCAG GAAGTTTTAG 2220 

TTGATCAGGA AAGTGTATAA CACATCTTAG 2280 

TTAGAACAAC CACCTGTTTC ACTAATAACT 2340 

ATTTTAAGCC TTCAAATTAC ATTATCAACA 2400 

AAAATAATAG TCCCATATCT GTAATCATAT 2460 

TTTAAATTTA TGGCTATTTT TACACGATGA 2520 

TACATTTTAT ATTCTTCTGT TAAAATATCT 2580 

A7UVAGGCATA TATTAACATA AAAATTGCAA 2640 

CTGATGTTTC TAGAAAATTT CCACTTCTAT 2700 

TAATTATCAT TCAACTTGCA AAAGAGACAA 2760 

GTGGATAAGT GTTTGTGTTC AGAAGATGTT 282 0 

CGGGCAT6GT GGCTTACATC TGTAATCCCA 2880 

CTGAGGTCGG GAGTTCTAGA CCAGCCTGAC 294 0 

ATACAA/UITT AGCTGGGCAT GGTGGCACAT 3000 

G6CAGGAGAA TTGCTTGAAC CCGGGAGGC6 3060 

GTACTCCAGC CTGGGTGACA AAGTCAGACT 3120 



Seq ID HO: 34 Ppot^jLn aequenc^; 
Protein Accession #: NP 055146 

50 

1 . 11 21 31 41 51 

I I I I i t 

MVRKPWSTI SKGGYLQGNV NGRLPSLGHK EPPGQEKVQL KRKVTLIiRGV SIIIGTZIGA 60 
55 GIPISPKGVL QNTGSVGMSL TIWTVCXSVLS IiFGALSYAEL GTTIKKSGGH YTYILBVPGP 120 
IiPAPVRVWVE LLIIRPAATA VISIAFGRYI IiEPFPIQCBI PELAIKLITA VGITWMVLN 180 
SMSVSWSARI QIFLTPCKIiT AILIIIVPGV MQIiIKGQTQN FKDAFSGRDS SITRLPLAFY 24 0 
YGMYAYAGWF YLNFVTEEVE NPEKTIPLAI CISMAIVTIG YVIiTNVAYPT TINAEELLIiS 300 
NAVAVTFSER LLGNFSLAVP IFVALSCFGS MNGGVFAVSR LFYVASREGH LPEILSMIHV 360 
60 RKHTPLPAVI VIiHPLTMIMli FSGDLDSLLN FLSFARWLPI GIiAVAGIiIYIi RYKCPDMHRP 420 
FKVPLFIPAIi FSFTCLFMVA LSLYSDPFST GIGFVITLTQ VPAYYIjFIIW DKKPRWPRIM 480 
SEKITRTIiQI ILEWPEEDK Ii 



65 Seq ID NO: 35 Nucleotide secmence; 

Nucleic Acid Accession # : NM_002422 

Coding sequence: 64.. 14 97 (underlined sequences correspond to start and stop codons) 



70 1 11 21 31 41 51 

I i I I I I 

ACAAGGAG6C AGGCT^ACA GCAAGGCATA GAGACAACAT AGAGCTAAGT AAAGCCAGTG 60 

GA AATGA AGA GTCTTCCAAT CCTACTGTTG CTGTGCGTGG CAGTTTGCTC AGCCTATCCA 120 

TTGGATG6AG CTGCAAGGGG TGAGGACACC AGCATGAACC TTGTTCAGAA ATATCTAGAA 180 

75 AACTACTACG ACCTCAAAAA AGATGTGAAA CAGTTT6TTA 6GAGAAAGGA CAGTGGTCCT 240 

GTT6TTAAAA AAATCCGAGA AATGCAGAAG TTCCTT66AT TGGAGGTGAC GGGGAAGCT6 300 
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GACTCCGACA CTCTGGAGGT GATGCGCAAG CCCAGGTGTG GAGTTCCT6A TGTTGGTCAC 360 

TTCAGAACCT TTCCTGGCAT CCCGAAGTGG AGGAAAACCC ACCTTACATA CAGGATTGTG 420 

AATTATACAC CAGATTTGCC AAAA6ATGCT GTTGATTCTG CTGTTGAGAA AGCTCTGAAA 480 

GTCTGGGAAG AGGTGACTCC ACTCACATTC TCCAGGCTGT ATGAAGGAGA GGCTGATATA 540 

5 ATGATCTCTT TTGCAGTTAG AGAACATGOA GACTTTTACC CTTTTGATGG ACCTGGATU^T 600 

GTTTTGGCCC ATGCCTATGC CCCTGGGCCA GGGATTAATG GAQATGCCCA CTTTGATGAT 660 

GATGAACAAT G6ACAAAGGA TACAACAGGG ACCAATTTAT TTCTCGTTGC TGCTCATGAA 720 

ATTGGCCACT CCCTGGGTCT CTTTCACTCA GCCAACACTG AAGCTTTGAT GTACCCACTC 780 

TATCACTCAC TCACAGACCT GACTCGGTTC CGCCTGTCTC AAGATGATAT AAATGGCATT 840 

10 CAGTCCCTCT ATGGACCTCC CCCTGACTCC CCTGAGACCC CCCTG6TACC CACGGAACCT 900 

GTCCCTCCAG AACCTGGGAC GCCAGCCAAC TGTGATCCTG CTTTGTCCTT TGATGCTGTC 960 

AGCACTCTGA GGGGAGAAAT CCTGATCTTT AAAGACAGGC ACTTTTGGCG CAAATCCCTC 1020 

AGGAAGCTTG AACCTGAATT GCATTTGATC TCTTCATTTT GGCCATCTCT TCCTTCAGGC 1080 

GTGGATGCCG CATATGAAGT TACTAGCAAG GACCTCGTTT TCATTTTTAA AGGAAATCAA 1140 

15 TTCTGQGCCA TCAGAGGAAA TGAGGTACGA GCTGGATACC CAAGAGGCAT CCACACCCTA 1200 

GGTTTCCCTC CAACCGTGAG GAAAATCGAT GCAGCCATTT CTGATAAGGA AAAGAACAAA 1260 

ACATATTTCT TTGTAGAGGA CAAATACTGG AGATTTGATG AGAAGAGTUUV TTCCATGGAG 1320 

CCAGGCTTTC CCAAGCAAAT AGCTGAAGAC TTTCCAGGGA TTGACTCAAA GATTGATGCT 1380 

GTTTTTGAAG AATTTGGGTT CTTTTATTTC TTTACTGGAT CTTCACAGTT GGAGTTTGAC 1440 

20 CCAAATGCAA AGAAAGTGAC ACACACTTTG AAGAGTAACA GCTGGCTTAA TTGTTGAAAG 1500 

AGATATGTAG AAGGCACAAT ATGGGCACTT TAAATGAAGC TAATAATTCT TCACCTAAGT 1560 

CTCTGTGAAT TGAAATGTTC GTTTTCTCCT GCCTGTGCTG TQACTCGAGT CACACTCAAG 1620 

GGAACTTGAG CGTGAATCTG TATCTTGCCG GTCATTTTTA TGTTATTACA GGGCATTCAA 1680 

ATGGGCTGCT GCTTAGCTTQ CACCTTGTCA CATA6AGTGA TCTTTCCCAA GAGAAGGGGA 1740 

25 AGCACTCGTG TGCAACAGAC AAGTGACTGT ATCTGTGTAG ACTATTTGCT TATTTAATAA IBOO 

AGACGATTTG TCAGTTGTTT T 
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Seq ID NO: 36 Protein aeauence! 
Protein Accession #: NP 002413 



35 



40 



1 
I 

MKSIiPIIiLIiL 
VKKIREMQKF 
YTPDIjPKDAV 
LAHAYAPGPG 
HSLTDLTRPR 
TLRGEILIPK 
WAIRGNEVRA 
GFPKQIAEDP 



11 
I 

CVAVCSAYPL 
LGIiEVTGKLD 
DSAVEKALKV 
INGDAHFDDD 
LSQDDINGIQ 
DRHFWRKSLR 
GYPRGIHTIiG 
PGIDSKIDAV 



21 
I 

DGAAR6EDTS 
SDTLEVMRKP 
WEEVTPLTFS 
EQWTKDTTGT 
SliYGPPPDSP 
KLEPEIjHLIS 
PPPTVRKIDA 
PBEPGFFYFP 



31 
I 

MNLVQKYLEN 
RCGVPDVGHF 
RLYEGEADIM 
NLFIiVAAHEI 
ETPLVPTEPV 
SFWPSLPSGV 
AISDKEKNKT 
TGSSQLEFDP 



41 
I 

YYDLKKDVKQ 
RTFPGIPKWR 
ISFAVREHGD 
GHSIiGLFHSA 
PPEPGTPANC 
DAAYBVTSKD 
YFFVEDKYWR 
NAKKVTHTIiK 



51 
I 

FVRRKDSGPV 60 

KTHLTYRIVN 120 

PYPFDGPGNV 180 

NTEALMYPLY 240 

DPALSFDAVS 300 

LVFIFKGNQF 360 

FDBKRNSMEP 420 
SNSWLNC 



45 Seq ID NO: 37 Nucleotide sequence: 
Nucleic Acid Accession #: NM_003246 

Coding sequence: 112.. 3 624 (miderlined sequences correspond to start and stop codons) 



50 



55 



60 



65 



70 



75 



1 
I 

GGACGCACAG 
CGCCGCGCTC 
GCCTGGGGAC 
TCTGGCGGAG 
TCTGGGCGCC 
GCCAACCTGA 
GCAGATikAAGG 
CTGCTGGCCC 
AAGGCGGGCA 
GAAGAAGCTC 
AGGGCCCAGC 
CAAAGCGTCT 
GTCAATGACA 
GAAGACATCC 
AACAACGTGG 
AAGGACTTGC 
CTCAGG66CC 
GAGAACAAAG 
CAGTACAGAA 
TCAGTTACCA 
CCTGATGGAG 
CCATGGTCCG 
CGCTCCTGC3G 
CACATTCA6G 
T6GTCATCTT 



11 
I 

GCATTCCCCG 
CGGTACACAC 
TAGGCGTCCT 
ACAACAGCGT 
GACTGGTGAA 
TCCCCCCTGT 
GTTTCCTCCT 
TGGAGCGGAA 
CCCTGGACCT 
TCCTGGCAAC 
TGTACATCGA 
TCACCAGAGA 
ATTTCCAGGG 
TCAGGAACAA 
TGAATGGTTC 
AAGCCATCTO 
TGCGCACCAT 
AGTTGGCCAA 
ATAAC6AGGA 
TCTGCAAAAA 
AATGCTGTCC 
AGTG6ACCTC 
ATAGCCTCAA 
AGTGTQACAA 
GTTCTGT6AC 



21 
I 

CGCCCCTCCA 
AGGATCCCTG 
GTTCCTGATG 
GTTTGACATC 
GGGCCCCGAC 
GCCTGATGAC 
TCTGGCATCC 
AGACCACTCT 
CAGCCTGACC 
CGGCCAGTGG 
CTGTGAAAAG 
CCTGGCCAGC 
GGTGCTGCAG 
AGGCTGCTCC 
CAGCCCTGCC 
CGGCATCTCC 
T6TGACCAC6 
TGAGCT6AGG 
ATG6ACTGTT 
GGTGTCCTGC 
TCGCTGTTGG 
CTGTTCTACG 
CAACCGATGT 
AAGATTTAAA 
ATGTGGTGAT 



31 
I 

GCCCTCGCCG 
CTGGGCACCA 
CATGTGTGTG 
TTTGAACTCA 
CCTTCCAGCC 
AAGTTCCAAG 
CTGAGGCAGA 
GGCCAGGTCT 
GTCCAAGQAA 
AAGAGCATCA 
ATGGAGAATG 
ATCGCCAGAC 
AATGTGAGGT 
AGCTCTACCA 
ATCCGCACTA 
TGTGAT6AGC 
CTGCAGGACA 
CX3GCCTCCCC 
GATAGCTGCA 
CCCATCATGC 
CCCAGCGACT 
AGCTGTGGCA 
GAGGGCTCCT 
CAGGATGGTG 
GGTGTGATCA 



41 

I 

CCCTCGCCAC 
ACAGCTCCAC 
GCACCAACCG 
CCGGGGCCGC 
CAGCTTTCCG 
ACCTGGTGGA 
TGAAGAAGAC 
TCAGCGTGGT 
AGCAGCACGT 
CCCTGTTTGT 
CTGAGTTGGA 
TCCGCATCGC 
TTGTCTTTGG 
GTGTCCTCCT 
ACTACATTGG 
TGTCCAGCAT 
GCATCC6CAA 
TATGCTATCA 
CTGAGTGTCA 
CCTGCTCCAA 
CTGCGGACGA 
ATGGAATTCA 
CGGTCCAGAC 
GCTGGAGCCA 
CAAGGATCC6 



51 
I 

CGCTCCCGGC 
C^TGGGGCTG 
CATTCCAGAG 
CCGCAAGGGG 
CATCGAGGAT 
TGCTGTGCGG 
CCGGGGCACG 
GTCCAATGGC 
GGTGTCTGTG 
GCAGGAAGAC 
CGTCCCCATC 
AAAGGGGGGC 
AACCACACCA 
CACCCTTGAC 
CCACAAGACA 
GGTCCT6GAA 
AGTGACTGAA 
CAACXSGAGTT 
CTGTCA6AAC 
TGCCACAGTT 
TGGCTGGTCT 
GCAGCGCGGC 
ACGGACCTGC 
CTGGTCCCCG 
6CTCTGCAAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
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TCTCCCAGCC CCCAGATGAA TGGGAAACCC TGTGAAGGCG AAGCGCGGGA GACCAAAGCC 1560 

TGCAAGAAAG ACGCCTGCCC CATCAATGGA GGCTGGGGTC CTTGGTCACC ATGGGACATC 1620 

TGTTCTGTCA CCTGTGGAGG AGGGGTACAG AAACGTAGTC GTCTCTGCAA CAACCCCGCA 1680 

CCCCAGTTTG GAGGCAAGGA CTGCGTTGGT GATGTAACAG AAAACCAGAT CTGCAACAAG 174 0 

5 CAGGACTGTC CAATTGATGG ATGCCTGTCC AATCCCTGCT TTGCCGGCGT GAAGTGTACT 1800 

AGCTACCCTG ATGGCAGCTG GAAATGTGGT GCTTGTCCCC CTGGTTACAG TGGAAATGGC 1860 

ATCCAGTGCA CAGATGTTGA TGAGTGCAAA GAAGTGCCTG ATGCCTGCTT CAACCACAAT 1920 

GGAGAGCACC GGTGTGAGAA CAC6GACCCC GGCTACAACT 6CCTGCCCTG CCCCCCACGC 1980 

TTCACC66CT CACAGCCCTT CG6CCAGGGT 6TCGAACATG CCACG6CCAA CAAACAGGT6 2040 

10 TGCAAGCCCC GTAACCCCTG CACGGAT6G6 ACCCAC6ACT GCAACAA6AA C6CCAA6T6C 2100 

AACTACCTGG GCCACTATAG CGACCCCATG TACCGCTGCX5 AGTGCAAGCC TGGCTACGCT 2160 

66CAATGGCA TCATCTGCGG GGA6GACACA GACXTTGGATG GCT6GCCCAA TGAGAACCTG 2220 

GTGTGCGTGG CCAATGCX3AC TTACCACTGC AAAAAG6ATA ATTGCCCCAA CCTTCCCAAC 2280 

TCAGGGCAGG AAGACTATGA CAAGGATG6A ATTGGTGATG CCT6TGATGA TGACGATGAC 2340 

15 AATGATAAAA TTCCAGATGA CAGGGACAAC TGTCCATTCC ATTACAACCC AGCTCAGTAT 2400 

GACTATGACA GAGATGATGT GGGAGACCGC TGTGACAACT GTCCCTACAA CCACAACCCA 24 60 

GATCAGGCAG ACACAGACAA CAATGGGGAA GGAGACGCCT GTGCTGCAGA CATTGATGGA 2520 

GACGGTATCC TCAATGAACG GGACAACTGC CAGTACGTCT ACAATGTGGA CCAGAGAGAC 2580 

ACTGATATGG ATGGGGTTGG AGATCAGTGT GACAATTGCC CCTTGGAACA CAATCCGGAT 264 0 

20 CAGCTGGACT CTGACTCAGA CCGCATTGGA GATACCTGTG ACAACAATCA GGATATTGAT 2700 

GAAGATGGCC ACCAGAACAA TCTGGACAAC TGTCCCTATG TGCCCAATGC CAACCAGGCT 2760 

GACCATGACA TAGATGGCAA ' GGGAGATGCC TGTGACCACG ATGATGACAA CGATGGCATT 2 820 

CCTGATGACA AGGACAACTG CAGACTCGTG CCCAATCCCG ACCAGAAGGA CTCTGACGGC 2880 

GATGGTCGAG GTGATGCCTG CAAAGATGAT TTTGACCATG ACAGTGTGCC AGACATCGAT 294 0 

25 GACATCTGTC CTGAGAATGT TGACATCAGT GAGACCGATT TCCGCCGATT CCAGATGATT 3000 

CCTCTGGACC CCAAAGGGAC ATCCCAAAAT GACCCTAACT GGGTTGTACG CCATCAGGGT 3060 

AAAGAACTCG TCCAGACTGT CAACTGTGAT CCTGGACTCG CTGTAGGTTA TGATGAGTTT 312 0 

AATGCTGTGG ACTTCAGTGG CACCTTCTTC ATCAACACCG AAAGGGACGA TGACTATGCT 3180 

GGATTTGTCT TTGGCTACCA GTCCAGCAGC CGCTTTTATG TTGTGATGTG GAAGCAAGTC 324 0 

30 ACCCAGTCCT ACTGGGACAC CAACCCCACG AGGGCTCAGG GATACTCGGG CCTTTCTGTG 3300 

AAAGTTGTAA ACTCCACCAC AGGGCCTGGC 6AGCACCTGC 6GAAC6CCCT GT6GCACACA 3360 

GGAAACACCC CTGGCCAGGT 6CGCACCCTG TG6CATGACC CTCGTCACAT AGGCT66AAA 3420 

6ATTTCACCG CCTACAGAT6 GCX^TCTCAGC CACA66CCAA AGACGGGTTT CATTAGAGTG 3480 

GT6ATOTATG AAGG6AAGAA AATCATGGCT 6ACTCAGGAC CCATCTATGA TAAAACCTAT 3540 

35 6CTGGT66TA GACTAGGGTT GTTT6TCTTC TCTCAAGAAA T66T6TTCTT CTCTGACCTG 3600 

AAATACGAAT GTAGA6ATCC CTAATCATCA AATTGTTGAT TGAAAGACTG ATCATAAACC 3660 

AAT6CTGGTA TTGCACCTTC TGGAACTATG GGCTTGAGT^ AACCCCCAGG ATCACTTCTC 3720 

CTTGGCTTCC TTCTTTTCTG TGCTTGCATC A6TGTGGACT CCTAGAACGT GCGACCTGCC 3780 

TCAAGAAAAT GCAGTTTTCA AAAACAGACT CATCAGCATT CAGCCTCCAA TGAATAAGAC 3840 

40 ATCTTCCAAG CATATAAACA ATTGCTTTGG TTTCCTTTTG AAAAAGCATC TACTTGCTTC 3 900 

AGTTGGGAAG GTGCCCATTC CACTCTGCCT TTGTCACAGA GCAGGGTGCT ATTGTGAGGC 3960 

CATCTCTGAG CAGTGGACTC AA7VAGCATTT TCAGGCATGT CAGAGAAGGG AGGACTCACT 4020 

AGAATTAGCA AACAAAACCA CCCTGACATC CTCCTTCAGG AACACGGGGA GCAGAGGCCA 4080 

AAGCACTAAG GGGAGGGCGC ATACCCGAGA CGATTGTATG AAGAAAATAT GGAGGAACTG 4140 

45 TTACATGTTC GGTACTAAGT CATTTTCAGG GGATTGAAAG ACTATTGCTG GATTTCATGA 4200 

TGCTGACTGG CGTTAGCTGA TT/VACCCATG TAAATAGGCA CTTAT^TAGA AGCAGGAAAG 4260 

GGAGACAAAG ACTGGCTTCT GGACTTCCTC CCTGATCCCC ACCCTTACTC ATCACCTTGC 4320 

AGTGGCCAGA ATTAGGGAAT CAGAATCAAA CCAGTGTAAG 6CAGT6CT6G CTGCXIATTGC 4380 

CTGGTCACAT TGAAATTGGT GGCTTCATTC TAGATGTAGC TT6T6CAGAT GTAGCA66AA 4440 

50 AATA66AAAA CCTACCATCT CAGTGAGCAC CAGCTGCCTC CCAAAGGAGG GGCAGCC6TG 4500 

CTTATATTTT TATGGTTACA ATGGCACA7UV ATTATTATCA ACCTAACTAA AACATTCCTT 4560 

TTCTCTTTTT TCCGTAATTA CTAG6TAGTT TTCTAATTCT CTCTTTTG6A AGTATGATTT 4620 

TTTTAAAGTC TTTACGAT6T AAAATATTTA TTTTTTACTT ATTCTGGAAG ATCTGGCTGA 4660 

A66ATTATTC AT6GAACAG6 AAGAAGCGTA AAGACTATCC AT6TCATCTT TGTTGA6AGT 4740 

55 CTTCGTGACT GTAAGATT6T AAATACAGAT TATTTATTAA CTCTGTTCTG CCTOGAAATT 4800 

TAGGCTTCAT ACGGAAAGTG TTTGAGAGCA AGTAGTTGAC ATTTATCAGC AAATCTCTTG 4860 

CAAGAACAGC ACAAGGAAAA TCAGTCTAAT AAGCTGCTCT GCCCCTTGTG CTCAGAGTGG 4920 

ATGTTATGGG ATTCCTTTTT TCTCTGTTTT ATCTTTTCAA GTGGAATTAG TTGGTTATCC 4 980 

ATTTGCAAAT GTTTTAAATT GCAAAGAT^G CCATGAGGTC TTCAATACTG TTTTACCCCA 504 0 

60 TCCCTTGTGC ATATTTCCAG GGAGAAGGAA AGCATATACA CTTTTTTCTT TCATTTTTCC 5100 

AAAAGAGAAA AAAATGACAA AAGGTGAAAC TTACATACAA ATATTACCTC ATTTGTTGTG 5160 

TGACTGAGTA AAGAATTTTT GGATCAAGCG GAAAGAGTTT AAGTGTCTAA CAAACTTAAA 5220 

GCTACTGTAG TACCTAAAAA GTCAGTGTTG TACATAGCAT AAAAACTCTG CAGAGAAGTA 5280 

TTCCCAATAA GGAAATAGCA TTGAAATGTT AAATACAATT TCTGAAAGTT ATGTTTTTTT 534 0 

65 TCTATCATCT GGTATACCAT TGCTTTATTT TTATAAATTA TTTTCTCATT GCCATTGGAA 54 00 

TAGAATATTC AGATTGTGTA GATATGCTAT TTAAATAATT TATCAGGAAA TACTGCCTGT 5460 

AGAGTTAGTA TTTCTATTTT TATATAATGT TTGCACACTG AATTGAAGAA TTGTTGGTTT 5520 

TTTCTTTTTT TTGTTTTTTT tTTTTTTTTT TTTTTTTTTG CTTTTGACCT CCCATTTTTA 5580 

CTATTTGCCA ATACCTTTTT CTAGGAATGT GCTTTTTTTT GTACACATTT TTATCCATTT 5640 

70 TACATTCTAA AGCAGTGTAA GTTGTATATT ACTGTTTCTT ATGTACAAGG AACAACAATA 5700 
AATCATATGG AAATTTATAT TT 



75 



Seq ID NO: 38 Protein sequence; 
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1 11 21 31 41 51 

I I I I I I 

MGIiAWGLGVL FLMHVCGTNR IPESGGDNSV FDIPELTGAA RKGSGRRIiVK GPDPSSPAFR 60 

lEDANIiIPPV PDDKFQDLVD AVRAEKGFLL lASLRQMKKT RGTIiIAIiERK DHSGQVFSW 120 

5 SNGKAGTIiDL SLTVQGKQHV VSVEEALIAT 6QWKSITLFV QEDRAQLYID CEKMENAELD 180 

VPIQSVFTRD LASIARLRIA KGGVNDNPQG VLQNVRFVPG TTPEDILRNK GCSSSTSVLL 240 

TLDNNWNGS SPAIRTNYIG HKTKDLQAIC GISCDBLSSM VLELRGLRTI VTTIiQDSIRK 300 

VTEENKELAN ELRRPPLCYH NGVQYRNNEE WTVDSCTECH CQNSVTICKK VSCPIMPCSN 360 

ATVPDGECCP RCWPSDSADD GWSPWSEWTS CSTSCGNGIQ QRGRSCDSLN NRCEGSSVQT 420 

10 RTCHIQBCDK RFKQDGGWSH WSPWSSCSVT CGDGVITRIR LCNSPSPQMN GKPCEGBARE 4 80 

TKACKKDACP INGGWGPWSP WDICSVTCGG GVQKRSRIiCN NPAPQFGGKD CVGDVTENQI 540 

CNKQDCPIDG CLSNPCFAGV KCTSYPDGSW KCGACPPGYS GNGIQCTDVD ECKEVPDACF 600 

NHNGEHRCEN TDPGYNCLPC PPRFTGSQPF GQGVEHATAN KQVCKPRNPC TDGTHDCNKN 660 

AKCNYLGHYS DPMYRCECKP GYAGNGIICG EDTDLDGWPN ENLVCVANAT YHCKKDNCPN 720 

15 LPNSGQEDYD KDGIGDACDD DDDNDKIPDD RDNCPPHYNP AQYDYDRDDV GDRCDNCPYN 780 

HNPDQADTDN NGEGDACAAD IDGDGILNER DNCQYVYNVD QRDTDMDGVG DQCDNCPLEH 840 

NPDQLDSDSD RIGDTCDNNQ DIDBDGHQNN LDNCPYVPNA NQADHDKDGK GDACDHDDDN 900 

DGIPDDKDNC RLVPNPDQKD SDGDGRGDAC KDDFDHDSVP DIDDICPENV DISETDFRRF 960 

QMIPLDPKGT SQNDPNWWR HQGKEIiVQTV NCDPGIiAVGY DEPNAVDFSG TPPINTERDD 1020 

20 DYAGFVFGYQ SSSRFYWMW KQVTQSYWDT NPTRAQGYSG LSVKWNSTT GPGEHIjRNAIi 1080 

WHTGNTPGQV RTIiWHDPRHI GWKDPTAYRW RLSHRPKTGP IRWMYEGKK IMADSGPIYD 1140 
KTYAGGRLGL FVFSQEMVFF SDIiKYECRDP 



25 

Seq ZD NO: 39 Nucleotide sequence; 
Nucleic Acid Accession #: BC004299 

Coding sequence: 6 9.. 123 5 (underlined sequences correspond to start and stop codons) 

30 

1 11 21 31 41 51 

I I I I I I 

CCCGACCCGT GCGAGGGCCA GGTCCGCGCC TGCCCCGCCA GGCGAAGCGA GGCGACCCGC 60 

GTGCGGCCAT GGCTTCGCTG CTGGGAGCCT ACCCTTGGCC CGAGGGTCTC GAGTGCCCGG 120 

35 CCCTGQACGC CGAGCTGTCG GATGGACAAT CGCCGCCGGC CGTCCCCCGG CCCCCGGGGG 180 

ACAAGGGCTC CGAGAGCCGT ATCCGGCGGC CCATGAACGC CTTCATGGTT TGGGCCAAGG 240 

ACGAGAGGAA ACGGCTGGCA GTGCAGAACC CGGACCTGCA CT^CGCCGAG CTCAGCAAGA 300 

TGCTGGGAAA GTCGTGGAAG GCGCTGACGC TGTCCCAGAA GAGGCCGTAC GTGGACGAGG 360 

CGGAGCGGCT GCGCCTGCAG CACATGCAGG ACTACCCCAA CTACAAGTAC CGGCCGCGCA 420 

40 GGAAGAAGCA GGCCAAGCGG CTGTGCAAGC GCGTGGACCC GGGCTTCCTT CTGA6CTCCC 480 

TCTCCCGGGA CCAGAACGCC CTGCCGGAGA AGAGAAGCGG CAGCCGGGGG GCGCT6GGGG 540 

AGAAGGAGGA CAG6GGTGAG TACTCCCCCG GCACTGCCCT GCCCA6CCTC G6GGGCT6CT 600 

ACCACGA6GG 6CCGGCTGGT GGTGGC6GCG GCGGCACCCC GAGCA6T6T6 GACAC6TACC 660 

C6TACGG6CT GCCCACACCT CCTGAAATGT CTCCCCTG6A CGTGCTGGAG CCGGAGCA6A 720 

45 CCTTCTTCTC CTCCCCCTGC CAGGAGGAGC ATGGCCATCC CCGCCGCATC CCCCACCTGC 780 

CAGGGCACCC GTACTCACCG GAGTACX3CCC CAAGCCCTCT CCACTGTAGC CACCCCCT6G 840 

GCTCrCTGGC CCTTGGCCAG TCCCCCGGCG TCTCCATGAT GTCCCCTGTA CCCGGCTGTC 900 

CCCCATCTCC TGCCTATTAC TCCCCQGCCA CCTACCACCC ACTCCACTCC AACCTCCAAG 960 

CCCACCT6G6 CCAGCTTTCC CCGCCTCCTG AGCACCCTGG CTTCGACGCC CTGGATCAAC 1020 

50 TGAGCCAGGT GGAACTCCTG GGGGACATGG ATCGCAATGA ATTCGACCAG TATTTGAACA 1080 

CTCCTG6CCA CCCAGACTCC GCCACAGGGG CCATGGCCCT CAGTGGGCAT GTTCCGGTCT 1140 

CCCAGGTGAC ACCAACGGGT CCCACAGAGA CCAGCCTCAT CTCCGTCCTG GCTGATGCCA 1200 

CGGCCACGTA CTACAACAGC TACAGTGTGT C ATAGA GCTG GAGGCGCCCC GTCCGGTCAG 1260 

CCCTCGCGCC CTCTCCTTCT TGTGCCTTGA GTGGCAGAGG AGCCGTCCAG CCACACCAGC 132 0 

55 TTTCCTCCCA CCGCTCAGGG CAGGGAGGTC TGAACTGCGG CCCCAGAGCC TTTGGCCTAA 1380 

GCTGGACTCT CCTTATCCGA GTGCCGCCTC TATCCCCTTC CCCACGTTCC AGCCCCTGCA 1440 

GCCCACATTT TAAGTATATT CCTTCAAGTG AGTTTTCCTC CAGCCCCTGA GAGTTGCTGT 1500 

CTCCCAGTGG AATGTTCACT GACGTCTTTT CTTGGTAGCC ATCATCGATU^ CTAATGGGGG 1560 

GACAGACTTG ATAGCCAAGG TCCCTTCTGG TCCAGTTTTC TGATTTAGGG TTCTCTCAAG 1620 

60 ATTAATAAAG GAAGATGGGG AAATTTGACT CATTAATGAG CTCGCTAACC TACGATCTGQ 1680 

TGATAATTTT GTGTGCACAG CCCAAGGACC ACGAGGCTTT CTGCACTTTC TGCACCCCCT 1740 

TCCAAAGTGA CCACAAAATT TCAAAGGGAC TCATACAATT TGAGAAAAAA CAGTCT^CCT 1800 

GATTTGAGAA ATTAACCAGT ATGGCTAACT ATATCACAGA AAATGGGATT QAGTTAAAAC 1860 

TATTTTATTT TAAATATACA TTTTAAAGCA GTTCTTTTTT TTTGTTAATT TGTTTATTAT 1920 

65 ACACACACTT CAAGAGCCAC CGCGCCCAGC CTACATTTAT AATTTTCATT CTCTTTTACC 1980 

TATAAAATTC AGTGTATTAG TTTCATTACA TAGGAGAAAT TATATTTCTA AACATTTTAT 2040 
GATGTTTAAA AACAAAACAG GCT6TTGTAA AAAAAAAAAA AAAAAAAAA 

Seq ID NO: 40 Protein sequencer 
70 Protein Accession #: AAH04299 

' 1 11 21 31 41 51 

I I I I I I 

MASIiliGAYPW PEGIiECPAIiD AELSDGQSPP AVPRPPGDKG SESRIRRPMN AFMVWAKDER 60 

75 KRIAVQNPDL HNAELSKMLG KSWKAI-TLSQ KRPYVDEAER UUjQHMQDYP NYKYRPRRKK 120 

QAiCRIiCKRVD PGFLLSSIjSR DQNALPBKRS GSRGAL6BKB DRGEYSPGTA IiPSItRGCYHE 180 
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GPAGGGGGGT PSSVDTYPYG LPTPPEMSPL DVLEPEQTFF SSPCQEEHGH PRRIPHLPGH 240 

PYSPEYAPSP LHCSHPLGSIi ALGQSPGVSM MSPVPGCPPS PAYYSPATYH PLHSNLQAHL 300 

GQLSPPPEHP GFDALDQLSQ VELLGDMDRN EFDQYIiNTPG HPDSATGAMA LSGHVPVSQV 360 
TPTGPTETSIi ISVLADATAT YYNSYSVS 



10 



Seq ID NO: 41 Nucleotide sequence! * 
Nucleic Acid Accession #: NM_00444d 

Coding sequence: 1..1389 (underlined sequences correspond to start and stop codons) 



15 



20 



25 



30 



35 



40 



1 

I 

ATGATTCAGA 
GAGGACCAGT 
ACCGCGTCCT 
CAGGATTGGC 
CAGGTGAATG 
GTGGGCAGCC 
CCACCCCCAA 
TGGAGTACAG 
GACGTCAACA 
GACGACTTCC 
TACCTCAGAG 
AACTCTCCAC 
TCAGCCTG6A 
TCCACAGTGC 
CCAACAAGTA 
CTGGA6CTCC 
GAGTTCAAGA 
CCCAACATGA 
ATGACCAAGG 
GCCCTCCAGC 
ATGGGCTCCT 
CTCCCCGTGA 
G6GG6TATAT 
TACTACTAA 



11 

I 

CTGTCCCGGA 
CGTTGTTTGA 
CCTCCAGCGA 
TGTCTCAACC 
GCTCAAGGAA 
CAGACACCGT 
ACATGACCAC 
ACCATGTGCG 
TCTTGTTATT 
AGAGGCTCAC 
AGACTCCTCT 
GGTTAATGCA 
CCGGTCACGG 
CCAAAACTGA 
GCCGCCTTGC 
TGTCGGACAG 
TGACX3GATCC 
ACTACGAT7UV 
TCCATGGGAA 
CCCACCCCCC 
ATCACGCCCA 
CATCTTCCAG 
ACCCCAACAC 



21 

I 

CCCAGCAGCT 
GTGTGCCTAC 
GTATGGACAG 
CCCAGCCAGG 
CTCTCCTGAT 
TGGGATGAAC 
GAACGAGCGC 
GCAGTGGCTG 
CCAGAACATC 
CCCCAGCTAC 
TCCACATTTG 
TGCTAGAAAC 
CCACCCCACG 
AGACCAGCGT 
AAATCCA6GC 
CTCCAACTCC 
CGAC6A6GTG 
GCTCAGCCGC 
GCGCTACGCC 
GGAGTCATCT 
CCCACAGAAG 
TTTTTTTGCT 
TAGGCTCCCC 



31 
I 

CATATCAA66 
GGAACGCCAC 
ACTTCCAAGA 
GTCACCATCA 
GAATGCAGTG 
TACGGCAGCT 
AGAGTTATCG 
GAGTGGGCGG 
6ATGGGAAGG 
AACGCCGACA 
ACTTCAGATG 
ACAGATTTAC 
CCCCAGTCGA 
CCTCAGTTAG 
AGTG6CCA6A 
AGCTGCATCA 
GCCCGGCGCT 
GCCCTCCXjTT 
TACAAGTTCG 
CTGTACAA6T 
ATGAACTTTG 
GCCXrCAAACC 
ACCA6CCATA 



41 
I 

AAGCCTTATC 
ACCTGGCTAA 
TGAGCCCACG 
AAATGGAATG 
TGGCCAAAGG 
ACATGGAGGA 
TGCCAGCAGA 
TGAAAGAATA 
AACTGTGCAA 
TCCTTCTCTC 
ATGTTGATAA 
CATATGAGCC 
AAGCTGCTCA 
ATCCTTATCA 
TCCAGCTTTG 
CCTGGGAA6G 
GG6GAGAGGG 
ACTACTATGA 
ACTTCCACGG 
ACCCCTCAGA 
TG6C6CCCCA 
CATACTGGAA 
TGCCTTCTCA 



51 
I 

A6TTGTGAGT 
GACAGAGATG 
CGTCCCTCAG 
TAACCCTAGC 
CGGGAAGATG 
GAAGCACATG 
TCCTACGCTA 
TGGCCTTCCA 
GATGACCAAG 
ACATCTCCAC 
AGCCTTACAA 
CCCCAGGAGA 
ACCATCTCCT 
GATTCTTGGA 
GCAGTTCCTC 
CACCAAC66G 
6AAGA6CAAA 
CAAGAACATC 
GATC6CCCAG 
CCTCCCGTAC 
CCCTCCAGCC 
TTCACCAACT 
TCTGGGCACT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



Seq ID NO: 42 Protein sequence; 
Protein Accession #: NP 004440 



45 



50 



55 



MIQTVPDPAA 
QDWLSQPPT^ 
PPPNMTTNER 
DDFQRLTPSY 
SAWTGHGHPT 
LELLSDSSNS 
MTKVHGKRYA 
LPVTSSSFFA 



11 
I 

HIKEALSWS 
VTIKMBCNPS 
RVIVPADPTL 
NADIUiSHLH 
PQSKAAQPSP 
SCITWEGTN6 
YKFDFHGIAQ 
APNPYMNSPT 



21 
I 

EDQSLFECAY 
QVNGSRNSPD 
WSTDHVRQWI, 
YliRETPLPHIi 
STVPKTEDQR 
EFKMTDPDEV 
ALQPHPPESS 
GGXYPNTRIjP 



31 
I 

GTPHXiAKTEM 
ECSVAK6GKM 
EWAVKEYGLP 
TSDDVDKALQ 
PQIiDPYQILG 
ARRW6ERKSK 
XfYKYPSDLPY 
TSHMPSHLGT 



41 
1 

TASSSSDYGQ 
VGSPDTVGMN 
DVNIIiLFQNI 
NSPRIjNHARN 
PTSSRIiANPG 
PNMNYDKLSR 
MGSYHAHPQK 
YY 



51 
I 

TSKMSPRVPQ 
YGSYMEEKHM 
DGKELCKMTK 
TDLPYEPPRR 
SGQIQI>WQFI< 
ALRYYYDKNX 
MNFVAPHPPA 



60 
120 
180 
240 
300 
360 
420 



60 



Seq ID NO: 43 Nucleotide sequence : 
Nucleic Acid Accession #: NM_005100 

Coding sequence: 192.. 5537 (underlined sequences correspond to steort and stop codons) 



65 



70 



75 



CCTTCTTTTA 
CAGTCGGCTG 
CTTGCCCCTG 
GCGGAGGAGC 
AGGGGAGCTC 
CGCCAGACAC 
AGAAGAATGG 
AGGAGGGTGA 
AGGA6GAAGA 
AAAGAGACTC 
AT6GGCAG6A 
AGCTAACACA 
TTGTTGGCTT 



11 

I 

AGGAGTTTGC 
GCGGCGAAGG 
TCCCTGCGGC 
CATGGGCGCC 
CACGCCGGCT 
CACCGC66AC 
TCAGCTGTCC 
CCTAAATG6C 
AGAA6TCATT 
CGATAAAGAG 
G6A6AACC6A 
ACCCACTGAG 
TAAATTCACT 



21 
I 

CGCGAGCGCG 
AAGGCGCTCT 
TTGGGGAAAG 
GGGAGCTCCA 
6AGCCCGAGC 
CCCGCCATCG 
ACCATCAAT6 
CAGAAAGGAG 
GTCACG6AG6 
ATGGCTACTA 
AATATC6AAC 
TCCCAGGCTA 
GTGAAAAAGG 



31 
I 

TCTCCTTCAT 
CGGGACCTCA 
CGTAACCCGG 
CCGAGCAGCG 
CCAGCGGCGG 
CTGCCTCGGA 
GCGTAGCTGA 
CCCTGAACG6 
TT6GACAGA6 
AGTCA6CGGT 
AGATTCCTTC 
ATGATATTGG 
ATA3U3ACAGA 



41 
I 

TCGCAGGCTG 
CGGGCGCGCG 
CGGCTAGGCG 
CAGCCCGGAG 
CGGCCCCTC6 
CCCCGCCACC 
GCAAGAT6A6 
TCAA66A6CC 
AGACTCTGAA 
TGTTCACGAC 
TTCA6AAAGC 
ATTTAAGAA6 
GAAGCCT6AC 



51 

1 

GGCGCGTTCG 
TCTTTTGGCT 
CGGGAGAAGT 
CAGCCGCCCG 
6CCGAG6CGG 
AAGCTCCTAC 
CTCAGCCTCC 
CTAAACAGCC 
GATGTGAGCG 
ATCACAGATG 
AATTTAGAA6 
GTGTTTAAGT 
ACTGTCCAGC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
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TACTCACTGT GAAGAAAGAT GAAGGGGAGG 
CCAGCCTTGG 6GCTGGA6AA 6CAGCATCCA 
7UICCCGAA6A GACCCT6AAG CGTGAGCAAA 
CTGGCCAA6C AGTG6AG6AA TGCAAAGAGG 
5 GCAAGTCTGC AGAATCTCC6 ACTAGTCCCG 
AATTCTTCAC TCAAGGTTGG GCCGGCT6GC 
AGGAT6AAGT GGAAGCTTCA GAGAAGAAAA 
AAGAA6ACGG AAAGGCAGAG GTTGCCTCCG 
CACAG6A6CC 6GCAGAAAGT 6CCCACGAGC 
10 AGCT6CCCTC AGAGGAGCAA GTCAGTGGCT 
CGTTGGCGAC AGAAGTGTTT GATGAGAAAA 
AAGTCCACGT CAGCACCGTG GAGGAGAGAA 
CAGCAGGGTC TGTGCCAGCT GAAGAATTG6 
AACCTGCCAA GGAGCTGGTG AAGCTCAAAG 
15 AGGGAGCTGA CCTCAGTCCT GATGAGAAGG 
GTGAGGTGGA AATGCTGTCA TCACAGGAGA 
AGCTTTTTAC CAGCACTGGC TTAAAAAAGC 
GAGGAG6AGA CGAGGAATCA GGGGAGCACA 
AGGAGGAGCA A7VAGGGCGAG AGCTCTGCCT 
20. GTCTGGAAAA GGGCTTAGCC GAGGTGCAGC 
CCGATGGAGA GAAAAAAAGA GAAGGTGTCA 
CGCCCAAGAA GCGTGTTAGA CGGCCTTCGG 
TCAAGAGCGC TACCTTGTCT TCCACCGAGA 
AAGGGAGCGT GGAAGAGCCA AAGCCXSGAAG 
25 CTTGGGAAGC TTTAATTTGT GTGGGATCAT 
CTGATGAGGA AGGGG6ACCA AAA6CAAT66 
GAAAAGACAA AGAGACGG6G ACAGACX36GA 
GGCAGGGAAG TTCCTCCCC6 GA6CAAGCTG 
CCTGGGAGTC ATTTAAAAGG TTAGTCACGC 
30 AGAAAAGCGA AGACTCCATA GCTGGGTCTG 
CCGGTAAAGA A6AATCCTGG GTCTCAATCA 
GGCCAGATGG GAAACAAGAA CAAGCCCCTG 
AAGATQACTC TGATGTCCCG GCCGTGGTCC 
AGAAAATGGA GGCACAGCAA GCCCAAAAAG 
35 CTGAGGTGTC CAAGGAGCTC AGCGAGAGTC 
ACGGGACGAG GGCAGCTACC ATTATTGAAG 
TQACAGAACC TCTTGAACAA GTAGAAGCTG 
AAAGAGAAGT AATTGCAGAA GAAGAACCCC 
GAGAGGCCCG GGGCGACACG GTCGTTAGTG 
40 CTGCAGAAAC TGCAGGGCCA TTGGGTTCCG 
AGACCACAGA AATGGTGTCA GCAGTCTCCC 
AGGCCACTCC GGTGCAGGAG GTGGT^GGTG 
GGACTCAAGA GGTCCTCCAG GCA6TGGCAG 
GCACCGGTGG GCCAGTUVGAT GTGCTTCAGC 
45 AAGAGCAGGC TGAAGCGTCG GGTCTGAAGA 
CTCAGGA6GC AAAAACTGAG CCTTTTACAC 
AAAGCTTTGA AAAA6CTCCT CAAGTCACAG 
CTTGTCAA6C CGAAACCTTA GCT6GGGTAA 
TCCCCCCTGA CTCGQTGGAA ACCCCTACAG 
50 CCGACTTTGA C6CACCAGGC ACAACCCAGA 
ATGAGGTCGC ATCTGGTACC CAGTCAGGGG 
AAGAGAG6CC TCCAGCACCT TCCAGTTTTG 
A6ATGGAAGA CACTCTAGAG CATACAGATA 
TGTCAAAGAC TGAGGGGACT CAAGAGGCTG 
55 TACCATTTTT CGTVAGGACTT GAGGGGTCTA 
AGGTCACTGA AGTTGCCCTT AAAGGTGAAG 
ATGCTCTTGA ACTGCAGAGT CACGCTAAGT 
TAGTTCAAGT CGAAAGGGAG AAAACAGAAG 
TTGAGCACGA AACAGCTGTT ACCGTATCTG 
60 TGAATGTGCC CATCATAGAT GGGGCAAAGG 
CCTGCCTAGG TCAAGAGGAG GCAGTATGCA 
CATTCACTCT AACAGCGGCT GCAGAGGAGG 
TAGAAACAGG TGAAACGTTG GAGCCTGCAG 
CTGAAAAAAA TGAAGACTTT GCCGCTCATC 
65 ACTGTCAGGC AAAATCGACA CCAGTGATAG 
CCGACCTGGA AGGAGAGAAA ACCACATCAC 
AGGTTGCTTG CCAGGAGGTC AAAGTGAGTG 
GGATTTTGGA ACTTGAGACC A7UU16CA6TA 
TTGACCAGTT TGTACGTACA GAAGAAACA6 
70 CACAAGCTCA CGTGATAAAA GCTGACAGCC 
GAGAG6AACC TCAGGCCTCT GCACAGGATG 
CAGAGTCAAC CGCAGTGGGA CAAGCACATT 
CAGAAAAGAC CATGACTGTT GA6GTAGAA6 
AGGTC6TCCT CCCATCTGAG GAAGAGGGA6 
75 AT6ATGGTCA TGCCTTGTTA GCA6AAA6AA 
ATGAAAAAGG TGATGATGTT GAT6ACCCTG 



GAGCAGCAGG GGCTGGCGAC CACCAGGACC 640 

AAGAAAGCX3A ACCCAAACAA TCTACAGAGA 900 

GCCACGCAGA AATTTCTCCC CCAGCCX3AAT 960 

AAGGAGAAGA GAAACAAGAA AAAGAACCTA 1020 

TGACCAGTGA AACA6GATCA ACCTTCAAAA 1080 

GCAAAAAGAC CAGTTTCAG6 AAGCCGAAGG 1140 

AGGAACAAGA GCCAGAAAAA GTA6ACACAG 1200 

AGAAACTGAC CGCCTCCGAG CAAGCCCACC 1260 

CCCGGTTATC AGCTGAATAT GAGAAAGTTG 1320 

CGCAGGGACC TTCTGAAGAG AAACCTGCTC 1380 

TAGAAGTCCA CCAAGAAGAG GTTGTGGCCG 144 0 

CCGAAGAGCA G/U^CGGAG GTGGAAGAAA 1500 

TTGGAATGGA TGCAGAACCT CAGGAAGCCG 1560 

AAACGTGTGT TTCCGGAGAG GACCCTACAC 1620 

TGCTGTCCAA ACCCCCCGAA GGCGTTGTGA 1680 

GAATGAAGGT GCAGGGAAGT CCACTAAAGA 1740 

TTTCTGGAAA GAAACAGAAA GGGAAAAGAG 1800 

CTCAGGTTCC AGCCGATTCT CCGGACAGCC 1860 

CATCCCCTGA GGAGCCCGAG GAGATCACGT 1920 

AGGATGGGGA AGCTGAAGAA GGAGCTACTT 1980 

CTCCCTGGGC ATCATTCAAA AAGATGGTGA 2040 

AAAGTGATAA AGAAGATGAG CTGGACAAGG 2100 

GCACAGCCTC TGAAATGCAA GAAGAAATGA 2160 

AACCAAAGCG CAAG6T6GAT ACCTCAGTAT 2220 

CCAAGAAAAG AGCAAG6A6A AGGTCCTCTT 2280 

GAGGAGACCA CCAGAAAGCT . GATGAGGCCG 2340 

TCCTTGCTGG TTCCCAAGAA CATGATCCAG 2400 

6AAGCCCTAC CGAAGGGGAG GGCGTTTCCA 2460 

CAA6AAAAAA ATCAAAGTCC AAGCTGGAAG 2520 

GTGTAGAACA TTCCACTCCA GACACTGAAC 2580 

AGAAGTTTAT TCCTGGACGA AGGAAGAAAA 2640 

TTGAAGACGC AGGGCCAACA GGGGCCAACX3 2700 

CTCTGTCTGA GTATGATGCT GTAGAAAGGG 2760 

GCX3CAGAGCA GCCCGAGCAG AAGGCAGCCA 2820 

AGGTTCATAT GATGGCAGCA GCTGTCGCTG 2880 

AAAGGTCTCC TTCTTGGATA TCTGCTTCAG 2940 

AAGCCGCACT GTTAACTGAG GAGGTATTGG 3000 

CCACGGTTAC TGAACCTCTG CCAGAGAACA 3060 

AGGCGGAATT GACCCCCGAA GCTGTGACAG 3120 

AAGAAGGAAC CGAAGCATCT GCTGCTGAAG 3180 

AGTTAACCGA CTCCCCAGAC ACCACAGAGG 3240 

GCGTACCTGA CATAGAAGAG CAAGAGAGGC 3300 

AAAAAGTGAA AGAGGAATCC CAGCTGCCTG 3360 

CTGTGCAGAG AGCAGAGGCA GAAAGACCAG 3420 

AAGAGACG6A TGTAGTGTTG AAAGTAGATG 3480 

AAGGGAAGGT GGTGGGGCAG ACCACCCCAG 3540 

AGAGCATAGA GTCCAGTGAG CTTGTAACCA 3600 

AATCACAGGA GATGGTGATG GAACAGGCTA 3660 

ACAGTGAGAC TGATGGAAGC ACCCCCGTAG 3720 

AAGAC2GAGAT T6TGGAAATC CATGAGGAGA 3780 

GCACAGAAGC AGAG6CAGTT CCTGCACA6A 3840 

TGTTCCA66A A6AAACTAAA GAACAATCAA 3900 

AAGAGGTGTC A6TGGAAACT GTATCCATTC 3960 

ACCAGTATGC TGATGAGAAA ACCAAAGACG 4020 

TAGACACAGG CATAACAGTC AGTCGGGAAA 4080 

GGACAGAAGA AGCTGAATGT AAAAAGGATG 4140 

CTCCTCCATC CCCCGTGGAG AGAGAGATGG 4200 

CAGAGCCAAC CCATGTGAAT GAAGAGAAGC 4260 

AAGAGGTCAG TAAGCAGCTC CTCCAGACAG 4320 

AAGTCAGCAG TTTGGAAGGA AGCCCTCCTC 4380 

CCAAAATTCA AGTTCAGAGC TCTGAGGCAT 4440 

AAAAGGTCTT AGGAGAAACT GCCAACATTT 4500 

GTGCACATTT AGTTCTGGAA GAGAAATCCT 4560 

CAGGGGAAGA TGCTGTGCCC ACAGGGCCCG 4620 

TATCTGCTAC TACCAAGAAA GGCTTAAGTT .4680 

TGAAGTGGAA GTCAGATGAA GTCGAT6AGC 4740 

TAGCAATTQA GGATTTA6AG CCTGAAAATG 4800 

AACTTGTCCA AAACATCATC CAGACAGCCG 4860 

CCACCGAAAT GTT6ACGTCT GAGTTACAGA 4920 

AGGACGCTG6 ACAGGAAAC6 GAGAAAGAA6 4980 

AAACACCAAT TACTTCAGCC AAAGAGGAGT 5040 

CT6ATATTTC CAAAGACATG AGTGAAGCCT 5100 

GTTCCACTGT AAATGATCAG CAGCTGGAAG 5160 

GTG6AGCTGG AACAAAGTCT GTGCCAGAAG 5220 

TAGAGAAGTC ACTAGTTGAA CCGAAAGAAG 5280 

AAAACCAGTVA CTCAGCCCTG GCTGATACTQ 5340 
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ATGCCTCAGG AGGCTTAACC AAAGAGTCCC 
AGGAGGATGC CCAGGAAGTA GAATTGCAGG 
CGATCACACC CCAAGCACAG GAGGAGTTAC 
AACTTACAGA ATC TTAAA AC ATCATGCAGT 
5 TGTGAAGACA AGTAGTAGAA GAAAATGAAT 
AGAACTTTGA GAATTGGAGA GCAGGCACAT 
CAATCCTGAG GCTTCATCAG GAGCTAGAGC 
CTACAATTTT CCCTTGATAA CCATATAAAT 
GAACTGGAGT TGGCAATACC TA6TTCTGCT 

10 TTATATGTAT GTTTTAAGTA GTCCTCCTGT 
AGGAAATGTG CAGGATACTA CATGCTTTTT 
CATAGTGCAG GCTTGGGGAG CTTTAAGCCT 
CCTAGATGTA ACATTCCTGA TCAAGGTACA 
CATATTTAGT GGTACTCTGA AATTGGTCAC 

15 AAAAGCATTT TGAAACATAC AGAATGTTCT 
CAGTGGAGGT TAGAAAGAAG TTATATTCTG 
ACTTGTTATG GTTGTTTGGA CCGATAAGTG 
GTTTTATATG TTATGAAGAA AAGAATTGTT 
GACTGCATTC ACACATGGCA TGAAATAAGT 

20 TACTGGATTG TGTTTGTGCC ATATTTGTGC 
TCATTTGGAT AAGTTGTGAT TTGACGACTG 
AAAAAAAA 

Seq ID NO: 44 Protein sequence t 
25 Protein Accession #: NP 005091 



CAGATACAAA TGGACCAAAA CAAAAAGAGA 54 00 

AAGGAAAAGT GCACAGTGAA TCAGATAAAG 54 60 

AGAAACAAGA GAGAGAATCT GCAAAGTCAG 5520 

TAAACTCATT GTCTGTTTGG AAGACCAGAA 5580 

GCTGCTGCTG AGACTGAAGA CCAGTATTTC 564 0 

CAACTGATCT CATTTCTAGA GAGCCCCTGA 5700 

CATTTAACAT TTCCTCTTTC CAAGACCAAC 5760 

TCTGATTTAA GGTCCTAAAT TCTTAACCTG 5820 

TCT6AAACT6 GAGTATCATT CTTTACATAT 5880 

ATCTATTGTA TATTTTTTTC TTAATGTTTA 5940 

GTATCACACA 6TATATGATG G66CATGTGC 6000 

CAGTTATATA ACCCACAAAA AACAGAGCCT 6060 

ATTCTTTAAA ATTCACTAAT GATTGAGGTC 6120 

TTTCCTATTA CACGGAGTGT GCCAAAACTA 6180 

ATT6TCATTG 6GAAATTTTG CTTTCTAACC 6240 

GTA6CAAATT AACTTTACAT CCTTTTTCCT 6300 

TGCTTAATCC T6AGGCAAAG TAGTGAATAT 6360 

GTAAGTTTTT GATTCTACTC TTATATGCTG 6420 

CAGGTTCTTT ACAAATGGTA TTTTGATAGA 64 80 

CATTCCTTTA AGAACAATGT TGCAACACAT 6540 

ATTTTU^TAA AATATTTGCT TCACTTAAAA 6600 



1 11 21 31 41 51 

I I I 1 I t 

D\J MGAGSSTEQR SPEQPPEGSS TPAEPEPSGG GPSAEAAPDT TADPAIAASD PATKLLQKNG 60 

QLSTING^/AE QDEliSLQEGD UUGQKGAUifG QGAIiNSQEEE EVIVTSV6QR DSEDVSERDS 120 

DKEMATKSAV VHDITDDGQB ENRNIEQIPS SESNIiEELTQ PTESQANDIG FKKVFKFVGF 180 

KFTVKKDKTE KPDTVQLLTV lOCDEGEGTVAG AGDHQDPSIjG AGBAASKESE PKQSTEKPEB 240 

TIjKREQSHAB ISPPAESGQA VEECKEEGEE KQEKEPSKSA ESPTSPVTSB TGSTFKKFFT 300 

35 QGWAGWRKXT SPRKPKEDEV EASEKKKEQE PBKVDTBEDG KAEVASEKLT ASEQAHPQBP 360 

AESAHEPRLS AEYEKVELPS BEQVSGSQGP SEBKPAPIiAT EVFDEKIEVH QEEWAEVHV 420 

STVEERTEEQ KTEVEETAGS VPABBLVGMD AEPQBAEPAK ELVKIiKETCV SGBDPTQGAD 480 

LSPDEKVIiSK PPEGWSEVE MLSSQERMKV QGSPLKKIjFT STGLKKLSGK KQKGKRGGGD 540 

EESGEHTQVP ADSPDSQEEQ KGESSASSPE EPEEITCLEK GLAEVQQDGE AEEGATSDGE 600 

40 KKREGVTPWA SPKKMVTPKK RVRRPSESDK EDELDKVKSA TIiSSTESTAS EMQEEMKGSV 660 

EEPKPEEPKR KVDTSVSWEA LICVGSSKKR ARRRSSSDEB GGPKAMGGDH QKADEAGKDK 720 

ETGTDGILAG SQEHDPGQGS SSPEQAGSPT EGEGVSTWES FKRLVTPRKK SKSKLEEKSE 780 

DSIAGSGVEH STPDTEPGKE ESWVSIKKFI PGRRKKRPDG KQEQAPVEDA GPTGANEDDS 840 

DVPAWPLSE YDAVEREKME AQQAQKGAEQ PEQKAATEVS KEIiSESQVHM MAAAVADGTR 900 

45 AATIIEERSP SWISASVTEP LEQVEAEAAI. LTEEVLEREV lAEEEPPTVT EPLPENREAR 960 

GDTWSEAEL TPEAVTAAET AGPLGSEEGT EASAAEETTE MVSAVSQLTD SPDTTEEATP 1020 

VQEVEGGVPD lEEQERRTQE VLQAVAEKVK EESQIiPGTGG PEDVLQPVQR AEAERPEEQA 1080 

EASGIiKKETD WliKVDAQEA KTEPFTQGKV VGQTTPESFE KAPQVTESIE SSELVTTCQA 1140 

ETLAGVKSQE MVMEQAIPPD SVETPTDSET DGSTPVADFD APGTTQKDEI VEIHEENEVA 1200 

50 SGTQSGGTEA EAVPAQKERP PAPSSFVFQE ETKEQSKMED TIiBHTDKEVS VBTVSILSKT 1260 

EGTQEADQYA DEKTKDVPFF EGIiEGSZDTG ZTVSREKVTE VALKGEGTBB AECKXDDALE 1320 

IiQSHAKSPPS PVERBMWQV ERBKTEAEPT HVNEEKIiEHE TAVTVSEEVS KQZjLQTVMVP 1380 

IIDGAKEVSS LEGSPPPCLG QEBAVCTKIQ VQSSEASFTL TAAAEEEKVIi GETANZLBTG 1440 

BTLBPAGAHL VXiEEKSSEKN EDFAAHPGED AVPTGPDGQA KSTPVIVSAT TiCKGIiSSDLE 1500 

55 GEKTTSLKWK SDEVDEQVAC QBVKVSVAIE . DLBPENGIXiE LETKSSKIiVQ NIIQTAVDQF 1560 

VRTEETATEM LTSEIiQTQAH VIKADSQDAG QBTBKEGEBP QASAQDETPZ TSAKEESBST 1620 

AV6QAHSDZS KDMSEASEKT MTVBVEGSTV NDQQLEBWL PSBEB6GGAG TKSVPBDD6H 1680 

ALZiAERZEKS LVEPKEDEKG DDVDDPEMQN SALAOTDASG GZiTKESPDTN 6PKQKEKEDA 1740 
QEVELQEGKV HSESDKAZTP QAQEBIiQKQB RBSAKSBLTE S 

60 

Seq ID NO: 45 Nucleotide sequence; 
Nucleic Acid Accession #: NM_001290 

Coding sequence: 110. .1231 (underlined sequences correspond to start and stop codons) 

65 

1 11 21 31 41 51 

I' I I I I I 

GTGAGCGTGT GTGCGTGCGT CTACTTTGTA CTGGGAAGAA CACAGCCCAT GTGCTCTGCA 60 

TGGACGTTAC TGATACTCTG TTTAGCTTGA TTTTCGAAAA GCAGGCAAGA TGTCCAGCAC 120 

70 ACCACATGAC CCCTTCTATT CTTCTCCTTT CGGCCCATTT TATAGGAGGC ATACACCATA 180 

CATGGTACAG CCAGAGTACC GAATCTATGA GATGAACAAG AGACTGCAGT CTC6CACAGA 240 

GGATAGTGAC AACCTCTGGT GGGACGCCTT TGCCACTGAA TTTTTTGAAG ATGACGCCAC 300 

ATTAACCCTT TCATTTTGTT TGGAAGATGG ACCAAAGCGA TACACTATCG GCA6GACCCT 360 

CATCXrCCCGT TACTTTAGCA CTGTGTTTGA AGGA6GGGTG ACCX3ACCTGT ATTACATTCT 420 

75 CAAACACTCG AAAGAGTCAT ACCACAACTC ATCCATCACG GT6GACTGCG ACCAGTGTAC 480 

CATG6TCACC CA6CACGGGA AGCCCATGTT TACCAAGGTA TGTACAGAAG GCAGACTGAT 540 
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CTTGGAGTTC ACCTTTGATG ATCTCAT6AG 
ATACCGAGA6 TTAGTCCCGA GAA6CATCCT 
GGATCAGCTG TCCAAAAACA TCACCAGGAT 
CAGGTTGTGT GTAATATTGG AGCCAATGCA 
5 CCTCAGTCCC CGAGACTGCC TGAAGACCTG 
TCCGCCAGCA GAACCCACJiUV 66CAACCAAC 
CAGCAGCACT TCCAACAGCA GCGCTGGGAA 
GACCACAGCT GCAAACCTGA GTCTGTCCAG 
GCCAACTCTG ATGGGAGGTG AGTTTGGGGA 

10 AAACACGCAA TATGATGCGG CCAACGGCAT 
CGCGCTGGGG AACAACAGCC CGTGGAACAG 
AGTUUVACCCC CCACCCCAGG CTTCCCAATA 
AGGCCCGTGG GTGATCATTA CAATTGCAAA 
ATAAAAACTT TTCCATGCAA ATATCTATTT 

15 CTTTCTTTTT TTCTAATTGA GAGGATTATT 
GGCCTTCACA GGTAATACAG ATACTGGCAC 
GCGCATCTTC TGGCACGGTT TTAACAACGT 
AACGAAGGCC ATATTGTCCA TAAATGCTCA 
TAACTACAGA TGACTTTTTA ATATTGTAAA 

20 AGTTTCTTGT TTCAGTAAAA AAAGAAAAGA 
ATGTACCTTA TTTTTTTTTT CTTTATGTTT 
AGCAAGGTAA TTTATGGTTG AGCT6ATGTC 
TAGCCCAAGT GCTGAAACAA GAAATGTCAT 
TTAAGTAAAG AAAGACAATT GGACCCTTAA 

25 TCCAAATATT TTCAAGCCAT GTAATCCATT 
CTTTGTGTGT TTTCTAATTG TACCTGAGTT 
TGTATGATAT TTTGTAAAGC TCTCACCTGG 
AACTCCAGTG TATTTATGTG AAACTTTATA 
TATGTTCCTC CACACATGTA AAGGCACAGT 

30 TATGTATGCT TTACTGATAA GTGTGCCAAT 



AATCAAAACA TGGCACTTTA CCATTA6ACA 600 

AGCCATGCAT GCACAAGATC CTCAGGTCCT 660 

GGGGCTAACA AACTTCACCC TCAACTACCT 720 

GGAACTGATG TCGAGACATA AAACTTACAA 780 

CTTGTTTCAG AAGTGGCAGA GGATGGTGGC 840 

AACCAAACGG AGAAAAAGGA AAAATTCCAC 900 

CAATGCAAAC AGCACTGGCT^ GCAAGAAGAA 960 

TCAGGTACCT GATGTGATGG TGGTAGGAGA 1020 

CGAGGACGAA AGGCTAATCA CTAGATTAGA 1080 

GGACGACGAG GAGGACTTCA ACAATTCACC 1140 

TAAACCTCCC GCCACTCAAG AGACCAAATC 1200 

AGATGATCGG CACCAGAATC CACTGTCAAT 1260 

TCTTTACTTA CAGGAGAGGA AACAGAAGAG 1320 

CTAAACCACA ATGATCT6AT TTTCTTTCTT 1380 

CCCAGTAAGC TTCCATGACC CTTTCTTGGA 144 0 

TGATTGTAAT TAAAATGAGA GAAAACTCTA 1500 

GTTTGTGTTG AATTTCCTTT TTATGCATCA 1560 

GTGCTCAGGA TCTCATTAAT ATGCCGAACC 1620 

ATATTTTCTG CTTTTTGACT TGCATCTGAG 1680 

CAAAAAAATC AGCTTTGGAA AGTAATTTAA 1740 

TCTTTCATTG GGCAACA6CT AAGAGGGCCC 1800 

AATTGGTTCT TGTCTTGAGT CGACTCAATT 1860 

TTTTTTCATC AAAGACACCA GGGCAGATTT 1920 

GAATTTATGC ATTTGTAAAG TTGCTGTTGA 1980 

GGTTTTGTGG GCAGTTTAAT AT^CCTGAAC 2040 

GACC3VTCCTT TCTTTTTATA GTATATTTCT 2100 

TTCTTTTATG GGGACTTTTC GTTTTTGGrGC 2160 

AGJW3AATTAA TTTTTCCATT TGCATATTAA 2220 

GGCTCCGTGT GTTAAAAAAC AGCTGTATTT 2280 
TU^TAAACTGT 6TTAATGACC 



35 



50 



Seq ID NO; 46 Protein sequence; 
Protein Accession #: NP 001281 



1 11 21 31 41 51 

I LI I I I 

MSSTPHDPFY SSPFGPFYRR HTPYMVQPEY RIYEMNKRLQ SRTEDSDNIjW WDAFATEFFE 60 

40 DDATIiTLSFC liEDGPKRYTI GRTLIPRYFS TVFEGGVTDL YYILKHSKES YHNSSITVDC 120 

DQCTMVTQHG KPMFTKVCTE GRLILEPTFD DLMRIKTWHP TIRQYRBLVP RSIIiAMHAQD 180 

PQVLDQIiSKN ITRMGLTNFT LNYLRLCVII* EPMQELMSRH KTYNLSPRDC IiKTCIiFQKWQ 240 

RMVAPPAEPT RQPTTKRRKR KNSTSSTSNS SAGNNANSTG SKKKTTAANIt SLSSQVPDVM 300 

WGEPTIiMGG EFGDBDERIiI TRIiBNTOYDA ANGMDDEEDP NNSPALGNMS PWNSKPPATQ 360 
45 ETKSENPPPQ ASQ 



Seq ID NO: 47 Nucleotide secnience; 
Nucleic Acid Accession #: NM_004126 

Coding sequence: 108.. 329 (imderlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

GGCACGAGCT CGTGCCGGCC TTCAGTTGTT TCGGGACGCG CCGAGCTTCG CCGCTCTTCC 60 

AGCGGCTCCG CTGCCAGAGC TAGCCCGAGC CCGGTTCTGG GGCGAAAATG CCTGCCCTTC 120 

55 ACATCGAAGA TTTGCCAGAG AAGGAAAAAC TGAAAATGGA AGTTGAGCAG CTTCGCAAAG 180 

AAGTGAAGTT GCAGAGACAA CAAGTGTCTA i\ATGTTCTGA AGAAATAAAG AACTATATTG 24 0 

AAGAACGTTC TGGAGAGGAT CCTCTAGTAA AGGGAATTCC AGAAGACAAG AACCCCTTTA 300 

AAGAAAAAGG CAGCTGTGTT ATTTC ATAAA TAACTTGGGA GAAACTGCAT CCTAAGTGGA 360 

AGAACTAGTT TGTTTTAGTT TTCCCAGATA AAACCAACAT GCTTTTTAAG GAAGGAAGAA 420 

60 TGAAATTAAA AGGAGACTTT CTTAAGCACC ATATAGATAG GGTTATGTAT AAAAGCATAT 480 

GTGCTACTCA TCTTTGCTCA CTATGCAGTC TTTTTTAAGA GAGCAGAGAG TATCAGATGT 540 

ACAATTATGG AAATAAGAAC ATTACTTGAG CATGACACTT CTTTCAGTAT ATTGCTTGAT 600 
GCTTC7UVATA AA6TTTTGTC TT 

65 Seq ID NO: 48 Protein sequence: 
Protein Accession #: NP_004117 

1 11 21 31 41 51 

I ' ' I ' ' 

70 MPALHIEDLP KKEKLKNEVE QLRKEVKLQR QQVSKCSEEI KNYIBBR5GB DPZiVKGZPED 60 

KNPFKEKGSC VIS 



Seq ID NO: 49 Nucleotide sequence: 
75 Nucleic Acid Accession #: XM_051896 

Coding sequence: 13 9.. 23 88 (underlined sequences correspond to start and stop codons) 
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15 
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1 

I 

GTTTTAAAGA 
GAAACCAAAA 
CACAAGTTTA 
ATGCTTGATA 
AAGAGAACAA 
ATTTTGGATC 

atggatg;^ 
aaagaagttc 

GTTTGCTCAT 
AGACAACAGA 
AGTGAAGGAT 
GGTTTCCGAG 
CTGGATTGTG 
TATTCTCACC 
AATGTTAGCC 
TCTTTATGGA 
TTAATAGGAG 
AAAGTTAATA 
GTTTCAGAGC 
AAATATGGTA 
6TTAAGAAGT 
TCCATATTGT 
GA6GAAGAAT 
GATGATGAAT 
A6T6ATAATC 
TTATTCAATA 
CTCAATACAT 
GATGATGAAC 
CTGGATGTCA 
TATCCCTTGA 
GCAAGGCCAA 
AAAATGAACA 
AAGGAGTGCT 
ATCCACTTTG 
GAAACTGAGG 
TTTTCAACCT 
CACTTCAATA 
TATAGAAGAC 
TTTTTCAACA 
CAGTTTCTGA 
AGATAGTCX3T 
ATGA6AATAA 
TTAGCTACAT 
TTTTAAACAT 
AAAATATTTA 
ACTAGATTTA 
TACATGAAAT 



11 
I 

CGCTAGAGTG 
TGTCATTTAT 
CGGTAGTGGT 
CTCCAGATCC 
GACATTTC7VA 
CTAATCAGGA 
CTCTAGGGAC 
CTTTTATTTT 
GCCCAGACCT 
GAAAAGAACA 
TGCATTCTGC 
CCATGGTGGG 
CTACCTACGT 
CTGATTTTCC 
ACAATCCCCT 
AGAAGAAAAG 
AAACACTAAT 
CTGCACAATG 
TGATGTTTGC 
CTTTTATGGC 
ATGAAGAAAA 
TCAACAGAGT 
TA6AAAATAT 
CACACGAACC 
AAGCAAGTT6 
CCAGAGAAGG 
CTTATCCACT 
TGGATGCAGC 
AAAGTAAAAA 
TACTGAGACC 
GTGACTCTAG 
AGCTCCCCTT 
ATGTCTTTAA 
TTCTGGCCAA 
AAGAGAAAGA 
TCAATTTTCA 
CTCTGAACAA 
AGAATCCATC 
AGGAGTTTCT 
TGCTGAGGCA 
ACTGATCATG 
TACTATTATA 
TTTCAGTCAG 
TTCTCACCAA 
ACAGTTCAAT 
TTCATACCAT 
AAATACATCA 



21 
I 

CCAAAGAAGA 
AGATCCTTAC 
GTTACX5TGCC 
CTATGTGGAA 
TAATGACATA 
AAATGTTTTG 
AGCAACATTT 
CAACCAAGTC 
ACGATTTAGT 
CATAAGGGAG 
ACGTGATGTG 
ATTCTCTGGT 
TGCTGGTCTT 
AGAGAAAGGG 
TTTACTTCTC 
CTCTGGACAA 
TCATAATAGA 
CCCTTTACCT 
AGATTGGGTT 
TCCCGACTTA 
CCCCTTGCAT 
TTTGGGCGTT 
TACCACAAAG 
CAAAGGCACT 
GATTCATCGT 
ACGTGCTGGG 
GTCTCCTTTG 
TGTAGCAGAT 
GATTCATGTA 
TCAGAGAGGG 
TCCTCCGTTC 
TCCAAAGATT 
ACCCAAGAAT 
CATCAACTTC 
AATCGCTGAC 
ATATCCAAAT 
CATTGATGTG 
TCGTTGCTCT 
AAGTAAACCC 
GTTTGC7ATC 
AGAGACTGGC 
AGTTA6GTTG 
TAT6AACT7C 
CTTTCTTATG 
CTCAATAAGA 
6AGACAACAC 
ATATAAAAAT 



31 
I 

CTTTGAAGTG 
CAGCACATTA 
ACCAAAGTGA 
CTTTTTATCT 
AACX:CTGTGT 
GAGATTAOST 
ACTGTATCTT 
ACTGAAAT6G 
ATGGCTCTGT 
AGCATGAAGA 
CCTGTGGTAG 
GTGATGAAGG 
TCTGGCTGCA 
CCAGAGGAGA 
ACACCACAGA 
CCTGTCACCT 
ATGAATACTA 
CTTTTCACCT 
6AATTTAGTC 
TTTGGAAGCA 
TTCTTAATGG 
TCTGGTTCAC 
CATATT6TGA 
GAAAATGAAG 
ATGATAATGG 
AAGGTACACA 
A6TGACTTTG 
CCTGATGAAT 
GTG6ACAGTG 
GTTQATCTCA 
AAGGAACTTC 
GATCCTTATG 
CCTGATATGG 
AGAAAGTACA 
TTTGATATTT 
CAAGCATTCA 
ATAAAAGAAG 
GTTTCCCTTA 
AAAGCATAGT 
CCATGACAAC 
TGATACTCAA 
ACAAATGATG 
CTGATACAAA 
TGTGTTCTTT 
CCTCGCATTA 
TATTTTTATT 



41 
I 

TGAAAACATT 
TAGTGGA6CA 
CAAAGGGGGC 
CTACAACCCC 
GGAATGAGAC 
TAATGGATGC 
CTATGAAG6T 
TTCTAGAAAT 
GTGATCAGGA 
AACTCTTGGG 
CCATATTGGG 
CATTATACGA 
CCTGGTATAT 
TTAATGAAGA 
AAGTTAAAAG 
TTACTGATAT 
CTCTGAGCAG 
GTCTTCATGT 
CATACGAAAT 
T^TTTTTTAT 
GTGTCTGGGG 
AAAGCAGAGG 
6TAATGATAG 
ATGCTGGAAG 
CCTTGGTGAG 
ACTTCATGCT 
CCACACAGGA 
TTGAGC6AAT 
GG CTCACATT 
TAATCTCCTT 
TACTTGCAGA 
TGTTTGATCG 
AGAAAGATTG 
GGGCTCCAGG 
TTGATGACCC 
AAAGACTACA 
CCATGGTTGA 
GTAATGTTGA 
TCATGTACTG 
TGGATTTAAA 
AGTTGCAGTT 
TTGATTATGT 
TGTAGGGATA 
TTAAAAATTT 
TGTATGAATG 
TATATATGCA 



51 
1 

TCCTGTAATT 
CCAGTATTCC 
CTTTGGT6AC 
TGACAGCAGG 
CTTTGAATTT 
CAATTATGTC 
GGGAGAAAAG 
GTCTCTT6AA 
GAAGACTTTC 
TCCAAAGAAT 
TTCAGGTGGG 
ATCAGGAATT 
GTCAACCTTG 
ACTAATGAAA 
ATATGTTGAG 
CTTTGGGATG 
TTTGAAGGAA 
CAAACCTGAC 
TGGCATGGCT 
GGGAACAGTC 
CAGTGCCTTT 
CTGCACAATG 
CTCGGACAGT 
TGACTATCAA 
TGATTCAGCT 
GGGCTTGAAT 
CTCCTTTGAT 
ATATGAGCCT 
TAACCTGCCG 
TGACTTTTCT 
AAAGTGGGCT 
GGAAGGGCTG 
CCCAACCATC 
TGTTCCAAGG 
AGAATCACCA 
TGATCTTATG 
7VAGCATTGAA 
GGCAAGAA6A 
GAAATGGCAG 
AGTACAGTAC 
ACTTAGCTGC 
AAG6ATATAC 
TATACTGTAT 
TTTTTCTTTT 
TTATTCACTG 
TATATATACA 



60 
120 
180 
240 
300 
3^0 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 



55 



60 



65 



70 



Seq ID NO: 50 Protein sequence; 
Protein Accession #: XP 051896 



1 

I 

MSFIDPYQHI 
RHFNNDINPV 
PFIFNQVTEM 
LHSARDVPW 
PDFPEKGPEE 
• ETLIHNRMNT 
TFMAPDLFGS 
LENITTKHIV 
TREGRAGKVH 
KSKKIHWDS 
KIiPFPKIDPy 
EEKBIADFDI 
QNPSRCSVSIj 



11 
I 

IVEHQYSHKF 
WNETPEFIIiD 
VIiEMSLEVCS 
AILGSGGGFR 
INEELMKNVS 
TIiSSliKEKVN 
KFFMGTWKK 
SNDSSDSDDE 
NFMLGLNIiNT 
GIiTFMLPYPli 
VPDREGLKEC 
FDDPESPFST 
SNVEARRFFN 



21 
I 

TVWIjRATKV 
PNQENVIiEIT 
CPDLRFSMAIi 
AMVGFSGVMK 
HNPLLLLTPQ 
TAQCPLPIiFT 
YEENPLHFLM 
SHEPKGTENE 
SYPLSPLSDF 
ILRPQRGVDL 
YVPKPKNPDM 
FNFQYPNQAP 
KEFIiSKPKA 



31 
I 

TKGAFGDMIiD 
LMDANYVMDE 
CDQEKTFRQQ 
ALYESGILDC 
KVKRYVESLW 
CLHVKPDVSE 
GVWGSAFSIL 
DAGSDYQSDN 
ATQDSFDDDE 
IISFDFSARP 
EKDCPTIIHF 
KRLHDLMHFN 



41 

I 

TPDPYVELFI 
TIjGTATFTVS 
RKEHIRESMK 
ATYVAGLSGS 
KKKSSGQPVT 
LMFADWVEFS 
PNRVLGVSGS 
QASWIHRMIM 
LDAAVADPDE 
SDSSPPFKEIb 
VLANIMPRKY 
TLNNIDVIKE 



51 
I 

STTPDSRKRT 
SMKVGEKKEV 
KLliGPKNSEG 
TWYMSTLYSH 
FTDIFGMLIG 
PYEIGMAKYG 
QSRGSTMEEE 
AIiVSDSALFN 
FERIYEPLDV 
IiIiAEKWAKMN 
KAP6VPRETE 
AMVESIBYRR 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



Seq ID NO: 51 Nucleotide sequence: 
Nucleic Acid Accession #: NM_00652a 
75 Coding sequence: 57.. 764 (underlined sequences correspond to start and stop codons) 



204 



JNSDOCrD: <WO 0207949aA2_l„> 



wo 02/079492 



PCT/US02/04915 



10 



15 



20 



GCCGCCAGCG 
ACCCXTGCTCG 
GCGATGCTGC 
ACX5GACCCTG 
GCCAGTTCCT 
GCGACGATGC 
TGGACGACCA 
GTGAAAAATT 
AAGCTACTTG 
AAGATGAGGG 
CCTGTGATGC 
AGGATTGCAA 
GCTTTGCCAG 
ATCTTGTTTG 
GCATGAGGAA 
TTCAAAAATT 
TTTAATTTAT 
AAATATGACT 
AAACAACATA 
CC 



11 
I 

GCTTTCTCGG 
CCCCCTGGGG 
TCAGGAGCCA 
CCGGGCCCTA 
GTACGGGGGC 
TTGCTGGAGG 
GT6TGAGGGG 
CTTTTCCGGT 
TATGGGCTTC 
ACTGTGCTCT 
TTTCACCTAT 
ACGTGCATGT 
TAGMTCCGG 
TCTTTATGGC 
ACAAATCATT 
TGGATTTTTT 
GGTTCAACTG 
CACTCATTTC 
AGACAATATA 



21 
1 

ACGCCTTGCC 
CTGTC6ATTC 
ACAGGAAATA 
CTTCTCCGTT 
TGCGAGGGCA 
ATAGAAAAA6 
TCCACAGAAA 
GGGTGTCACC 
TGCGCACCAA 
GCCAATGTGA 
ACTGGCTGTG 
GCAAAAGCTT 
AAAATTCGGA 
TTATTTGCCT 
GGTGATTTAT 
TATATATAAC 
TTTGTGAGAC 
TTGGGGTCGT 
ATCATGTGCT 



31 
I 

CA6CGGGCC6 
TGCT6CTTTT 
ACGCG6AGAT 
ACTACTACGA 
ACGCCAACAA 
TTCCCAAAGT 
AGTATTTCTT 
GGAACCGGAT 
AGAAAATTCC 
CTCGCTATTA 
GAGGGAATGA 
TGAAAAAGAA 
AGAAGCAATT 
TTATGGTTGT 
TCACCAGTTT 
TAGCTGCTAT 
GAATTCTTGC 
ATTCCTGATT 
TTTAACATAT 



41 
1 

CCCX3ACCCCC 
CCTGACGGAG 
CTGTCTCCTG 
CAGGTACACG 
TTTCTACACC 
TTGCCGGCTG 
TAATCTAAGT 
TGAGAACAGG 
ATCATTTTGC 
TTTTAATCCA 
CAATAACTTT 
AAAGAAGATG 
TTAA ACATTC 
ATCTGAAGAA 
TTATTAATAC 
TCAAATGTGA 
AATGCATAAG 
TCAGAAGAGG 
TTGAGAATAA 



51 
i 

TGCAC CATGG 
GCTGCACTGG 
CCCCTAGACT 
CAGAGCT6CC 
TGGGAGGCTT 
CAAGTGAGTG 
TCCATGACAT 
TTTCCAGATG 
TACAGTCCAA 
AGATACAGAA 
GTTAGCAGGG 
CCAAAGCTTC 
TTAATATGTC 
TAATATGACA 
AAGTCACTTT 
GTCTACCATT 
ATATAAAAGC 
ATCATAACTG 
AAAGGACTAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



25 



Seq ID NO: 52 Protein aeauence: 
Protein Accession #: NP 006519 



11 



21 



31 



41 

1 



51 



MDPARPLGbS IIiIiIiPLTEAA LGDAAQEPTG NNAEXCliLPL DYGPCRALLlt RYYVDRYTOS 60 

30 CRQPIjYGGCE GNANNFYTWE ACX)DACWRIE KVPKVCRIjQV SVDDQCEGST EKYPFNIiSSM 120 

TCEKFFSGGC HRNRIENRFP DBATCMGFCA PKKIPSPCYS PKDEGLCSAN VTRYYPNPRY 180 
RTCDAFTYTG CGGNDNNFVS REDCKRACAK AUCKKKKMPK LRFASRZRKI RKKQF 



35 Seq ID NO: 53 Nucleotide sequence; 

Nucleic Acid Accession #: AA478778 
Coding sequence: no ORF foxind 



40 



45 



50 



55 



60 



TATTTTTGTA 
TGATCCTTCA 
ACTTTTTTAA 
GTATAAAAGA 
TCACAGATTG 
GGAAAAAAAT 
6AAATATAAA 
CACTTGGATG 
AGACTTAGAC 
GTGCCTTGGT 
TATTGG6AAA 
TTACCA6TAG 
CTGGAG/^T 
GCTGGTGATG 
CTGCAGTAAT 
TTCCGATTGC 
CATGGTCTTT 
ACACTTGGTT 
CAGTATTCTC 
TGAATAAGTG 
CACCACACCC 



11 
I 

CGTAAAATGA 
TTATCACGGT 
AATGAATTTT 
TATTTTTGGC 
TACCAACTAT 
ATGCTGCCTT 
CACTTTTAAT 
AAATAAGACC 
TTTATCCTTA 
CTCTCCACAA 
GTGAGATCCT 
AAAGACACA6 
TCAGJ\ACCAG 
TGACTTCTCT 
GGACGTTTGT 
TCATTAATTC 
CTGCCCCTCC 
TGAGAAACCC 
CAACTCCAAA 
TTATTCTCCA 
AAAAAAAAAA 



21 
I 

TTCTATTATG 
ACACTATTGT 
TTTAAAACAA 
ATTTCTAGGC 
TAACTATGTT 
GGTGCTAATA 
GAAAGGGA66 
AGCTCTTTAC 
TTGTTGTTAG 
TCAAATGGAG 
CTCACCATTT 
GATGCACAGA 
GTTCTGAATC 
TCAGGCCATG 
GTGAAGAAAT 
ACTTTTTTGT 
AAGCTGATGA 
TGCCCACTTC 
CAAGCTCTAG 
TTATTAATGT 
AAAAAAAAAA 



31 
I 

ACTGCCTTTG 
TTACTTTTCA 
TCTAGCCATC 
AAGTATCAGC 
AAATAAGTAT 
TTGTATGTAT 
AACGGAAGGA 
CCTTATTTTT 
TGTTGTTAAT 
GATCCCCCAA 
TGCCAAGATA 
ATGGGCATGA 
ATCACGATTG 
AGCCTAACAY 
GAACTGTGGA 
TACTTCTTTC 
AGGGAAGCCT 
CAAAGACCAA 
AGTGCTCCAG 
GTTCTGAAAA 
AAAA 



41 

I 

CATGTAGTAA 
TCTGTAAATG 
ATCAA6GTGC 
CAATAAGTAT 
TCAGTTTCAT 
TTAAATGATC 
CAATTTCCAG 
G6ATATGCCT 
ATTCGTTGCT 
GCAGCTTCAT 
CTCTAAAATG 
CCTTCAGCTC 
CCTTTTGCAT 
CCTGCCGGTT 
GTACAAAATG 
CAAAATGGAA 
TTGCCAATGG 
AGAGATTAGG 
GAAAAGTTAT 
TATATTAT6A 



51 
I 

TATGACAAAG 
TTTTATTGTT 
TATAAGAGTT 
GTTAGTGATA 
GT6ATCTCTG 
ATCTGACTCA 
TGCACAGAAT 
TTTTTGGAAG 
TCAGCCCACX3 
TACAGAGOXSA 
ACATCCAAGT 
ACGAGCACAC 
GAAAACATCG 
TTCATGCCCG 
CTTTGAGTCT 
GTGCTGAAGC 
CCCATGGAAG 
AAAAGCCTGG 
ATTCAGTATA 
ATAAATACAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 



65 Seq ID NO: 54 Nucleotide aeguence: 
Nucleic Acid Accession #: NM_020663 

Coding sequence: 1..645 (underlined sequences correspond to start and stop codons) 



70 



75 



ATGAACTGCA 
ATGTTGAAGT 
TACGCCAACG 
ACTGTGACTG 
GACTACAACC 



11 
I 

AAGAGGGAAC 
GTGTGGTGGT 
ACGCCTTCCC 
TGGGAGGCAA 
AGCTGAGGCC 



21 
I 

TGACAGCAGC 
GG6GGACGGT 
AGAGGAATAC 
GCAACACTTG 
ACTCTCCTAC 



31 
I 

TGCG6CTGCA 
GCCGTGGGGA 
GTGCCCACTG 
CTCGGACTGT 
CCCAACACGG 



41 



51 



6GGGCAACGA CGAGAAGAAG 60 

AAACCTGCCT GCTGATGAGC 120 

TGTTTGACCA CTATGCAGTT 180 

ATGACACOGC G6GACAGGAG 240 

ATGTGTTTTT GATCTGCTTC 300 



205 



BNSOOCID: <WO_ 



_0207949aA2.L> 



wo 02/079492 PCT/US02/04915 



TCTGTCGTAA ACCCTGCCTC TTACCACAAT GTCCAGGAGG AATGGGTCCC CGAGCTCAAG 360 

GACTGCATGC CTCACGTGCC TTATGTCCTC ATAGGGACCC AGATTGATCT CCGTGATGAC 420 

CCA7UVAACCT TGGCCCGTTT GCTGTATATG AAAGAGAAAC CTCTCACTTA CGAGCATGGT 4 80 

GTGAAGCTCG CAAAAGCGAT CGGAGCACAG TGCTACTTGG AATGTTCAGC TCTGACTCAG 540 

5 AAAG6TCTCA AAGCGGTTTT TGATGAAGCA ATCCTCACCA TTTTCCACCC CAAGAAAAAG 600 
AAGAAACGCT GTTCTGAGGG TCACAGCTGC TGTTCAATTA T CTGA 



Seq ID NO: 55 Protein secruence: 
Protein Accession #; NP_065714 

1 11 21 31 41 51 

i I I I I I 

MNCKEGTDSS CGCRGNDEKK MIjKCWVGDG AVGKTCLUyiS YT^AFPEEY VPTVFDHYAV 60 
TVTVGGKQHL LGIiYDTAGQE DYNQLRPIiSY PNTDVFLICF SWNPASYHN VQEEWVPEIjK 120 
DCMPHVPYVL IGTQIDIiRDD PKriiARLLYM KEKPLTYEHG VKTiAKAXGAQ CYLECSAIjTQ 180 
KGUCAVFDBA IIiTIFHPKKK KKRCSEGHSC CSII 



20 Seq ID NO: 56 Nucleotide sequence; 

Nucleic Acid Accession fgenesh prediction 

Coding sequence: 1-54 6 (underlined sequences correspond to start and stop codons) 

25 1 11 21 31 41 51 

I I I i I t 

aSGGCCTTGG GCAGCTCCGC CCCTGTGGCT TTGCAGGGTA ATGCCCACTT CCCTGCTGCT 60 

TTCATGQCTG GCATTAAGTG TCTGTGGCTT TTCCAGGTAG TCCXrCCTGGG GCTCCCCX3AG 120 

TTGGTGCAAA GGCTCCTGGG TGGAGCTCGA ACTGAAACTC GCTTTGTGCC C6CAGCCCTG 180 

30 CAGCTCGCCG GTGCCCTCGA CCTGCCCGCT GGGTCCTGTG CCTTTGAAGA GAGCACTTGC 240 

GGCTTTGACT CCGTGTTGGC CTCTCTGCCG TGGATTTTAA ATGAGGAAG6 CCAGCAACCT 300 

TTCTGGTCCT CAGGAGACAT GTCTGACTGG GACTACTG6G TTGGCTGGCG 6AAGTTAATT 360 

CATTCTCCTC TGAGCACTCC AGGGTGGAGC AGGCAGGTTA GGCTCCAGTT 6TTCCAGCTT 420 

CAGTTTGTCA AAGGCCAGAA CTTGGACGTA ACAGTGTACT GCAGGCTCCA GGGCAGTGAG 480 

35 AAACCCTTTG AAACTGGTTC CATGGTTCCA TTCACCTTCA TGTACTGGAT CCACCATGGA 540 
AAGTAG 



Seq ID NO: 57 Protein sequence: 
40 Protein Accession #: fgenesh prediction 

1 11 21 31 41 51 

I I I I I I 

MALGSSAPVA LQGNAHFPAA FMAGIKCLWL FQWPLGLPE LVQRIiLGGAR TETRFVPAAL 60 

45 QLAGAIiDLPA GSCAPEBSTC GFDSVIiASIjP WILNEEGQQP FWSSGDMSDW DYWVGWRKIjI 120 

HSPLSTPGWS RQVRLQLFQIj QPVKGQNLDV TVYCRLQGSB KPPETGSMVP FTFMYWIHHG 180 
K 

Seq ID NO: 58 Nucleotide sequence; 
50 Nucleic Acid Accession #: XM_050478 

Coding sequence: 2 7.. 4 5 08 (underlined sequences correspond to start suid stop codons) 
1- 11 21 31 41 51 

I I I I I I - 

CCGGCGGCGC CTGAGCCCAG CCGAGG ATGG AGAACCGGCC TGGGTCCTTC CAGTACGTCC 60 

55 CTGTGCAGCT GCAAGGGGGG GCACCCTGGG GCTTCACCCT TAAGGGGGGT CTGGAACACT 120 

GTGAGCCGCT CACAGTGTCT AAGATTGAAG ATGGAGGCAA GGCAGCTTTG TCCCAGAAGA 180 

TGAGGACTGG TGATGAGCTG GTGAATATCA ATGGCACTCC ATTATATGGC TCCCGCCAAG 240 

AGGCCCTCAT TCTCATCAAA GGCTCCTTCC GGATTCTCAA GCTGATTGTC AGGAGGAGGA 300 

ACGCCCCTGT CAGTAGGCCG CACTCATGGC ATGTGGCCAA GCTGCTGGAG GGATGCCCTG 360 

60 AAGCAGCCAC CACCATGCAT TTCCCTTCTG AAGCCTTCAG CTTGTCCTGG CATTCTGGCT 420 

GCAACACAAG TGACGTGTGT GTGCAGTGGT GTCCACTCTC CCGGCATTGC AGCACCGAGA 480 

AAAGCAGCTC CATTGGCAGC ATGGAGAGCC TGGAGCAACC AGGCCAAGCC ACCTATGAGA 540 

GCCATCTGTT GCCTATTGAC CAGAACATGT ACCCTAACCA GCGTGACTCA GCCTACAGCT 600 

CCTTCTCGGC CAGCTCAAAT GCTTCTGACT GTGCCCTTTC CCTCAGGCCA GAGGAGCCAG 660 

65 CCTCTACAGA CTGCATCATG CAAGGCCCAG GGCCAACTAA GGCCCCCAGT GGCCGGCCTA 720 

ATGTGGCTGA GACCTCAGGA GGTAGTCGGC GCACCAATGG GGGCCACCTG ACCCCCAGCT 780 

CTCAGATGTC ATCCCGTCCA CA6GAGGGAT ACCA6TCAGG GCCCGCCAAA GCAGTCAGGG 840 

GCCCACCACA ACCTCCAGTG AGGCGG6ACA GCCTTCAGGC CTCCAGAGCC CAACTCCTCA 900 

ATGGAGAGCA GCGCAGGGCA TCTGAGCCTG TGGTCCCCTT GCCACAG7AG GAGAAACTGA 960 

70 GCTTAGAGCC TGTGCTACCC GCAAGGAACC CTAATAGGTT CTGTTGCCTC AGTGGGCATG 1020 

ACCAA6TGAC AAGTGAGGGC CATCAGAACT GTGAGTTCAG TCAGCCTCCT GAATCCAGCC 1080 

AACAGGGCTC TGAGCATCTA CTGATGCAGG CCTCAACCAA A6CT6TTGGA TCCCCAAAAG 1140 

CCTGTGACAG AGCTTCCAGC GTGGATTCCA ACCC31CTCAA TGAGGCTTCT GCAGAGCTAG 1200 

CTAAGGCTTC TTTTGGCAGA CCTCCACATC TCATAGGACC CACAGGGCAT CGCCATAGT6 1260 

75 CCCCTGAACA GCTGGTGGCA TCCCACCTGC AGCATGT6CA CCTTGATACC AGGGGCAGCA 1320 

AAGGGATGGA GCTCCCACCC 6TACAGGATG GGCACCAGTG GACTCT6TCC CCTTTGCACA 1380 



206 

3NSDOCID: <WO 02070492A2^L> 



wo 02/079492 PCT/US02/04915 



GCAGCCACAA A6G6AAGAAA AGTCCAT6CC 
6CAAAGAAAG AAA6ACCAGA CAA6TGGATG 
AAAGCAGTCC CCCACATGGA GAGGCTGATG 
CAAACAGAAC AAGCAGAGCA GCCAGTGAAT 
5 CCCTTGTTCA ACAAGCCAC6 GACTGTTCTT 
CAGGTGAAGA AGGGGACAGC GAGCCCAAGG 
GAGGGACCCG GGGCCGCTCG ATCCAAAACC 
TGCGTAAT6A AATTCAGAGG AGGAAGGCCC 
AGCTGTGTGA CACTAAGGAG CCAGTGGAAG 

10 TCACTGCCTC TAACACATCT CTTCTATCTT 
AGCTCTTCAA CAAAAGCATG ATGCTCAGAG 
CTGAGAGCCA TGAATCTAGG ACAGGCTTAG 
GCCAGTCCTC TTTGGGCCTG AACACCTGGT 
CTGAGAAAGC ACATGCTCAC TGTGGAGTCC 

15 ATAATTCACA GCCACTTGTG GCAGCAGCCA 
AGGAATTGAA GGCTTCTACT GCTCAAGCTG 
ACAGAAGAAA GTTCTTTGAA GAGAGTAGCA 
TAACCACTCA TAGCAACAAG ACTTTTACCC 
AGCCAATGAG CTCCAGCTGT AGGGAATTGA 

20 CCGCAGACCA ACCATATCAT GCCACAGACC 
CA6AAACTCC CACTTACTCA GAATGTTTTG 
6TAAGCCACT ACACT6TGGT 6ATTTTGATT 
TTCAAGGA6C TCTAGTCCAT 6ATCCTTGCA 
TGCTAAA6AG AAATAT6ATG CCAAATT6CT 

25 TTCGGTGTTC AGTTT6CTAT CATAATCCTC 
CACCTG6CAA CACTTG6AAA CCCAGGAAGC 
GGAATCCAAT AACAGGAAAC AGGAAGACCA 
A6ACTA6CTT TTCATGG6CA ACCCCTTTCC 
TGTCAAGCTA CCX3AGCAATT TCTTCTCTTG 

30 AAAAATCAGA GGAAACTTCA GTTTATGAGG 
CACTGCGCAG CCGTGCCTTC TCAGAGAGTC 
CCTGGGGGCA GCATAGGAGG GAGCTCTTTA 
TCGQAGCCAG GAAGAAGGCC TTTCCTCCTC 
ACAGGCTCTT TCGTGCAGCC CAGCAGCAGA 

35 AGGAGGAGGA GGAGGAGGAA GAAGAAGAAG 
CAGA6GAGGA GGAAGAGGAG CTGCCACCCC 
GTGCTCTCAA TCCTGAGGAG GTCCTAGAGC 
AGGGCTCGAG ACAGGGTTCA CAAAGTGTCC 
CCAGTGATTT CTTGCCTCCA ATAAGGGGTC 

40 CCCCTTGCTA CTATGGCATT GGTGGGCTTT 
CCGCCAAACA AGAGTTTCAG CACTTTTCGC 
CTTACTCAGC TTATTACAAT ATTTCTGTGO 
ACCAACCTGA GATGGCAGAG ATTG6CCTAG 
AAAAAAAGAT ACAGCTTATC 6AAAGCATCA 

45 A6C6AG66CT GCTAGA6GAC ATCAATGCCA 
ACTTAAAAGC CX3TCTGC7kAA TCCAATGAAT 
TGGACAAAGT GGTCAACCTG TTGCTGTCAC 
CTCT6AACAG CATCGATTCA GAGGCCAACC 
AGCA6CT6AC GGGGCAGTTG GCAGATGCCA 

50 AGAAGTTGGT GTTTGGCATG GTCTCCCGCT 
AGCACTTTGT CAAGATGAAA TCTGCTCTCA 
TCAAGCTCGG GGAAGAGCAA CTCAAATGTC 
ATTTCTAATT CTACCAGCAC TCTGCCACAG 
TCAATCTTCT TTGTTAGCAG TTTCTCAGCA 

55 CCTCTACCCT GGATGTCTCT CACTACCCCT 
ACCCTGGGGA AGCCACAAGC TTCTACCCAA 
CACACTCTCC TTCCCACAGT TGCCAAGGGC 
TGCCTTCATT CTGCTTTGTA CTAGGACACC 
ATCATCAACA GCCTCTAAAG GCTCAGAGGG 

60 GGCTTGTGGC CAGCCATTTC TCACAGAGAG 
CAGTTTCAGG GCCTCACCCA AGCTTTGCAG 
AAAAAATGCA AGCAAAGGTT GAGTACCCCC 
ATAGGCTCTA CCCTTACCTT TCTCAGCAGC 
CTGGCTAGTG TGACCCTCTT CCTGTCCTAA 

65 TTTCCTTTAC ATTGCTGGGG 6TTACCX3CAG 
CATTAATAGC TCTACTAAAA CTGACTTCTA 
TCTTATTGTT ATATTTTAAA TGGCCTTTTG 
TTTTCTTTTT TAACTAATAA GGCGAGAAGA 
AAG6AAAGCA TTTTCTGCA6 ATCA6CCTGA 

70 ' TCTC6TGTTG CTCACAACTA CCTGCCTGGA 
XAAAACACAA 6ATCAAATGA ACAATCCGAA 
CATGGTGGCT CACGCCTGAA ATCCCAGCAC 
GTCAGGAGAT CAAGACCATC CTGGCTAACA 
AAAAATTAGC CAGGTGTGGT GGCACGCACC 

75 AGGAGAATTG CTTGAACCTG GAAGGCAGAG 
CTCCATCCTG GGCAACAGAG TGAGACTTTG 



CCCCTACA6G AGGAACCCAT GACCAGTCCA 1440 

ACAGGTCTTT AGTTTTGGGA CACCAGAGCC 1500 

GACACCCCTC AGAT^AAAGGT TTCCTGGACC 1560 

TGGCCAACCA GCAACCCTCT GCCTCT6GCT 1620 

CAACCACTAA AGCAGCTAGT GGCACAGAG6 1680 

AGTGCAGCCG GAT6G6T6GT AGGCGAAGT6 1740 

GGCGGAAGAG TGA6CGTTTT GCTACCAATC 1800 

AGCTCCAGAA AAGCAAGGGT CCCTTGTCAC 1860 

AGACCCAGGA GCCCCCAGAA AGTCCTCCAC 1920 

CATGTAAAAA ACCTCCCAGC CCCAGAGACA 1980 

CTAGGTCTTC CGAGTGCCTC AGCCAAGCCC 2040 

AGGGACGAAT AAGCCCTGGC CAGAGGCCTG 2100 

GGAAAGCACC TGACCCATCC TCCTCAGACC 2160 

GTGGAGGTCA TTGGAGATGG TCTCCAGAGC 2220 

TGGAAGGCCC TTCCAACCCA GGTGACAACA 2280 

GGGAGGATGC CATCCTCTTG CCTTTTGCAG 234 0 

AATCCTTATC TACATCTCAT TTGCCAGGTT 2400 

AGAGACCAAA ACCTATAGAC CAAAACTTCC 2460 

GGCGCCATCC CATGGACCAA TCATATCATT 2520 

AATCATATCA TTCCATGTCA CCCCTTCAGT 2580 

CAAGCAAAGG TCTAGAAAAT TCCATGTGTT 2640 

ACCACAGGAC CTGCTCTTAC TCCTGCAGTG 2700 

TTTATTGTTC TGGGGAAATC TGCCCTGCCT 2760 

ACAACTGCCG GTGCCACCAC CACCAATGCA 2820 

AGCACAGTGC CCTCGAG6AC AGCAGCTTGG 2880 

TGACAGTGCA GGAATTTCCT GGGGACAAAT 2940 

6CCAGTCAGG GAG6GAAATG GCTCATTCCA 3000 

ATCCTTGCCT TGAGAACXXrA GCACTG6ACT 3060 

ACCTCCTTG6 AGACTTC/^ CATGCTTTGA 3120 

AGGGGAGCTC CCTTGCCTCC ATGCCCCACC 3180 

ACATCAGCTT GGCX3CCCCAA AGCACCCGGG 3240 

GCAAAGGTGA TGAGACCCAG TCGGATCTTC 3300 

CTCGCCCTCC TCCTCCCAAC TGGGAGAAGT 3360 

AGCAGCAACA GCAGCAGCAG AAGCAACAGG 3420 

AAGAGGAAGA GGAAGAGGAG GAGGAGGAGG 34 80 

AGTATTTCAG TTCAGAAACC TCTGGTTCCT 3540 

AGCCACAAGC CCTCAGCTTT GGCCACCTGG 3600 

CAGCAGAGCA AGAATCCTTT GCACTCCATT 3660 

ACTTGGGATC TCAACCTGAG CAGGCTCAGC 3720 

GGAGGACATC GGGACAGGAA GCCACTGAAT 3780 

CTCCTTCAGG GGCCCCAGGA ATCCCTACCT 3840 

CCAAGGCAGA GCTGCTGAAC AAACTGAAAQ 3900 

GAGAGGAGGA AGTTGACCAT 6AACTGGCTC 3960 

GCAGAAAACT TTCTGTCTTG CGGGAG6CCC 4020 

ATTCT6CCCT TGGGGAGGAG 6TG6AGGCCA 4080 

TTGAAAAGTA CCACTTGTTT GTTGGGGACC 4140 

TCTCTGGACG ACTG6CCCG6 6TGGAGAATG 4200 

AGGAGAAGTT 6GTACT6ATA GAGAAGAAGC 4260 

AGGAGCTGAA G6AGCACGTG GACCGC066G 4320 

ACCTGCCTCA GGACCA6CTC CAAGATTACC 4380 

TCATTGAACA GCGAGAGCT6 GA6GAGAAGA 4440 

TCAGGGAGAG TCTACTCCTG GGGCCCAGCA 4500 

CATCCCTGCC CAGCCATGTG GGAAGTGCTT 4560 

AGTAGATAGC AATTAGCAGT TTGTTCCAGC 4620 

TCCCTAGCAG TGGTCCTAAC CAGCTAGGAG 4680 

GGGAGCTGCA GCAAGGTGTG ATCTTAGAAC 4740 

AAGTACTTGC TGCACAGAGA ACCAAGGJW^G 4800 

AAAGACATCA AGTACTCATC ACCCACCCAT 4860 

AATCTGCCTT GCAGCTCTAC TCTGCCCCAG 4920 

CTGGCTGCCT TGAGGGCATT CACCTGGCAC 4980 

GGGAAAGCAC AGAGGGAGGA ATTACACTGA 504 0 

AGGTGCCCCT TAGGAAGGAA CCAGGTTTAA 5100 

AAGTTCAGGG GAAGA6GCCT ACTCTTAGCC 5160 

GACTTTGGTC CTACCACCTC TTGTTTCATC 5220 

6TGCCTACCC CAGGGCTTCA CCATAT6GGC 5280 

6ATGTA6GTT TCATTATTGG 6GGAGGGGGT 5340 

ATTTTATTTA TTTTTATGTT TTGATTATTT 5400 

GGGAAGTTGG AGAGGGAAAA GTTAGCCCAG 5460 

ATCCACC6TG GCTAGGCATA TTCTTGCTCT 5520 

TGAATTTAGG AAAGTTGCAG GATACAAGGT 5580 

AATGTTATTA A6AAAACAGT TCCGGCCGGG 5640 

TTTGGGAG6C C6AGGCAGGT 6GATCACGAG 5700 

CGGTGAAACC CTATCTCTAC TAAAAATACA 5760 

AGTAGTCCCA GCTACTCGGG AGGCTGAGGC 5820 

ATTGCAGTGA GCTGAGACCA CACCACTGCA 5880 

TCTCAAAAAG AAAGAAAGAA AGAAAGAAA6 5940 
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AAAGAAAGAA AGAAAAGAAA GAAAGAAAGA AAGAAAGAAA ACAGTTCCAT TTACAATAGC 6000 
ATC 

Seq ID NO: 59 Protein sequence; 
5 Protein Accession #: XP_050478 

1 11 21 31 41 51 

I I I I I I 

MENRP6SFQY VPVQLQG6AP WGFTIiKGGLE HCEPLTVSKI EDGGKAALSQ KMRTGDEZiVN 60 

10 INGTPLYGSR QEALILIKGS FRILKLIVRR RNAPVSRPHS WHVAKIiLEGC PEAATTMHFP 120 
SBAFSIiSNHS GCNTSDVCVQ WCPLSRHCST BKSSSZGSME SltBQPGQATY BSHLLPIDQH 180 
MYPNQRDSAY SSFSASSHAS DCALSLRPEE PASTDCIMQG PGPTKAPS6R PNVABTSG6S 240 
RRTNGGHIiTP SSQMSSRPQE GYQSGPAKAV R6PPQPPVRR DSLQT^RAQIi XiNGBQRRASE 300 
PWPLPQKEK IjSLEPVLPAR NPNRFCCLSG HDQVTSEGHQ NCEFSQPPES SQQGSEHIjtiM 360 

15 QASTKAVGSP KACDRASSVD SNPLNEASAE IiAKASFGRPP HI>1GPTGHRH SAPEQliLASH 420 
LQHVHIiDTRG SKGMELPPVQ DGHQWTLSPIi HSSHKGKKSP CPPTGGTHDQ SSKERKTRQV 4 80 
DDRSLVIjGHQ SQSSPPHGEA DGHPSEKGPIi DPNRTSRAAS EIiANQQPSAS GSliVQQATDC 54 0 
SSTTKAASGT EAGEEGDSEP KECSRMGGRR SGGTRGRSIQ NRRKSERFAT NliRNEIQRRK 600 
AQLQKSKGPL SQLCDTKEPV EETQEPPESP PbTASNTStiL SSCKKPPSPR DKLFNKSMML 660 

20 RARSSECLSQ APESHESRTG LEGRISPGQR PGQSSLGIjNT WWKAPDPSSS DPEKAHAHCG 720 
VRGGHWRWSP EHNSQPLVAA AMEGPSNPGD NKELKASTAQ AGEDAILIiPF ADRRKFPEES 780 
SKSLSTSHLP GLTTHSNKTF TQRPKPIDQN FQPMSSSCRE LRRHPMDQSY HSADQPYHAT 840 
DQSYHSMSPL QSETPTYSEC FASKGLENSM CCKPLHCGDF DYHRTCSYSC SVQGALVHDP 900 
CIYCSGEICP ALLKRNMMPN CYNCRCHHHQ CIRCSVCYHN PQHSALEDSS IiAPGNTWKPR 960 

25 KLTVQEFPGD KWNPITGNRK TSQSGREMAH SKTSFSWATP FHPCLENPAL DLSSYRAISS 1020 

liDIiliGDFKHA LKKSEETSVY EEGSSLASMP HPLRSRAFSE SHISLAPQST RAWGQHRREL 1080 

FSKGDETQSD LLGARKKAFP PPRPPPPNWE KYRLFRAAQQ QKQQQQQQKQ QEEEEEEEEE 1140 

EEBEEEEEEE EAEEBEEELP PQYFSSETSG SCAIiNPEEVL EQPQPLSFGH LBGSRQGSQS 1200 

VPAEQESFAL HSSDFLPPIR GHL6SQPEQA QPPCYY6IG6 LWRTSGQBAT ESAKQEFQHF 1260 

30 SPPSGAPGIP TSYSAYYMIS VAKAELI^NKL KDQPBMAEI6 LGEEEVDHEL AQKKIQIiIES 1320 

ISRKIiSVIiRE AQRGLIiBDIN ANSALGEEVE ANLKAVCKSN EFEKYHLFVG DLDKWNLLL 1380 

SLSGRIiARVB NALNSIDSEA NQEKIiVIiXEK KQQLTGQIiAD AKBIJCEHVDR REKIjVFGMVS 1440 
RYIiPQDQLQD YQHFVKMKSA LIIEQREIiEE KIKLGBEQI>K CI«RESLIiIiGP SMF 

35 

Seq ID NO: 60 Nucleotide sequence: 
Nucleic Acid Accession #: NM_014 705 

Coding sequence: 192.. 2489 (underlined sequences correspond to start and stop codons) 

40 

1 11 21 31 41 51 

I I I I I I 

GGGAGAAGCT AGGAAAA7UVT GTCTTTGAGC TGTGAGATGC TTGTATATTT TGAAAATATG 60 
ATTATATGCA TGTGTTTGTA TTTTATGACT TGGATAATCT GAAAATCAAT TTGCTTTGTC 120 

45 AATGCTTCCT GGATTAGAAT TCCACTATTT GGTCCCTATC CTAGTCTACT AAAGAAAATT 180 
GAGCGGGAAA CATGG CGGGA AAGTGGCGTT TCATTAATTG CTACTGTAAC TCGTCTAATG 240 
GAGAGGTTGT TAGATTACAG AACTTCTATA AGACTGAACT GAACAAGGAG GAGATGTATA 300 
TACGCTACAT TCACAAACTC TATGATCTGC ATCTCAAAGC ACAGAACTTT ACAGAAGCTG 360 
CATATACCCT CCTCTTATAT GACGAGCTAC TGGAATGGTC TGATCGGCCX: CTCAGGGAGT 420 

50 TCCTGACCTA CCCCATGCAA ACAGAAT6GC AGCGCAAAGA GCACCT6CAC CTCACCATCA 4 BO 
TCCAGAACTT TGACAGAGGC AAAT6TTGGG AGAATG6CAT TATCTTGT6C CGGAAGATT6 540 
CAGAGCAGTA TGAGAGTTAT TATGACTACA GAAACCT6AG CAAGATGC6G ATGATG6AAG 600 
CCTCTTTGTA T6ACAAAATT AT6GACCAGC AACGTCTTGA ACCAGAGTTC TTCA6AGTTG 660 
GATTTTAT6G AAAAAAATTT CCATTTTTCT TAAGAAATAA GGAGTTTGTG TGTCGAGGGC 720 

55 ATGACTACGA GAGGCTGGAA GCCTTCCAAC AGA6AATGCT GAACX3A6TTC CCCCAT6CCA 780 
TCGCCATGCA GCACGCCAAC CAGCCCGATG AGACCATCTT CCAGGCAGAA GCTCAGTATT 840 
TGCAGATATA TGCTGTGACT CCCATTCCAG AGAGCCAGGA GGTCCTGCAG AGAGAGGGTG 900 
TTCCGGACAA CATCAAAAGC TTCTATAAAG TGAATCACAT CTGGAAATTC CGCTATGACC 960 

GACCATTTCA CAAAGGCACA AAAGATAAAG AGAATGAATT CAAGAGTCTC TGGGTGGAGA 1020 

60 GAACGTCATT ATACTTGGTG CAGAGTTTGC CTGGCATCTC TCGCTGGTTT GAAGTGGAAA 1080 

A6CGTGAAGT GGTAGAAATG AGTCCTCTGG AAAATGCAAT TGAAGTGCTA GAAAATAAGA 1140 

ATCAGCAGCT GAAGACTCTG ATTAGTCAGT GTCAGACAAG ACAGATGCAG AATATTAATC 1200 

CCCTGACTAT GTGCCTGAAT GGAGTTATAG ATGCTGCAGT TAATGGTGGC GTTTCCAGGT 1260 

ATCAAGAGGC ATTCTTTGTC AAAGAATATA TCTTAAGTCA CCCTGAAGAT GGGGAGAAAA 1320 

65 TTGCACGATT AAGAGAGCTG ATGCTTGAGC AGGCACAGAT TCTGGAATTT GGTTTGGCCG 13 80 

TGCATGAGAA GTTTGTACCT CAAGATATGA GACCCCTTCA CA7UUVAGCTG GTTGACCAAT 1440 

TCTTTGTGAT GAAGTCGAGC TTAGGGATAC AGGAGTTCTC TGCTTGTATG CAAGCCAGTC 1500 

CTGTCCATTT TCCTAATGGA AGCCCTCGTG TGTGTAGAAA CTCAGCACCT GCTTCTGTGA 1560 

GCCCAGATGG TACCAGGGTA ATTCCTAGAC GCAGCCCGTT AAGTTACCCA GCT6TC3ACC 1620 

70 GATATTCTTC CTCCTCACTG TCCTCACAAG CTTCTGCTGA AGTAAGCAAT ATTACAGGGC 1680 

AATCAGAAAG CTCTGATGAA GTCTTTAACA TGCAGCCAAG TCCATCTACC TCAAGCTTGA 1740 

GTTCTACTCA CTCGGCTTCA CCTAATGTGA CAAGTTCTGC TCCATCGAGT GCCA6AGCTT 1800 

CTCCTTTGTT GTCTGACAAA CACAAACATT CCCGAGAAAA CTCTT6CCT6 TCACCAAGAG 1860 

AGA6ACCATG CAGTGCCATC TATCCAACAC CTGTGGAGCC TTC6CAGAGG ATGCTGTTTA 1920 

75 ATCATATTGG AGACGGGGCC TT6CCACGCA GTGACCCAAA TCTCTCT6CA CCTGAAAAAG 1980 

CTTCACCA6C AAGACACACG ACATCAGTAT CCCCCTCGCC TGCCG6GCGA TCTCCATTGA 2040 
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10 



15 



20 



25 



30 



35 



40 



45 



AGGGCTCTGT 
CCAACTCCCC 
GCACGTC66A 
TGCCAGTGCC 
AGACTCCGCC 
ACAGCCTCTC 
CGCGATCCAG 
CCCTGCCCCG 
TTGCCCX3TTT 
CTTACTCAGC 
ATATTAAAAG 
AAAACGTTTC 
GTTCACTTTT 
TGTGTGACAA 
TCAGTGCACA 
TTTATCAGTG 
GGGGTTGCAA 
GTTGCTTTTG 
CTAAATCCCT 
TGCTTCAGAG 
ATTTTGCGCT 
TGCTGAAGTA 
TACAGACAGT 
TTAGCACGAT 
CACAAAATAA 
CCTGTCGTCT 
ATAGCATCTT 
CTTGTATGTA 
TCAATTGACA 
TGTTGGGAAG 
A6TGACCACT 
TGGTTATAGA 
TAATAAGAGA 
CAGTATGATT 
ATGTGCTTTT 
ATATGTTAAA 
TTTAGCTTTG 
GATAACACTT 
TATTTCTGTT 
AACAAAGCTT 
GGGCACATTA 
AGTGAGCAGT 
AGCAGAACAT 
TAAATATTAA 



GCAGTCTTTC 
TGTCTTGTCG 
AACCTCAGGC 
GGTGCCCGT6 
CCCGTACAGC 
CATCCCCGTC 
CCACCTGGAG 
CAAGGTCTCT 
ACAAAATAAG 
TCCTTCGATG 
TTGCTGATCT 
TCATTTGGAA 
TTTTATATAG 
TGTTGTATCG 
TCCTAACACA 
CAGATCATCA 
AGCTTGGGAC 
CTGTTTGACT 
TACTACCCTG 
TTCCAATCAG 
TCCTTTCTTG 
TTAATGAGGC 
ATTAAATTAA 
TT^TGGCAA 
AAAATTCCTC 
TTCTCTTGAA 
AGGTCTAGAT 
GTAATGTATT 
ATAGATATGA 
CTCATTTTAQ 
TTTTTATATT 
GGTTTTCTAT 
TACCATTATG 
GTTATAATTA 
AACAATTCTG 
TATTTATGTT 
TATCTTGCAA 
CGTOTTTGTA 
GCTTTAAAGA 
CCTTGATTTC 
TAAATCAGTG 
TGAATTTATC 
TTTAAGAGAT 
ATAAAATTAC 



ACCCCCTCTC 
GGCAGCTACA 
TTTGAAAATC 
CCGAGCTACG 
GTCTACGAGC 
ACGTCGGAGC 
AAT66G6CCC 
CAGTTATAAG 
AAGTATGATG 
AAT6GAATTA 
AAAACGCCAG 
GTGGTAAATA 
TTTAATCTTA 
TTTTTACTGA 
GTGGTCCTTA 
GAATTAAAGT 
TGGAAATTGT 
GCTGTCTACA 
ACACCGTGGT 
CTAGATTAAG 
ATAGTTTCCT 
ACAAATGACT 
TAGCTTAAGT 
AAGGACTTAT 
ACX3ACTCTCC 
G6AG6ATT6C 
AGGGATGCTA 
TTATATCTTT 
ACTGTATTTT 
TTTAACCATG 
CTCTTAATGA 
AATAAATGTT 
TGTAAAAAAA 
TGCCAAATAC 
CCATATTGAC 
TAGTGAAAGT 
GTTTTGCAGT 
ACCACATTCA 
AGTAAAACCT 
CTTTTCCTGT 
TTATTTGCTC 
TTGAATTTAT 
TCTGTTAGCC 
CAGATTAATC 



CAGTGGAGTA 
GCAGTGGGAT 
A6GTGAATGA 
GCX3GGGAGGA 
GGACTCTGCG 
CGCCCGCGCT 
GGAGGACTGA 
TCACTTTTCT 
AGAA6ACATT 
AAACTTGCTT 
ATGTTT^AATG 
GTGATAAAGA 
AT^CCAATAC 
ATACTTGATA 
TTTTAGAAGA 
TCAAGCAGGC 
TTTGTTCTTG 
TTCGTAAAAT 
ATCTACTGTA 
CAAGAGGCTC 
ATATAAAATT 
GTGCCCCATT 
GAAGAAAAAA 
AAAAGGCAAG 
ACmTTACCA 
TGTAGACTTC 
ATGCCAGTTG 
GTTTTTTCTT 
AAATCATACT 
TTTGTTTTGT 
AACCATTCAG 
CAAGTATTTT 
AGTAAAAATA 
TTTACGTATG 
TTTACAATTT 
GTTCATAATT 
CAGAAATTTT 
TATATATATA 
TCCATTTAT^ 
GTAATTTAAT 
TTGGAGCCAT 
CATGTGTGTG 
CACATGTTCA 
TT 



CCACTCGCCA 
TTCTTCTCTC 
ACAGTCGGCC 
GCCAGTGCGC 
GCGCCCCGTC 
GCCCCCCAAG 
CCCCGGCCCG 
ATGTACCTGC 
TAGTGTAGGC 
ATTA7VATATC 
AAGTATGGCT 
CTCeTTTTGT 
GATATTGTCA 
CTTGGAGAAA 
CTTCTGTAAA 
GAGCAAGACA 
AAACAAAATA 
TCTATTTTGT 
TTTCTTTTCA 
CAGAAGAAAT 
TGTCATTGAA 
AGCAAGAATT 
AAAAACTTAG 
GGCATTAACT 
GTGGAGTTTG 
TCTA6CTTGA 
TA6AAGTGTG 
TTACTGACTG 
6TTAAATATT 
TGGTAGCTTA 
CAGGTATATG 
TGTATATAAC 
AACGCAAACA 
GAAAAAGAAT 
TGAATGTCGG 
GAGAAAAGGA 
TTGAACTAGC 
CATATATATG 
TAAGATGACA 
AGATTTGTTG 
TTTTTAAAAA 
TATTTCTGAA 
TGTTGGTTGC 



GGACTCATCT 
AGCCGGTGCA 
CCCXTTGCCGG 
AAGGAGAGCA 
CCGCTACCTC 
CCTCTGGCAG 
CGGCCCAGGC 
GAT6CATTCT 
ACTTTAATAA 
ATGTTGCACA 
GAATTTCATT 
ACCTTTTTAT 
7VACGATACAA 
GCTTATTAAG 
TAAGGCAAGG 
GTATACTT7A 
CTTCTTTAAG 
GAATTGGTAG 
AGGTGCAATT 
GTTTACTTGA 
CAAGAGCAAA 
CAGGAATCAA 
TGAAAATGTA 
TTCAGTCCT6 
TCTTAGCTGA 
ATATTGCAAC 
AAAATkAGCAC 
TTTATAACAC 
TTCCCTCTTT 
CCTGGAAG6C 
CTGTTGAGGC 
TGGg?TAATTT 
GTTGTTGATG 
ATTTGTACAT 
AAAAATTAAT 
ACATATGCAT 
TTTTGCTTTT 
TGAAGCTCCA 
TGCATAAGAT 
ACTAGTGCTT 
AAATTTTGGC 
GCAGCTACAT 
TGCTGAATGG 



2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 



Seq ID NO: 61 Protein sequence; 
Protein Accession # : NP 055520 



50 



55 



60 



65 



MAGKWRFINC 
IiIiYDEIiLEWS 
ESYYDYRNIiS 
RLBAFQQRMIi 
IKSFYKVNHI 
VEMSPIiENAI 
FFVKEYIIiSH 
KSSIiGIQEFS 
SSXiSSQASAE 
SDKHKHSREN 
RHTTSVSPSP 
TSGFENQVNE 
IPVTSBPPAL 



11 
I 

YCHSSNGBW 
DRPIiREFLTY 
KMRMMBASItY 
NEFPHAIAMQ 
WKFRYDRPFH 
EVLENKNQQL 
PEDGEKIARL 
ACMQASPVHP 
VSNITGQSES 
SCLSPRERPC 
AGRSPLKGSV 
QSAPIiPVPVP 
PPKPLAARSS 



21 
I 

RIiQNFYKTEIj 
PMQTEWQRKE 
DKIMDQQRIiH 
HANQPDETIF 
KGTKDKENEF 
KTLISQCQTR 
REIiMLEQAQI 
PNGSPRVCRN 
SDEVFNMQPS 
SAIYPTPVEP 
QSFTPSPVEY 
VPVPSYGGEE 
HLENGARRTD 



31 
I 

NKBEMYIRYI 
HIiHLTIIQNF 
PEPPRVGFYG 
QAEAQYLQIY 
KSLWVERTSL 
QMQNINPLTM 
LEFGIiAVHBK 
SAPASVSPDG 
PSTSSLSSTH 
SQRI^FNHIG 
HSPGLISNSP 
PVRKESKTPP 
P6PRPRPLPR 



41 
I 

HKIiYDZiHIiKA 
DRGKCWEN6I 
KKFPPFliRNK 
AVTPIPESQE 
YLVQSLPGIS 
CLNGVIDAAV 
FVPQDMRPLH 
TRVIPRRSPL 
SASPNVTSSA 
DGALPRSDPN 
V1*SGSYSSGI 
PYSVYERTLR 
KVSQL 



51 
I 

QNFTEAAYTIi 
IIiCRKIABQY 
EFVCRGHDYE 
VliQREGVPDN 
RWFEVEKREV 
NGGVSRYQEA 
KKIiVDQFPVM 
SYPAVNRYSS 
PS SARAS PLIj 
liSAPEKASPA 
SSLSRCSTSE 
RPVPIiPHSIiS 



60 
120 
180 

240 
300 
360 
420 
480 
540 
600 
660 
720 



Seq ID NO: 62 Nucleotide sequence: 

Nucleic Acid Accession #: fgenesh prediction 

Coding sequence; 1..2561 (underlined sequences correspond to start and stop codons) 



70 1 11 21 31 41 51 

I I I I I I 

ATGG ACCGAG. GCCAGGGTAA GAGGGGCCGC GA06CCCGCA CTTGTTGCG6 CGC06GGCG6 60 

GAAA6GGAGA CTGGACX3ATC TGAAGCCGGA GAGGAGGA6G 6AGAGAGGCG GGC6GTGGGG 120 

CGG66GCTGA GGAACGCTCG GAG6GGACT6 GGAGACGCGG CGCTTAT6CA AAGGTGCCTT 180 

75 CG6CTGCCGG GACAACCCGC CAGCAACCAG GTACAGCTCT CAGAGGTTCC ACAGAGGAAG 240 

CTCAGGGTCC CTGAATCTCC CAGTGTGGCA GAGAAAGTGA AACTTGGTCA CCGATGCCTG 300 



209 



8NSOOCID: <WO 0207949aA2_L> ' 



wo 02/079492 



PCT/US02/04915 



GAACTGCTGG AGCAGCTGCT CCCAGAGCTC ACCGGGCTGC TCAGCCTCCT GGACCACGAG 360 

TACCTCAGCG ATACCACCCT GGAAAAGAAG ATGGCCGTGG CCTCCATCCT GCAGAGCCTG 420 

CAGCCCCTTC CAGCAAAGGA GGTCTCCTAC CTGTATGTGA ACACAGCAGA CCTCCACTCG 480 

GGGCCCA6CT TCGTGGAATC CCTCTTTGAA GAATTTGACT GTGACCTGAG TGACCTTCGG 540 

GACATGCCAG AGGATGATGG GGAGCCCAGC AAAGGAGCCA GCCCTGAGCT AGCCAAGAGC 600 

CCACGCCTGA GAAACGC6GC CGACCTGCCT CCACCGCTCC CCAACAAGCC TCCCCCTGAG 660 

GACTACTATG AAGAG6CCCT TCCTCTG6GA CCCGGCAAGT C6CCTGAGTA CATCAGCTCC 720 

CACAAT6GCT GCAGCCCCTC ACACTCGATT GTGGATGGCT ACTATGAGGA CGCAGACAGC 780 

AGCTACCCTG CAACCAGGGT GAACGGCGAG CTTAAGAGCT CCTATAATGA CTCTGACGCA 840 

ATGA6CAGCT CCTATGAGTC CTACGATGAA GAGGAGGAG6 AAG6GAAGA6 CCCX3CAGCCC 900 

CGACACCAGT. 6GCCCTCAGA GGAGGCCTCC ATGCACCTGG TGAGGGAATG CAGGATATGT 960 

6CCTTCCTGC TGCGGAAAAA GCGTTTCGGG CAGTGGGCCA AGCAGCTGAC GGTCATCAG6 1020 

GAGGACCAGC TCCTGTGTTA CAAAAGCTCC AAGGATCGGC AGCCACATCT 6AGGTTGGCA 1080 

CTGGATACCT GCAGCATCAT CTAC6TGCCC AAGGACAGCC GGCACAAGAG GCACGAGCTG 1140 

CGTTTCACCC AGGGGGCTAC CGAGGTCTTG GTGCTGGCAC TGCAGAGCCG AGAGCAGGCC 1200 

GAGGAGTGGC TGAAGGTCAT CCGAGAAGTG AGCAAGCCAG TTGGGGGAGC TGAGGGAGTG 1260 

GAGGTCCCCA GATCCCCAGT CCTCCTGTGC AAGTTGGACC TGGACAAGAG GCTGTCCCAA 1320 

GAGAAGCAGA CCTCAGATTC TGACAGCGTG GGTGTGGGTG ACAACTGTTC TACCCTTGGC 1380 

CGCCGGGAGA CCTGTGATCA CGGCAAAGGG AAGAAGAGCA GCCTGGCAGA ACTGAAGGGC 1440 

TCAATGAGCA GGGCTGCGGG CCGCAAGATC ACCCGTATCA TTGGCTTCTC CAAGAAGAAG 1500 

ACACTGGCCG ATGACCTGCA GACGTCCTCC ACCGAGGAGG AGGTTCCCTG CTGTGGCTAC 1560 

CTGAACGTGC TGGTGAACCA GGGCTGGAAG GAACGCTGGT GCCGCCTGAA GTGCAACACT 1620 

CTGTATTTCC ACAAGGATCA CATGGACCTG CGAACCCATG TGAACGCCAT CGCCCTGCAA 1680 

GGCTGTGAGG TGGCCCCGGG CTTTGGGCCC CGACACCCAT TTGCCTTCAG GATCCTGCGC 1740 

AACCGGCAGG AGGTGGCCAT CTTGGAGGCA AGCTGTTCAG AGGACATGGG TCGCTGGCTC 1800 

GGGCTGCTGC TGGTGGAGAT GGGCTCCAGA GTCACTCCGG AGGCGCTGCA CTATGACTAC I860 

GTGGATGTGG AGACCTTAAC CAGCATCGTC AGTGCTGGGC GCAACTCCTT CCTATATGCA 1920 

AGATCCTGCC AGT^TCAGTG GCCTGAGCCC CGAGTCTATG ATGATGTTCC TTATGAAAAG 1980 

ATGCAGGACXi AGGAGCCCGA GCGCCCCACA GGG6CCCAGG TGAAGCGTCA CGCCTCCTCC 2040 

TGCAGTGAGA AGTCCCATCG TGTGGACCCG CAGGTCAAAG TCA7ACGCCA CX3CCTCCAGT 2100 

6CCAATCAAT ACAA6TATGG CAAGAACCX3A GCCGAGGAGG ATGCCCX^GAG GTACTTGGTA 2160 

GAAAAAGAGA AGCTGGAGAA AGAGAAAGAG ACQATTCGGA CAGAGCTGAT AGCACTGAGA 2220 

CAGGA6AAGA 6GGAACTGAA GGAAGCCATT CGGAGCAGCC CAGGAGCAAA ATTAAAGGCT 2280 

CTGGAAGAAG CCGTG6CCAC CCTGGAAGCT CAGTGTCGGG CAAAG6AGGA 6CGCC6GATT 2340 

GACCTGGAGC T6AAGCTG6T GGCT6TGAAG GAGCGCTTGC AGCAGTCCCT GGCAGGA6G6 2400 

CCAGCCCTG6 G6CTCTCCGT GAGCAGCAAG CCCAAGAGT6 GGCAACTCTC TGAGGAAGAT 2460 

ACGCTCACCT CCAATGGTGC TCTCTCAGAG AGAACTTCTC TGACCTCATC TACACCAGGG 2520 
CTTCTCAACC CCAACACTAC TGACATTTT6 QACCA GTAA 



Seg ID NO: 63 Protein aeouence; 

Protein Accession #: fgenesh prediction 



1 11 21 31 • 41 51 

I I I I I I 

MDRGQGKRGR DARTCCGAGR ERETGRSEAG EEEGERRAVG RGUINARRGL GDAALMQRCL 60 

RIjPGQPASNQ VQLSEVPQRK LRVPESPSVA EKVKLGHRCL EIjLBQIjLPGIi TGLLSIiIiDHE 120 

YIjSDTTLEKK MAVASILQSIi QPLPAKEVSY IiYVNTADLHS GPSFVESLFE EFDCDIiSDUl 180 

DMPEDDGEPS KGASPELAKS PRLRNAADLP PPIiPNKPPPE DYYEEALPLG PGKSPEYISS 240 

HMGCSPSHSI VDGYYEDADS SYPATRVNGE IiKSSYNDSDA MSSSYESYDB EEEEGKSPQP 300 

RHQWPSEEAS MHLVRECRIC AFXjLRKKRFG QWAKQLTVIR EDQLIiCYKSS KDRQPHLRLA 360 

liDTCSIIYVP KDSRHKRHEIj RFTQGATEVIi VliALQSREQA EEWUCVXREV SKPV6GAEGV 420 

EVPRSPVMjC KIiDLDKRIiSQ EKQTSDSDSV GVGDNCSTLG RRETCDKGKG KKSSLAELKG 480 

SMSRT^GRKI TRIIGFSKKK TIiADDLQTSS TEEEVPCCGY LNVLVNQGWK ERWC3UiKCNT 540 

LYFHKDHMDL RTHVNAIALQ GCEVAPGFGP RHPFAFRILR NRQEVAILEA SCSEDMGRWIj 600 

GLLLVEMGSR VTPEALHYDY VDVETIiTSIV SAGRNSFLYA RSCQNQWPEP RVYDDVPYBK 660 

MQDEEPERPT GAQVKRHASS CSEKSHRVDP QVKVKRHASS ANQYKYGKNR AEEDARRYLV 720 

EKEKLEKEKE TIRTELIALR QEKRELKEAI RSSPGAKLKA LEEAVATLEA QCRAKEBRRI 780 

DltELKIiVAVK ERLQQSIiAGG PALGLSVSSK PKSGQIiSEED TLTSNGALSE RTSIjTSSTPG 840 
XiliNPNTTDIIi DQ 

Seq ID NO: 64 Nucleotide aegruence: 
Nucleic Acid Accession #: NM_004126.1 

Coding sequence: 108-129 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

GGCACGAGCT CGTGCCGGCC TTCAGTTGTT TCGGGACGCG CCXSAGCTTCG CCGCTCTTCC 60 

AGCGGCTCCG CTGCCAGAGC TAGCCCGAGC CCX«5TTCTGG GGCGAA AATG CCTGCCCTTC 120 

ACATCGAAGA TTTGCCAGAG AAGGAAAAAC TGAAAATGGA A6TT6AGCAG CTTCGCAAAG 180 

AAGTGAAGTT GCAGAGACAA CAJ\GTGTCTA AATGTTCTGA AGAAATAAAG AACTATATTG 240 

AA6AACGTTC TGGAGAGGAT CCTCTAGTAA AGGGAATTCC AGAAGACAAG AACCCCTTTA 300 

AAGAAAAAGG CAGCTGTGTT ATTTC ATAA A TAACTTGGGA GAAACTGCAT CCTAAGTGGA 360 

AGAACTA6TT TGTTTTAGTT TTCCCAGATA AAACCAACAT 6CTTTTTAA6 GAAGGAAGAA 420 
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TGAAATTAAA AGGAGACTTT CTTAAGCACC ATATAGATAG GGTTATGTAT AAAAGCATAT 480 
GTGCTACTCA TCTTTGCTCA CTATGCAGTC TTTTTTAAGA GAGCAGAGAG TATCAGATGT 540 
ACAATTATGG AAATAAGAAC ATTACTTGAG CATGACACTT CTTTCAGTAT ATTGCTTGAT 600 
GCTTC2U\ATA AAGTTTTGTC TT 

5 

Seq ID NO: 65 Protein secnaence! 
Protein Accession #: KP_004117 



10 1 11 ' 21 31 41 51 

I I I I I I 

MPAIiHIEDI*P EKEKLKMBVB QLRKEVKIiQR QQVSKCSEEl KNYIBERSGE DPLVKGIPED 60 
KNPPKEKGSC VIS 

15 

Seq ID NO: 66 Nucleotide aegruence; 
Nucleic Acid Accession #: NM_003842.l 

Coding sequence: 1-1236 (underlined sequences correspond to start and stop codons) 

20 

1 11 21 31 41 51 

I I I I I I 

ATGGAACAAC GGGGACAGAA CGCCCCGGCC GCTTCGGGGG CCCGGAAAAG GCACGGCCCA 
GGACCCAGGG AGGCGCGGGG AGCCAGGCCT GGGQCCCGGG TCCCCAAGAC CCTTGTGCTC 

25 GTTGTCGCCG CGGTCCTGCT GTTGGTCTCA GCTGAGTCTG CTCTGATCAC CCAACAA6AC 
CTAGCTCCCC AGCAGAGAGC GGCCCCACAA CAAAAGAG6T CCA6CCCCTC AGAGGGATTG 
TGTCCACCTG GACACCATAT CTCA6AAGAC GGTAGAGATT GCATCTCCTG CAAATATGGA 
CAGGACTATA GCACTCACTG GAATGACCTC CTTTTCTGCT TGCGCTGCAC CAGGTGTGAT 
TCAGGTGAAG TGGAGCTAAG TCCCTGCACC ACGACCAGAA ACACAGTGTG TCAGTGCGAA 

30 GAAGGCACCT TCCX^GGAAGA AGATTCTCCT GAGATGTGCC GGAAGTGCCG CACAGGGTGT 
CCCAGAG6GA TGGTCAAGGT CGGT6ATPGT ACACCCTGGA GT6ACATCGA ATGTGTCCAC 
AAAGAATCAG GCATCATCAT AGGAGTCACA GTTGCAGCCG TAGTCTTGAT TGTGGCTGTG 
TTTGTTTGCA AGTCTTTACT GTGGAAGAAA GTCCTTCCTT ACCTGAAAGG CATCTGCTCA 
GGTGGTG6TG GGGACCCTGA GCGTGTGGAC AGAAGCTCAC AACGACCTGG GGCTGAGGAC 

35' AATGTCCTCA ATGAGATCGT GAGTATCTTG CAGCCCACCC AGGTCCCTGA GCAGGAAATG 
GAAGTCCAGG AGCCAGCAGA GCCAACAGGT GTCAACATGT TGTCCCCCGG GGAGTCAGAG 
CATCTGCTGG MCCGGCAGA AGCTGAAAGG TCTCAGAGGA GGAGGCTGCT GGTTCCAGCA 
AATGAAGGTG ATCCCACTGA GACTCTGAGA CAGTGCTTCG ATGACTTTGC AGACTTGGTG 
CCCTTTGACT CCTGGGAGCC GCTCATGAGG AAGTTGGGCC TCATGGACAA TGAGATAAAG 

40 GTGGCTAAAG CTGAGGCAGC GGGCCACAGG GACACCTTGT ACACGATGCT GATAAAGTGG 
GTCAACAAAA CCGGGCGAGA TGCCTCTGTC CACACCCTGC TGGATGCCTT GGAGACGCTG 
GGAGAGAGAC TTGCC7AGCA GAAGATTGAG GACCACTTGT TGAGCTCTGG AAAGTTCATG 
TATCTAGAAG GTAATGCAGA CTCTGCCATG TCCTAA 

45 Seq ID NO: 67 Protein sequence; 

Protein Accession #: NP_003833.l 

1 11 21 31 41 51 

50 MEQRGQNAPA ASGARKRHGP GPRBARGARP GPRVPKTLVI. WAAVLIiLVS AESALITQQD 
IjAPCX)RAAPQ QKRSSPSEGIi CPPGHHISED GRDCISCKYG QDYSTHVINDL LFCIjRCTRCD 
SGEVELSPCT TTRNTVCQCE EGTFREEDSP EMCRKCRTGC PRGMVKVGDC TPWSDIECVH 
KESGIIIGVT VAAWLIVAV PVCKSLLWKK VIjPYIiKGICS ggggdpervd rssqrpgaed 
NVIiNEIVSIL QPTQVPEQEM EVQEPAEPTG VNMIjSPGESE HI»LEPAEAER SQRRRIjLVPA 

55 NEGDPTETLR QCFDDFADLV PFDSWEPLMR KliGLMDNEIK VAKAEAAGHR DTLYTMLIKW 
VNKTGRDASV HTLIjDAIjETL GERLAKQKIE DHLLSSGKFM YLEGNADSAM s 

Seq ID NO: 68 Nucleotide sequence t 
Nucleic Acid Accession #: FGENESH predicted ORF 
60 Coding sequence: . 361- 2220 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I i I 

GGCACCATCT GCTCCCTGCC CTGCCCAGAG GGCTTTCACG GACCCAACTG CTCCCAGGAA 60 

65 TGTCGCTGCC ACAACGGCGG CCTCTGTGAC CGATTCACTG GGCAGTGCCG CTGCGCTCCG 120 

GGTTACACTG GGGATCGGTG CCGGGAGGAG TGCCCGGTGG GCCGCTTTGG GCAGGACTGT 180 

GCTGAGACGT GCGACTGCGC CCCGGACGCC CGTTGCTTCC CGGCCAACGG CGCAT6TCTG 240 

TGCGAACACG GCTTCACTGG GGACCGCTGC AGGGATCGCC TCTGCCCCGA CGGCTTCTAC 300 

GGTCTCAGCT GCCAGGCCCC CTGCACCTGC GACCGGGAGC ACAQCCTCAG CT6CCACCCG 360 

70 ATGAACGGGG AGTGCTCCTG CCTGCCGGGC TGGGCGGGCC TCCACTGCAA CGAGAGCT6C 420 

CCGCAG6ACA CGCATGGGCC AGG6TGCCAG GAGCACTGTC TCTGCCTGCA CG6TGGCGTC 480 

TGCCAGGCTA CCAGCGGCCT CTGTCAGTGC GC<3CCG6GTT ACACGGGCCC TCACTGT6CT 540 

AGTCTTTGTC CTCCTGACAC CTACGGTGTC AACTGTTCTG CACGCTGCTC ATGTGAAAAT 600 

GCCATC6CCT GCTCACCCAT CGACXiGCGAG TGCGTCTGCA AGGAAGGTTG GCAGCGTGGT 660 

75 AACTGCTCTG TGCCCTGCCC ACCOGGAACC TGQGGCTTCA GTTGCAATGC CAGCTGCCA6 720 

TGTGCCCATG AGGCAGTCTG CAGCCCCCAA ACTGGAGCCT GTACCTGCAC CCCTGGGTGG 780 



60 
120 
180 
240 
' 300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 



60 
120 
180 
240 
300 
360 
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CATGGGGCCC ACTGCCAGCT GCCCTGTCCG AAGGGGCAGT TTGGAGAAGG TTGTGCCAGT 840 

CGCTGTGACT GTGACCACTC TGATGGCTGT GACCCTGTTC ATGGACGCTG TCAGTGCCAG 900 

GCTGGCTGGA TGGGTGCCCG CTGCCACCTG TCCTGCCCTG AGGGCTTATG GGGAGTCAAC 960 

TGTAGCAACA CCTGCACCTG CAAGAATGGG GGCACCTGTC TCCCTGAGAA TGGCAACTGC 1020 

5 GTGTGTGCAC CCGGATTCCG GGGCCCCTCC TGCCA6AGAT CCTGTCAGCC TGGCCGCTAT 1080 

GGCAAACGCT GTGTGCCCTG CAAGTGCGCT AACCACTCCT TCT(3CCACCC CTCGAACGGG 1140 

ACCTGCTACT GCCTGGCTGG CTG6ACAG6C CCCGACTGCT CCCAGCX3CTG CCCTCTGGGG 1200 

ACATTTGGTG CTAACTGCTC CCAGCCATGC CAGTGTGGTC CTGGAGAAAA GT6CCACCCA 1260 

GA6ACTGGGG CCTGTGTATG TCCCCCAGGG CACAGTGGTG CACCTTGCAG GATTGGAATC 1320 

10 CAGGAGCCCT TTACTGTGAT GCCGACCACT CCAGTAGCGT ATAACTCGCT GGGTGCAGTG 1380 

ATTG6CATT6 CA6TGCTGGG GTCCCTTGTG GTAGCCCTGG TGGCACTGTT CATTG6CTAT 1440 

CGGCACTGGC AJ\AAA66CAA GGA6CACCAC CACCTG6CTG TG6CTTACA6 CAGCGGGCGC 1500 

CTGGACGGCT CCGAGTATGT CATGCCAGAT GTCCCTCC6A GCTACAGTCA CTACTACTCC 1560 

AACCCCAGCT ACCACACCCT GTCGCAGTGC TCCCCAAACC CCCCACCCCC TAACAA6GTT 1620 

15 CCAG6CCC6C TCTTTGCCAG CCTGCAGAAC CCTGAGCG6C CAGGTGGGGC CCAAGGGCAT 1680 

GATAACCACA CCACCCTGCC TGCTGACTGG AAGCACCGCC GGGAGCCCCC TCCAGGGCCT 1740 

CTGGACAGGG GGAGCAGCCG CCTGGACCGA AGCTACAGCT ATAGCTACAG CAATGGCCCA 1800 

GGCCCATTCT ACAATAAAGG GCTCATCTCT GTU^GAGGAGC TCGGGGCCAG TGTGGCTTCC 1860 

CTGAGCAGTG AGAACCCATA TGCCACCATC CGGGACCTGC CCAGCTTGCC AGGGGGCCCC 1920 

20 CGGGAGAGCA GCTACATGGA GATGAAAGGC CCTCCCTCAG GATCTCCCCC CAGGCAGCCT 1980 

CCTCAGTTCT GGGACAGCCA GAGGCGGCGG CAACCCCAGC CACAGAGAGA CAGTGGCACC 204 0 

TACGAGCAGC CCAGCCCCCT GATCCATGAC CGAGACTCTG TGGGCTCCCA GCCCCCTCTG 2100 

CCTCCGGGCC TACCCCCCGG CCACTATGAC TCACCCAAGA ACAGCCACAT CCCTGGACAT 2160 
TATGACTTGC CTCCAGTACG GCATCCCCCA TCACCTCCAC TTCGACGCCA GGACCGTTGA 



Seg ID NO: 69 Protein sequence : 

Protein Accession # : FGENESH prediction 

30 1 11 21 31 41 51 

I I I I I I 

GTICSIiPCPE 6FHGPNCSQE CRCHNGGLCD RFTGQCRCAP GYTGDRCREE CPVGRF6QDC 60 

AETCDCAPDA RCFPANGACIi CEHGFTGDRC TDRXiCPDGFY GLSCQAPCTC DREHSLSCHP 120 

MNGECSCIiPG WAGLHCNESC PQDTHGPGCQ EHCliCLHGGV CQATSGIiCQC APGYT6PHCA 180 

35 SLCPPDTYGV NCSARCSCEN AIACSPIDGE CVCKEGWQRG NCSVPCPPGT WGPSCNASCQ 240 

CAHBAVCSPQ TGACTCTPGW HGAHCQLPCP KGQFGEGCAS RCDCDHSOGC DPVHGRCQCQ 300 

AGWMGARCHL SCPEGLWGVN CSNTCTCKNG GTCLPENGNC VCAPGFRGPS CQRSCQPGRY 360 

GKRCVPCKCA NHSFCHPSNG TCYCIiAGWTG PDCSQRCPLG TFGANCSQPC QCGPGEKCHP 420 

ETGACVCPPG HSGAPCRIGI QEPFTVMPTT PVAYNSLGAV IGIAVLGSLV VALVALFIGY 480 

40 RHWQKGKEHH HIAVAYSSGR LDGSEYVMPD VPPSYSHYYS NPSYHTLSQC SPNPPPPNKV 54 0 

PGPLFASLQN PERPGGAQGH DNHTTLPADW KHRREPPPGP LDRGSSRLDR SYSYSYSNGP 600 

GPFYNKGLIS EEELGASVAS I»SSENPYATI RDLPSLPGGP RESSYMEMKG PPSGSPPRQP 660 

PQFWDSQRRR QPQPQRDSGT YEQPSPIiIHD RDSVGSQPPD PPGIjPPGHYD SPKNSHIPGH 720 
YDLPPVRHPP SPPX*RRQDR 

45 

Seq ID NO: 70 Nucleotide sequence: 
Nucleic Acid Accession #: NM_005458 

Coding sequence: 1..2826 (underlined sequences correspond to start and stop codons) 

50 

1 11 21 31 41 51 

^1 t I I I I 

ATGG CTTCCC C6CGGAGGTC CGGGCA6CCA GGGC6GCCGC CGCC6CCX3CC ACCGCCGCCC 60 

GCXSCXSCCTGC TACTGCTACT GCTGCTGCCX5 CTGCTGCTGC CTCT6GCGCC CGGGGCCTGG 120 

55 GGCTGGGCGC GGGGCGCCCC CCGGCCGCCX3 CCCA6CAGCC OSCCGCTCTC CATCATGGGC 180 

CTCATGCCGC TCACCAAGGA GGTGGCCAAG GGCAGCATCG GGCGCGGTGT GCTCCCCGCC 240 

GTGGAACTGG CCATCGAGCA GATCCGCAAC GAGTCACTCC TGCGCCCCTA CTTCCTCGAC 300 

CTGCGGCTCT ATGACACGGA GTGCGACAAC GCAAAAGGGT TGAAAGCCTT CTACGATGCA 360 

ATAAAATACG GGCCGAACCA CTTGATGGTG TTTGGAGGCG TCTGTCCATC CGTCACATCC 420 

60 ATCATTGCAG AGTCCCTCCA AGGCTGGAAT CTGGTGCAGC TTTCTTTTGC TGCAACCACG 480 

CCTGTTCTAG CCGATAAGAA AAAATACCCT TATTTCTTTC GGACCGTCCC ATCAGACAAT 540 

GCGGTGAATC CAGCCATTCT GAAGTTGCTC AAGCACTACC AGTGGAAGCG CGTGGGCACG 600 

CTGACGCAAG ACGTTCAGAG GTTCTCTGAG GTGCGGAATG ACCTGACTGG AGTTCTGTAT 660 

GGCGAGGACA TTGAGATTTC AGACACCGAG AGCTTCTCCA ACGATCCCTG TACCAGTGTC 720 

65 AAAAAGCTGA AGGGGAATGA TGTGCGGATC ATCCTTGGCC AGTTTGACCA GAATATGGCA 780 

GCAAAAGTGT TCTGTTGTGC ATACGAGGAG AACATGTATG GTAGTAAATA TCAGTGGATC 840 

ATTCCGGGCT GGTACGAGCC TTCTTGGTGG GAGCAGGTGC ACACGGAAGC CAACTCATCC 900 

CGCTGCCTCC GGAAGAATCT GCTTGCTGCC ATGGAGGGCT ACATTGGCGT GGATTTCGAG 960 

CCCCTGAGCT CCAAGCAGAT CAAGACCATC TCAG6AAAGA CTCCACAGCA GTATGAGA6A 1020 

70 GAGTACAACA ACAAGCGGTC AGGCGTGGG6 CCCAGCAAGT TCCACG6GTA CGCCTACGAT 1080 

GGCATCTGGG TCATC6CCAA GACACTGCAG AGGGCCATG6 AGACACT6CA TGCCAGCAGC 1140 

CGGCACCAGC GGATCCAGGA CTTCAACTAC ACGGACCACA CGCTGGGCAG GATCATCCTC 1200 

AATGCCATGA ACGAGACCAA CTTCTTCGG6 GTCACGG6TC AAOTTGTATT CCGGAATGGG 1260 

GAGAGAATG6 GGACCATTAA ATTTACTCAA TTTCAAGACA GCAGGGAGGT GAAG6TGGGA 1320 

75 GAGTACAACG CTGTGGCCGA CACACTGGAG ATCATCAATG ACACCATCAG GTTCCAAGGA 1380 

TCCGAACCAC CAAAAGACAA GACCATCATC CTGGAGCAGC TGC6GAAGA.T CTCCCTACCT 1440 
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10 



15 



20 



25 



CTCTACAGCA 
TTCTTCAACA 
AACCTTATCA 
G6ATCCTTTG 
ACCGTGG6CT 
ATCTTCAAAA 
6TGG6GG6CA 
CTGCGAAG6A 
ATCCGCCCTC 
TATGCCTACA 
GTCAGCATCC 
ATCATGTGCA 
TTCTGCATC6 
GTGCCGAAGC 
CAGTTCACTC 
AACCAAGCCA 
AAGATCACAG 
GAAAA6ACCA 
GGAAACTTCA 
AATCCCCAGC 
GATATAAACT 
CACGCCTACC 
CCCACCGCCA 
CTGTAA 



TCXrrCTCTGC 
TCAAGAACCG 
TCCTTGGAGG 
TCTCTGAAAA 
ACAC6ACCGC 
ATGTGAAAAT 
TGCTGCTGAT 
CA6TGGAGAA 
TCCTGGAGCA 
AGGGACTTCT 
CCGCACTCAA 
TCATCGGGGC 
TGGCTCTGGT 
TCATCACCCT 
AGAATCAG7VA 
GCACATCCCG 
AGCTGGATAA 
CCTACATTAA 
CTGAGAGCAC 
TACAGTGGAA 
CTCCAGAACA 
TCCCATCCAT 
GCCCCCGCCA 



CCTCACCATC 
GAATCAGAA6 
GATGCTCTCC 
GACCTTTGAA 
TTTTGGGGCC 
6AAGAAGAAG 
CGACCTGTGT 
6TACAGCATG 
CTGTGAGAAC 
CATGTTGTTC 
CGACAGCAAG 
CGCTGTCTCC 
CATCATCTTC 
GAGAACAAAC 
GAAAGAAGAT 
CCTGGAGGGC 
AGACTTGGAA 
ACAGAACCAC 
AGATGGAGGA 
CACAACAGAG 
CATCCAGCGT 
CGGAGGCGTG 
CAGACATGTG 



CTCGGGATGA 
CTCATAAAGA 
TATGCTTCCA 
ACACTTTGCA 
ATGTTTGCAA 
ATCATCAAGG 
ATCCTGATCT 
GAGCCGGACC 
ACCCATATGA 
GGTTGTTTCT 
TACATCGGGA 
TTCCTGACCC 
TGCAGCACCA 
CCAGATGCAG 
TCTAAAACGT 
CTACAGTCAG 
GAGGTCACCA 
TACCAAGAGC 
AAGGCCATTT 
CCCTCTCGAA 
CGGCTGTCCC 
GACGCCAGCT 
CCACCCTCCT 



TCATGGCCAG 
TGTCGAGTCC 
TATTTCTCTT 
CCGTCAGGAC 
AGACCT6GAG 
ACCAGAAACT 
GCT6GCAGGC 
CAGCAGGACG 
CCATCTGGCT 
TAGCTTGGGA 
TGAGTGTCTA 
GGGACCAGCC 
TCACCCTCTG 
CAACGCAGAA 
CCACCTCGGT 
AAAACCATCG 
TGCAGCTGCA 
TCAATGACAT 
TAAAAAATCA 
CATGCAAAGA 
TCCAGCTCCC 
GTGTCAGCCC 
TCCGAGTCAT 



TGCTTTTCTC 
ATACATGAAC 
TG6CCTTGAT 
CTGGATTCTC 
A6TCCACGCC 
6CTTGTGATC 
TGTGGACCCC 
GGATATCTCC 
TGGCATCGTC 
GACCCGCAAC 
CAACGTGGGG 
CAATGTGCAG 
CCTGGTATTC 
CAGGCGATTC 
CACCAGTGTG 
CCTGCGAATG 
GGACACACCA 
CCTCAACCTG 
CCTCGATCAA 
TCCTATAGAA 
CATCCTCCAC 
CTGCGTCAGC 
GGTCTCGGGC 



1500 
X560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 



Seq ID NO: 71 Protein sequence; 
Protein Accession #: NP 005449 



30 



35 



40 



45 



MASPRRSGQP 
liMPLTKEVAK 
IKYGPNHIiMV 
AVNPAIIiKLL 
KKIiKGNDVRI 
RCI*RiCNLLAA 
GIWVIAKTLQ 
ERMGTIKFTQ 
LYSILSALTI 
GSFVSEKTFE 
VGGMI.LIDLC 
YAYKGLLMLF 
FCIVALVIIF 
NQASTSRIiBG 
GNFTBSTDGG 
HAYIiPSIGGV 



11 

I 

6RPPPPPPPP 
6SIGRGVLPA 
FGGVCPSVTS 
KHYQWKRVGT 
ILGQFDQNMA 
MEGYIGVDFE 
RAMETLH7VSS 
FQDSREVKVG 
LGMIMASAFL 
TLCTVRTWIIj 
ILICWQAVDP 
GCFLAWETRN 
CSTITIiCLVF 
LQSENHRXiRM 
KAILKNHIiDQ 
DASCVSPCVS 



21 
I 

ARUjLIiLLIiP 
VEIiAlEQIRN 
IIAESLQG^m 
LTQDVQRFSE 
AKVFCCAYEE 
PhSSKQllCrX 
RHQRIQDFNY 
EYNAVADTLE 
FFNIKNRNQK 
TVGYTTAFGA 
LRRTVEKYSM 
VSIPALNDSK 
VPKLITLRTN 
KITEIiDKDLE 
NPQLQWNTTE 
PTASPRHRHV 



31 
I 

IiLLPIiAPGAW 
ESIiLRPYFLD 
LVQLSFAATT 
VRNDIiTGVLY 
NMYGSKYQWI 
SGKTPQQYER 
TDHTIiGRIIL 
IINDTIRFQG 
LIKMSSPYMN 
MFAKTWRVHA 
EPDPAGRDIS 
YIGMSVYNVG 
PDAATQNRRF 
EVTMQLQDTP 
PSRTCKDPIE 
PPSFRVMVS6 



41 
I 

GWARGAPRPP 
LRLYDTECDN 
PVIiADKKKYP 
GEDIEISDTE 
IPGWYEPSWW 
EYNNKRSGVG 
NAMNETNFFG 
SEPPKDKTII 
NLIILGGMLS 
IFKNVKMKKK 
IRPLIiEHCEN 
IMCIIGAAVS 
QFTQNQKKED 
EKTTYIKQNH 
DINSPEHIQR 
L 



51 
I 

PSSPPIjSIMG 
AKGIiKAPYDA 
YFFRTVPSDN 
SFSNDPCTSV 
EQVHTEANSS 
PSKFHGYAYD 
VTGQWFRNG 
liEQLRKISIiP 
YASIPIjFGLD 
IIKDQKIiIiVI 
THMTIWLGIV 
FIjTRDQPNVQ 
SKTSTSVTSV 
YQEIiNDlLNL 
RXiSLQIiPILH 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



50 



55 



60 



65 



70 



75 



Seq ID NO: 72 Nueleotide Becmence; 
Nucleic Acid Accession #: NM_005795 

Coding sequence: 522-1940 (underlined sequences correspond to start and stop codons) 



GCACGAGGGA 
CAAGCTCTGC 
TTCCCACCTT 
TGAGAATATT 
AAGAAATTCT 
GACAATTGTG 
GAATAATAAA 
AAAGAAAACT 
ACAAGGTTGC 
ATTTGGGCTT 
TTATGATTCT 
TTACTAGAAA 
CCATTCAACA 
ACGATGTTGC 
ATCCATCAGA 
CAAGCAACAG 
AGACTGCACT 
TGCTTATCTC 
TACACAAAAA 
CTGCAGTGGC 
AGTTCATTCA 
ACCTACACAC 



11 
I 

ACAACCTCTC 
TAACTGAATC 
GCTTGTGGGT 
TCACAAAGAA 
TAAAGACAAT 
CATATCGTCT 
AACCCATACT 
ACTACAACTT 
TATAAAACAA 
AATGATGGAQ 
TGTTACA6CA 
TAAAATCATG 
AGCAGAAGGC 
AGCAGGAACT 
AAAAGTTACA 
AACATGGACA 
AAATTTGTTT 
GCTTGGCATA 
TCTGTTCTTC 
CAACAACCAG 
TCTTTACCTG 
ACTCATT6T6 



21 
I 

TCTCTSCAGC 
TCATCCTAAT 
AAATCTCTTC 
TTTCCTTAAG 
GTCAAATATG 
AATAATAAAA 
AGCCTATAGA 
GACAAGACTG 
GATTGCTACA 
AAAAAGTGTA 
GAATTAGAAG 
ACAGCTCAAT 
GTTTACTGCA 
6AATCAATGC 
AAGATCT6TG 
AATTATACCC 
TACCTGACCA 
TTCTTTTATT 
TCATTTGTTT 
GCCTTAGTAG 
ATGGGCT6TA 
GTGGCC6TGT 



31 

I 

AGAGAGTGTC 
TGCAGGATCA 
TGCGGAATCT 
AGCTGGACTG 
ATCCAAGAGA 
ACCCATACTA 
AAACAATATT 
CTG CAAACTT 
ACTTCTAGTT 
CCCTGTATTT 
AGAGTCCTGA 
ATGAATGTTA 
ACAGAACCTG 
AGCTCTGCCC 
ACCAAGATGG 
AGTGTAAT6T 
TAATTGGACA 
TCAA6AGCCT 
GTAACTCTGT 
CCACAAATCC 
ATTACTTTTG 
TTGCAGA6AA 



41 

I 

ACCTCCTGCT 
CATTGCAAAG 
CAGAAAGTAA 
GGTCTTGACC 
AAATGTGATT 
GCCTATAGAA 
TGAAAGATTG 
CAATTGGTCA 
T ATG TTATAC 
TCTGGTTCTC 
GGACTCAATT 
CCAAAAGATT 
GGATGGATGG 
T6ATTACTTT 
AAACTGGTTT 
TAACACCCAC 
CGGATTGTCT 
AAGTTGCCAA 
TGTAACAATC 
TGTTAGTTGC 
6ATGCTCTGT 
6CAACATTTA 



51 
I 

TTAGGACCAT 
CTTTCACTCT 
AGTTCCATCC 
CCTGGAATTT 
TGAGTCTGGA 
AACAATATTT 
CTACCACTAA 
CCACAACTTG 
AGCATATTTC 
TTGCCTTTTT 
CAGTTG6GAG 
ATGCAAGACC 
CTCTGCTGGA 
CAGGACTTTG 
A6ACATCCAG 
GAGAAAGTGA 
ATTGCATCAC 
AGGATTACCT 
ATTCACCTCA 
AAAGTGTCCC 
GAAGGCATTT 
ATGT6GTATT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
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ATTTTCTTGG CTGGGGATTT CCACTGATTC 
TATATTACAA TGACAATTGC TGGATCAGTT 
GCCCAATTTG TGCTGCTTTA CTGGTGAATC 
TCATCACCAA GTTAAAAGTT ACACACCAAG 
5 GAGCTACTCT TATCTTGGT6 CCATTGCTTG 
CTGAAGGAAA GATTGCAGA6 GAGGTATATG 
AG6GTCTTTT GGTCTCTACC ATTTTCTGCT 
GAAGAT^CTG 6AATCAATAC AAAATCCAAT 
TTC6TAGTGC GTCTTACACA GTGTCAACAA 

10 GTCCTAGTGA ACACTTAAAT GGAAAAAGCA 
CAGAAAATTT ATATAATTGA AAATAGAAGG 
AACTCAAGGA CTTGGACCCA TGACTCTGTA 
GGGAATGTCA TAAAGAAGAG CCTTCACATG 
ATCCAGCTCT ATGTGGGAAA AAAGAAATCC 

15 CACTATGCCT GATGTGACGC TACTAACCTG 
ACAATCAACT TTTCTGAGCT GGTGTAAGCC 
AAATGGCTGT AAAACTAAAC ATACATGTTG 
GACCTAGCTA AGGTCTATAA ACATGAAGGG 
TCCCATCTTG ATTGGGGCAG TTGACTTTTT 

20 TAACTAGCCT CTCAAATGGA CAATACCAGA 
CTATGAAAAG CAACTGAGTA CAATTGTTAT 
ATCTTGTGGC ATATCCATTG TGGAAACTGG 
TTCTATATCA TTAGGAAAAC ATCTTAGTTG 
TGTCTTACCA AAC2AGTGG6A GGGAATTCCT 

25 TTCTACTGTA TA7UVCAAATT AGCAATCATT 
TATTTTCTTG GAATTTTGTA AAAA6AAATT 
TTTTATTTTA TAGTCTCAAA TCAAATACAT 
TAATGCAACA AT6TGTGTAT GTTAATATCT 
AAATAGAGTC TGGAATGCTA TATTTGGTAA 

30 AGAAGTCTGT TT6AGAACTA AGAGAACAGA 
AAACACAAGG TCACTATTTT ACTGAATATA 
GGT6TGTTTG ACATATTTCT TTTTTCATTT 
TTTTAAACAA CTACTGTGAT AAATACCAAT 
AATATTACTT TACTGACTTT TACTATGTGA 

35 ATTCAAGAAA TATAAAAAAC TAGAAGGATA 
TTTAATAGAG CTACTGTATA TAATACAAAT 
AAAATTATTG TCAGATCTTA CTGAATTATT 
AACCTTGCTA ATGAATTAAA GT6AAATTTG 
CCGCTGAAAT CTCTAAAGAA CAAGAATGAC 

40 GTCATGGGTA TCTGTTTTTT AAGTGTGTCA 
CATCATAAGT TGTTTCTTAA GCTGTCAATA 
TCAAATTGCT AAGACAAATT ATCTAAATTC 
AGTACATTTA TAATTTATCT ATGCATGAAA 
ATAGCAA6CT GCCATAGAAA GGA 

45 

Seq ID NO: 73 Protein sequence; 
Protein Accession #: NM 005795 



CTGCTTGTAT ACATGCCATT GCTAGAAGCT 13 80 

CTGATACCCA TCTCCTCTAC ATTATCCAT6 1440 

TTTTTTTCTT GTTAAATATT GTACGCGTTC 1500 

CGGAATCCAA TCT6TACATG AAAGCTGTGA 1560 

GCATTGAATT TGT6CTGATT CCATG6CGAC 1620 

ACTACATCAT GCACATCCTT ATGCACTTCC 1680 

TCTTTAATGG AGAGGTTCAA GCAATTCTGA 1740 

TTGGAAACAG CTTTTCCAAC TCAGAAGCTC 1800 

TCAGTGATGG TCCAGGTTAT AGTCATGACT 1860 

TCCATGATAT TGAAAATGTT CTCTTAAAAC 1920 

ATGGTTGTCT CACTGTTTGG TGCTTCTCCT 1980 

GCCAGAAGAC TTCAATATTA AATGACTTTG 2040 

AAATTAGTAG TGTGTTGATA AGAGTGTAAC 2100 

TGGTTTGTAA TGTTTGTCAG TAAATACTCC 2160 

ACATCACCAA GTGTGGAATT GGAGAAAAGC 2220 

AGTTCCAGCA CACCATTGAT GAATTCAAAC 2280 

GGCATGATTC TACCCTTATT CSCCCCAAGA 2340 

AAAATTAGCT TTTAGTTTTA 7U\ACTCTTTA 2400 

TTTTTTCCCA GAGTGCCGTA GTCCTTTTTG 2460 

AGTGAATTAT CCCTGCTGGC TTTCTTTTCT 2520 

GATCTACTCA TTTGCTGACA CATCAGTTAT 2580 

ATGAACAGGA TGTATAATAT GCAATCTTAC 264 0 

ATGCTACAAA ACACCTTGTC AACCTCTTCC 2700 

AGCTGTAAAT ATAAATTTTG TCCCTTCCAT 2760 

TTATATAAAG AAAATCAATG AAGGATTTCT 2820 

GTGAAAAATG A6CTTGTAAA TACTCCATTA 2880 

ACAACCTATG TAATTTTTAA AGCAAATATA 2940 

GATACTGTAT CTG6GCTGAT TTTTTAAATA 3000 

ATATTTTAAA GACAACCAGA TGCCAGCATC 3060 

AACATCTATC ATAAGATATA TTTATTTTAA 3120 

TTTGTTTTGA TAACTCATAC CTTAATAATA 3180 

TGACAATGAA CTCACATTCT AATCCAGJ^ 3240 

CTGCTACTTT TATAGATTTT ACCCCATTAA 3300 

AGATATATAG CTTTGGAAAT GTCCCAGGCT 3360 

CTATATATAC CATATACAAT GCTTTAATAT 3420 

TAGGGAAATA CTTGAATATA TCATTGAGAA 3480 

GTCAGACTTT ATTAAATAAA GATAGAAGAA 354 0 
CATGGGATTC AGTTTCTCTA ATGTTATTTT . 3600 

TTCAATTAGT AAAAGTCAAT TTTGGGAAAA 3660 

ATCTGATTAA AATGGATGAA ACAAATTACT 3720 

TGTCAATAGA TGGTGAGTTC AGAACTTATT 3780 

GTAAGT^TTA ACATATAGAA TGGTCTGGTC 3840 

AAGTATTGTT TTGTTTGAAA CATGAATTTC 3900 



50 1 11 21 31 41 51 

I I I I I I 

MLYSIPHliGL MMEKKCTIiYF LVLLPFFMIL VTAELEESPE DSIQLGVTRN KIMTAQYECY 60 

QKIMQDPIQQ AEGVYCNRTW DGWLCWNDVA AGTESMQLCP DYFQDFDPSE KVTKICDQD6 120 

NWFRHPASNR TWTNYTQCNV NTHBKVKTAIi NLFYIiTIIGH GLSIASLLIS IiGIFFYFKSL 180 

55 SCQRITIjHKN liFFSFVCNSV VTIIHIiTAVA NNQALVATNP VSCKVSQFIH LYLMGCNYFW 240 

MLCEGIYLHT LIWAVFAEK QHIiMWYYFLG WGFPLIPACI HAIARSLYYN DNCWISSDTH 300 

IiIjYIIHGPIC AAIjIjVNIjFPL LNIVRVLITK IjKVTHQAESN LYMKAVRATIj ILVPLLGIEF 360 

VLIPWRPEGK IAEEVYDYIM HIIjMHFQGLL VSTIFCFFNG EVQAILRRNW NQYKIQFGNS 420 
FSHSEALRSA SYTVSTZSDG PGYSHDCPSE HI^GKSIHDI ENVLLKPENIi yn 

60 

Seq ID NO: 74 Nucleotide sequence; 
Nucleic Acid Accession #: NM_000450.1 

Coding sequence: 117.. 1949 (underlined sequences correspond to start and stop codons) 

65 1 11 21 31 41 51 

I I I I I I 

CCTGAGACAG AGGCAGCAGT GATACCCACC TGAGAGATCC TGTGTTTGAA CAACTGCTTC 60 

CCAAAACGGA AAGTATTTCA AGCCTAAACC TTTGGGTGAA AAGAACTCTT GAAGT CATGA 120 

TTGCTTCACA GTTTCTCTCA GCTCTCACTT TGGTGCTTCT CATTAAAGAG AGTGGAGCCT 180 

70 GGTCTTACAA CACCTCCACG GAAGCTATGA CTTATGATGA GGCCAGTGCT TATTGTCAGC 240 

AAAG6TACAC ACACCTGGTT GCAATTCAAA ACAAAGAAGA GATTGAGTAC CTAAACTCCA 300 

TATTGAGCTA TTCACCAAGT TATTACTGGA TTGGAATCAG AAAAGTCAAC AATGTGTGGG 360 

TCTGGGTAGG AACCCAGAAA CCTCTGACAG AAGAAGCCAA GTUVCTGGGCT CCAGGTGAAC 420 

CCAACAATAG GCAAAAAGAT GAGGACTGCG TGGAGATCTA CATCAAGAGA GAAAAAGATG 480 

75 TGGGCATGTG GAATGATGAG AGGTGCAGCA A6AA6AAGCT TGCCCTATGC TACACAGCTG 540 

CCTGTACCAA TACATCCT6C A6TGGCCACG GTGAAT6TGT A6AGACCATC AATAATTACA 600 
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CTTGCAAGTG TGACCCTGGC TTCAGTGGAC TCAA6TGTGA GCAAATTGTG AACTGTACAG 660 

CCCTGGAATC CCCTGAGCAT GGAAGCCTGG TTTGCAGTCA CCCACTGGGA AACTTCAGCT 720 

ACAATTCTTC CTGCTCTATC AGCT6TGATA GGG6TTACCT GCCAAGCAGC ATGGAGACCA 780 

TGCAGTGTAT 6TCCTCTGGA GAATGGAGTG CTCCTATTCC AGCCTGCAAT GTGGTTGAGT 840 

5 6TGATGCT6T GACAAATCCA GCCAATGGGT TCGTGGAATG TTTCCAAAAC CCTGGAAGCT 900 

TCCCATG6AA CACAACCTGT ACATTTGACT GTGAAGAAGG ATTTGAACTA ATGGGAGCCC 960 

AGAGCCTTCA GTGTACCTCA TCTGGGAATT GG6ACAACGA GAAGCCAACG T6TAAA6CTG 1020 

TGACATGCAG GGCCGTCCGC CAGCCTCAGA ATGGCTCTGT GAGGTGCAGC CATTCCCCTG 1080 

CTGGAGAGTT CACCTTCAAA TCATCCTGCA ACTTCACCTG TGAGGAAGGC TTCATGTTGC 1140 

10 A6GGACCAGC CCAGGTTGAA TGCACCACTC AAGGGCAGTG GACACAGCAA ATCCCAGTTT 1200 

GTGAAGCTTT CCAGTGCACA GCCTTGTCCA ACCCCGAGCG AGGCTACATG 7VATTGTCTTC 1260 

CTAGTGCTTC TGGCAGTTTC CGTTATGGGT CCAGCTGTGA GTTCTCCTGT GAGCAGGGTT 1320 

TTGTGTTGAA GGGATCCAAA AGGCTCCAAT GTGGCCCCAC AGGGGAGTGG GACAACGAGA 1380 

AGCCCACATG TGAAGCTGTG AGATGCGATG CTGTCCACCA GCCCCCX3AAG GGTTTGGTGA 1440 

15 GGTGTGCTCA TTCCCCTATT GGAGAATTCA CCTACAAGTC CTCTTGTGCC TTCAGCTGTG 1500 

AGGAGGGATT TGAATTATAT GGATCAACTC AACTTGAGTG CACATCTCAG GGACAATGGA 1560 

CAGAAGAGGT TCCTTCCTGC CAAGTGGTAA AATGTTCAAG CCTGGCAGTT CCGGGATUVGA 1620 

TCAACATGAG CTGCAGTGGG GAGCCCGTGT TTGGCACTGT GTGCAAGTTC GCCTGTCCTG 1680 

AAGGATGGAC GCTCAATGGC TCTGCAGCTC GGACATGTGG AGCCACAGGA CACTGGTCTG 1740 

20 GCCTGCTACC TACCTGTGAA GCTCCCACTG AGTCCAACAT TCCCTTGGTA GCTGGACTTT 1800 

CTGCTGCTGG ACTCTCCCTC CTGACATTAG CACCATTTCT CCTCTGGCTT CGGAAATGCT 1860 

TACGGAAAGC AAAGAAATTT GTTCCTGCCA GCAGCTGCCA AAGCCTTGAA TCAGACG6AA 1920 

GCTACCAAAA GCCTTCTTAC ATCCT TTAAG TTCAAAAGAA TCAGAAACAG GTGCATCTGG 1980 

GGAACTAGAG GGATACACTG AAGTTAACAG AGACAGATAA CTCTCCTCGG GTCTCTGGCC 2040 

25 CTTCTTGCCT ACTATGCCAG ATGCCTTTAT GGCTGAAACC GCAACACCCA TCACCACTTC 2100 

AATAGATCAA A6TCCAGCAG GCAAG6ACGG CCTTCAACTG AAAAGACTCA GTGTTCCCTT 2160 

TCCTACTCTC A6GATCAAGA AAGTGTTGGC TAATGAAGGG AAA6GATATT TTCTTCCAAG 2220 

CAAAGGTGAA GAGACCAAGA CTCTGAAATC TCAGAATTCC TTTTCTAACT CTCCCTTGCT 2280 

CGCTGTAAAA TCTTGGCACA 6AAACACAAT ATTTTGTGGC TTTCTTTCTT TTGCCCTTCA 2340 

30 CAGTGTTTCG ACAGCTGATT ACACAGTTGC TGTCAT7VAGA ATGAATAATA ATTATCCA6A 2400 

GTTTAGAGGA AAAAAATGAC TAAAAATATT ATAACTTAAA AAAATGACAG ATGTTGAATG 2460 

CCCACAGGCA AATGC3VTGGA GGGTTGTTAA TGGTGCAAAT CCTACTGAAT GCTCTGTGCG 2520 

AGGGTTACTA TGCACAATTT AATCACTTTC ATCCCTATGG GATTCAGTGC TTCTTAAAGA 2580 

GTTCTTAAGG ATTGTGATAT TTTTACTTGC ATTGAATATA TTATAATCTT CCATACTTCT 264 0 

35 TCATTCAATA CAAGTGTGGT AGGGACTTAA AAJ^CTTGTA TATGCTGTCA ACTATGATAT 2700 

GGTAAAAGTT ACTTATTCTA GATTACCCCC TCATTGTTTA TTAACAAATT ATGTTACATC 2760 

TGTTTTAAAT TTATTTCAAA AAGGGAAACT ATTGTCCCCT AGCAAGGCAT GATGTTAACC 2820 

AGAATAAAGT TCTGAGTGTT TTTACTACAG TTGTTTTTTG AAAACATGGT AGAATTGGAG 2880 

AGTAAA7U\CT GAATGGAAGG TTTGTATATT GTCAGATATT TTTTCAGAAA TATGTGGTTT 2940 

40 CCACGATGAA AAACTTCCAT GAGGCCAAAC GTTTTGAACT AATAAAAGCA TAAATGCAAA 3000 

CACACAAAGG TATAATTTTA TGAATGTCTT TGTTGGAAAA GAATACAGAA AGATGGATGT 3060 

GCTTTGCATT CCTACAAAGA TGTTTGTCAG ATGTGATATG TAAACATAAT TCTTGTATAT 3120 

TATGGAAGAT TTTAAATTCA CAATAGAAAC TCACCATGTA AAAGAGTCAT CTGGTAGATT 3180 

TTTAACGAAT GAAGATGTCT AATAGTTATT CCCTATTTGT TTTCTTCTGT ATGTTAGGGT 3240 

45 GCTCTGGAAG AGAGGAATGC CTGTGTGAGC AAGCATTTAT GTTTATTTAT AAGCAGATTT 3300 

AACAATTCCA AAGGAATCTC CAGTTTTCAG TTGATCACTG GCAATGAAAA ATTCTCAGTC 3360 

AGTAATTGCC AAAGCTGCTC TAGCCTTGAG GAGTGTGAGA ATCAAAACTC TCCTACACTT 3420 

CCATTAACTT A6CATGTGTT GAAAAAAAAA GTTTCAGAGA AGTTCTGGCT GAACACT6GC 3480 

AACGACAAA6 CCAACAGTCA AAACAGAGAT GTGATAAGGA TCAGAACAGC AGAG6TTCTT 3540 

50 TTAAA6GGGC A6AAAAACTC TGGGAAATAA GAGAGAACAA CTACTGTGAT CAGGCTATGT 3600 

ATGGAATACA 6TGTTATTTT CTTTGAAATT GTTTAAGTGT TGTAAATATT TATGTAAACT 3660 

GCATTAGAAA TTAGCTGTCT GAAATACCAG TGTGGTTTGT GTTTGAGTTT TATTGAGAAT 3720 

TTTAAATTAT AACTT7U\AAT ATTTTATAAT TTTTA7UVGTA TATATTTATT TAAGCTTATG 3780 
TCAGACCTAT TTGACATAAC ACTATAAAGG TTGACAATAA ATGTGCTTAT GTTT 

55 

Seq ID NO: 75 Protein sequence; 
Protein Accession #: NP_00044l 

1 11 21 31 41 51 

60 t I 1 I 1 I 

MIASQFLSAIi TLVLLIKESG AWSYNTSTEA MTYDEASAYC QQRYTHLVAI QNKEEIEYIjN 60 

SILSYSPSYY WIGIRKVNNV WVWVGTQKPL TEEAKNWAPG EPNNRQKDBD CVEIYIKREK 120 

DVGMWNDERC SKKKIALCYT AACTNTSCSG HGECVETINN YTCKCDPGPS GIiKCEQZVNC 180 

TAIiESPEHGS LVCSHPIiGNF SYNSSCSISC DRGYLPSSME TMQCMSS6EW SAPIPACNW 240 

65 ECDAVTNPAN GFVECFQNPG SFPWNTTCTF DCBEGPEIiMG AQSIiQCTSSG NWDNEKPTCK 300 

AVTCRAVRQP QNGSVRCSHS PAGBFTFKSS CWFTCEEGFM IjQGPAQVECT TQGQWTQQIP 360 

VCEAFQCTAIi SNPERGYMNC LPSASGSFRY GSSCEFSCEQ GFVIiKGSKRI* QCGPTGBWDN 420 

EKPTCBAYRC DAVHQPPKGL VRCAHSPIGE FTYKSSCAPS CBEGFELYGS TQIiBCTSQGQ 480 

WTEBVPSCQV VKCSSLAVPG KINMSCSGEP VFGTVCKFAC PEGWTLNGSA ARTCX3ATGHW 540 

70 SGI»IiPTCBAP TESNIPI.VAG LSAAGLSLLT lAPFIiIiWLRK CLRKAKKFVP ASSCQSIiESD 600 
GSYQKPSYIIi 

Seg ID NO: 76 Wucleotid e aemience; 
. 75 Nucleic Acid Accession #: NM_031439 

Coding seguence: 6 9.. 12 3 5 (underlined sequences correspond to start and stop codons) 
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.1 I I I I I 

D CCCGACCCGT GCGAGGGCCA GGTCCGCGCX: TGCCCCGCCA GGCGAA6CGA GGCGACCCGC 60 

GTGCGGCCAT GGCTTCGCTG CTGGGAGCCT ACCCTTGGCC CXSAGGGTCTC GAGTGCCCGG 120 

CCCTGGACGC CGAGCTGTCG GATGGACAAT CGCCGCCGGC CGTCCCCCGG CCCCCGGGGG 180 

ACAAGGGCTC CGA6A6CCGT ATCC6GC66C CCATGAAC6C CTTCAT6GTT T6GGCCM6G 240 

ACGAGAGGAA A0G6CTGGCA GTGCAGAACC CGGACCTGCA CAACGCCGA6 CTCAGCAAGA 300 

10 TGCTGGGAAA GTCGT6GAAG GCGCTGACGC TGTCCCAGAA GAGGCC6TAC GTG6ACGAGG '360 

CGGAGCGGCT GCGCCTGCAG CACATGCAGG ACTACCCCAA CTACAAGTAC CGGCCGCX3CA 420 

GGAAGAAGCA GGCCAAGC6G CTGTGCAAGC GCGTGGACCC GGGCTTCCTT CTGAGCTCCC 480 

TCTCCCGGGA CCAGAACGCC CTGCCGGAGA AGAGAAGCGG CAGCCGGGGQ GCX3CTGGGGG 540 

AGAAGGAGGA CAGGGGTGAG TACTCCCCCG GCACTGCCCT GCCCAGCCTC CGGGGCTGCT 600 

15 ACCACGAGGG GCCGGCTGGT GGTGGCGGCG GCGGCACCCC GAGCAGTGTG GACACGTACC 660 

CGTACGGGCT GCCCACACCT CCTGAAATGT CTCCCCTGGA CGTGCTGGAG CCGGAGCAGA 720 

CCTTCTTCTC CTCCCCCTGC CAGGAGGAGC ATGGCCATCC CCGCCGCATC CCCCACCTGC 780 

CAGGGCACCC GTACTCACCG GAGTACGCCC CAAGCCCTCT CCACTGTAGC CACCCCCTGG 840 

GCTCCCTGGC CCTTGGCCAG TCCCCCGGCG TCTCCATGAT GTCCCCTGTA CCCGGCTGTC 900 

20 CCCCATCTCC TGCCTATTAC TCCCCGGCCA CCTACCACCC ACTCCACTCC AACCTCCAAG 960 

CCCACCTGGG CCAGCTTTCC CCGCCTCCTG AGCACCCTGG CTTCGACGCC CTGGATCAAC 1020 

TGAGCCAGGT GGAACTCCTG GGGGACATG6 ATCGCAATGA ATTCGACCAG TATTTGAACA 1080 

CTCCTGGCCA CCCAGACTCC GCCACAGGGG CCATGGCCCT CAGTGGGCAT GTTCCGGTCT 1140 

CCCAGGTGAC ACCAACGGGT CCCACAGAGA CCAGCCTCAT CTCCGTCCTG GCTGATGCCA 1200 

25 CGGCCACGTA CTACAACAGC TACAGTGTGT C ATAG AGCTG GAGGCGCCCC GTCCGGTCAG 1260 

CCCTCGCGCC CTCTCCTTCT TGTGCCTTGA GTGGCAGAGG AGCCGTCCAG CCACACCAGC 1320 

TTTCCTCCCA CCX3CTCAGGG CAGGGAGGTC TGAACTGCGG CCCCAGAGCC TTTGGCCTAA 13 80 

GCTGGACTCT CCTTATCCGA GTGCCGCCTC TATCCCCTTC CCCACGTTCC AGCCCCTGCA 144 0 

GCCCACATTT TAAGTATATT CCTTCAAGTG AGTTTTCCTC CA6CCCCT6A GAGTTGCTGT 1500 

30 CTCCCAGTGG AATGTTCACT GACGTCTTTT CTTGGTAGCC ATCATCGAAA CTAATGGG66 1560 

GACA6ACTTG ATAGCCAAGG TCCCTTCTGG TCCAGTTTTC TGATTTAGQ6 TTCTCTCAAG 1620. 

ATTAATAAAG GAAGATGGGG AAATTTGACT. CATTAATGAG CTCGCTAACC TACGATCTGG 1680 

TGATAATTTT GTGT6CACAG CCCAAGGACC AC6AGGCTTT CTGCACTTTC TGCACCCCCT 1740 

TCCAAA6TGA CCACAAAATT TCATU^GGGAC TCATACAATT TGAGAAAAAA CyMSTCAACCT 1800 

35 GATTTGAGAA ATTAACCAGT ATGGCTAACT ATATCACAGA AAATGGGATT GAGTTAAAAC 1860 

TATTTTATTT TAAATATACA TTTTAAAGCA GTTCTTTTTT TTTGTTAATT TGTTTATTAT 1920 

ACACACACTT CAAGAGCCAC CGCGCCCAGC CTACATTTAT AATTTTCATT CTCTTTTACC 1980 

TATAAAATTC AGTGTATTAG TTTCATTACA TAGGAGAAAT TATATTTCTA AACATTTTAT 2040 
6AT6TTTAAA AACAAAACAG GCTGTTGTAA AAAAAAAAAA AAAAAAAAA 

40 



45 



55 



60 



Seq ID NO: 77 Protein aeouence; 
Protein Accession #: NP 113627 



11 21 31 41 51 



MASLXiGAYPW PEGIiECPALD AELSDGQSPP AVPRPP6DKG SESRIRRPMN AFMVWAKDER 60 

KRIiAVQNPDL HNAELSKMLG KSWKALTLSQ KRPYVDEAER LRLQHMQDYP NYKYRPRRKK 120 

50 QAKRLCKRVD P6FLLSSLSR DQNTlIiPEKRS GSRGALGEKB DRGEYSPGTA LPSLR6CYHE 180 

GPAGGGGGGT PSSVDTYPYG LPTPPEMSPL DVIjEPEQTPF SSPCQEEHGH PRRIPHItPGH 240 

PYSPEYAPSP IiHCSHPLGSL ALGQSPGVSM MSPVPGCPPS PAYYSPATYH PLHSNLQAHL 300 

GQIiSPPPEHP GFDALDQLSQ VELLGDMDRN EFDQYLNTPG HPDSATGAMA LSGHVPVSQV 360 
TPTGPTETSL ISVLADATAT YYNSYSVS 



Seq ID NO: 78 Nucleotide sequence: 
Nucleic Acid Accession #: XM_035787 

Coding sequence: 32 9.. 94 9 (underlined sequences correspond to start £uad stop codons) 
1 11 21 31 41 51 



TGCCCCGCCC CGCTCCCCAG CGCCCCGGAA GTGATCTGTG GCGGCTGCTG CAGAGCCGCC 60 

AGGAGGAGGG TGGATCTCCC CAGAGCAAAG CGTCGGAGTC CTCCTCCTCC TTCTCCTCCT 120 

65 CCTCCTCCTC CTCCTCCAGC CGCCCAGGCT CCCCCGCCAC CCGTCAGACT CCTCCTTCGA 180 

CCGCTCCCGG CGCGGGGCCT TCCAGGCGAC AAGGACCGAG TACCCTCCGG CCGGAGCCAC 24 0 

GCAGCCGCGG CTTCCGGAGC CCTCGGGGCG GCGGACTGGC TCGCGGTGCA GATTCTTCTT 300 

AATCCTTTGG TGAAAACTGA GACACAAAAT gGCTGCAAAT AAGCCCAAGG GTCAGAATTC 360 

TTTGGCTTTA CACAAAGTCA TCATGGTGGG CAGTGGTGGC GTGGGCAAGT CAGCTCTGAC 420 

70 TCTACAGTTC ATGTACGATG AGTTTGTGGA GGACTATGAG CCTACCAAA6 CAGACAGCTA 480 

TCG6AAGAAG GTAGTGCTAG ATGGGGAG6A AGTCCAGATC GATATCTTAG ATACAGCTGG 540 

GCAGGAG6AC TACGCTGCAA TTAGAGACAA CTACTTCGGA A6TGGG6AG6 GGTTCCTCTG 600 

TGTrTTCTCT ATTACAGAAA TGGAATCCTT TGCAGCTACA GCTGACTTCA GGGAGCAGAT 660 

TTTAAGAGTA AAAGAAGATG AGAATGTTCC ATTTCTACTG GTT6GTAACA AATCAGATTT 720 

75 AGAAGATAAA AGACAGGTTT CTGTAGAA6A GGCAAAAAAC AGAGCTGAGC AGT6GAATGT 780 

TAACTACGT6 GAAACATCT6 CTAAAACACG AGCTAATGTT GACAA6GTAT TTTTTGATTT 840 
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10 



15 



20 



25 



30 



35 



AiVTGAGAGAA 
GAGGAAAAGT 
ACTCCTTTCT 
TTMTTGACT 
TTG6AACTGC 
A6CAAAGTTC 
AAGCTTGTGT 
TG6GAGGAAA 
AAATGTCTTG 
CC7VCTTTTTT 
CATATAGTTA 
TTAGAAAAGT 
GAAAAATATC 
CCATGTGTCC 
AAGTAACTGT 
AAAATAAGAT 
AGGGCCAGCA 
TTTGAATGAA 
GTGAGCATGT 
AGTGCTAATG 
ATTTCTAAAT 
TTTCTGTTTC 
CCATCAAACC 
TCACCATT6T 
CAGTGG6AAA 
AGTTGATTCA 
TGTGTTCAGG 
ACTCACTCTT 
TGAGATGTAC 
TACCAATGTA 
TTTAACCAGG 
GGTACTGAGG 
ACTACCCGAT 



ATTC6AGCGA 
TTAGCCAAGA 
TATCTTGACC 
GAAATTACTT 
AATG2UUIGTC 
AACTTATTTC 
TTCTTGGGCA 
GGT6ACTTCC 
GTCTTCTACT 
TAACCATTAT 
AACTGAGAGT 
GGTGTATIACT 
TGGTTATATC 
TGGTGAGAAA 
CCGCTAGAAG 
TCCAGAGCTC 
TATATACTTG 
CCCTCCTTTT 
AAGTGTTAAG 
CATTTTGCAC 
TTATATTCAT 
TGTTTATAAT 
TGGGTATAGT 
GT6GT6TACC 
TATGCCACTG 
ACAAAGTATT 
CATTCCAGGT 
TAGATATTTA 
ATCTTTCATT 
ACACTGGCCA 
GGTCCTAACC 
TGCTATGAAG 
TTGTTTATTT 



GAAAGATGGA 
GAATCAGAGA 
ATACTAATAA 
TAACATTTTG 
AAATTTACTT 
ATAATTGCCT 
GTCTTTCTTG 
TCTGGTGTTT 
GCCTTGAAAA 
TATGCAAAAT 
AATTCATCTG 
TGTACATGGA 
ATTCTGGGTG 
ATATATGCCT 
TCTGTCCAAA 
TTTGATCGCT 
CAAGATAATT 
CTCTGAGATT 
TTTTTAATCT 
TAGAACGCTT 
AAAGTTACAG 
GAAGAACACT 
GCAGAAAACG 
T6CTGGAAGA 
ACCGATTTTT 
TTTTTCTTTT 
AACAG6TGTG 
CATCCAGCTT 
TCX3TATTTCT 
GCX3GGCCCAG 
ACTAACATTG 
CCAACTGACA 
GCAATTTGAG 



AGACAGCAAA 
AAGATGCTGC 
ATATAATTTA 
GAAATT6TTG 
TAAAAAGAAA 
ACATTTATCA 
AAATTGAAGA 
ATTATAAAGC 
ATGACAATTG 
TTAGAAGAAA 
TGAATCTGCT 
ATTTTTTGAA 
TGTTCTTACT 
GGCACAGCTT 
TTTT^AAATGT 
TTTAATAAAC 
TTCAGCTGCA 
CTGGTCCCTG 
GGGAGCAGGG 
CGGGTU^AATA 
TTTGATACAG 
GTAGCTACAT 
TGGCACACAC 
ATTCTAGCAT 
TTTTTTTCCT 
TTCTCAGTCC 
TATGTAAAGT 
CTCATGTTAA 
CATAGGCTAT 
CAATCTCCAT 
TGACTTTGCT 
AAGATGCATC 
CCATTTAAAG 



GAAAAGAATG 
ATTTT ATAAT 
TAAGCATTGC 
TATATCACTA 
TTAATATGGC 
TGGTCCTGAA 
GGTGAAATGG 
TTAT^TTTTA 
TGAACATGAT 
AGTTATTGGC 
TTAATTACCT 
TATGCCTTAA 
GACACCAGGG 
TTGTATAGAA 
GTGCCATATT 
TGCAAGTTCA 
AGGATTCAGC 
GAAATCCCTT 
CATAGGAAGA 
TTCATGCTTG 
GAATTATTAG 
TTTCAGAAGT 
T6ACCACACA 
GCTACTTGGG 
CTTTGCAGTG 
TAATTTGAAC 
TAAAAATAGG 
ATATTTGTCC 
GCCATGTGCG 
GTGTACTTAT 
TTGAGACCTT 
ACGTGTCTTA 
ACCAATAAAC 



GAAAAAAGAA 
CAAAGCCCAA 
CATTGAAGGC 
AAAGCAT6AA 
TTCACCAAGA 
TGTAGCGTGT 
GGGTGGGGAG 
TATCATTTTA 
AGTTAAACTA 
ATGGTTGTTG 
GGTGAGTAAC 
TTTAGAAACT 
GTCCGCTGCC 
AATTCTTGAG 
CTGGTTCTTG 
TTTTAAATGA 
ACCAGTTATG 
TCTGCTAGTG 
AAATGTCAGT 
CCATCTGTTC 
GAGTAATTCT 
TAACATCAAG 
TTAGGCTGTG 
GACATAATTT 
GGGCTAGGAC 
AGGTCAAAGA 
CTTTTTAGGA 
TTAAAGGGTT 
GAATTCAAGT 
TACAGTCTTA 
TCCTCTCCTG 
GGCTGATGCC 
TTCCTTTTTT 



900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1B60 
1920 
1980 
2040 
2100 
2160 
2220 
2260 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 



Seg ID NO: 79 Protein sequence! 
Protein Accession #: XP 035787 
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MAANKPKGQN SIALHKVIMV GSGGVGKSAIi TLQFMYDEFV EDYEPTKADS YRKKWLDGE 60 

EVQIDILDTA GQEDYAAIRD NYFRSGEGFL CVFSITEMES FAATADFREQ ILRVKEDENV 120 

PFLliVGNKSD LEDKRQVSVE EAKNRAEQWN VNYVETSAKT RANVDKVFFD LMREIRARKM 180 
EDSKEKNGKK KRKSLAKRIR ERCCIL 

Seg ID NO: 80 Nucleotide sequence: 
Nucleic Acid Accession #: NM_003467 

Coding seguence: 8 9.. 114 7 (underlined seguences correspond to start and stop codons) 



GTTTGTTGGC 
CACCGCATCT 
CTACACCGAG 
AGAAAATGCT 
TGGCATTGTG 
CATGACGGAC 
TCCCTTCTGG 
AGTCCATGTC 
TCTGGACCGC 
GGCTGAAAAG 
CTTCATCTTT 
CAATGACTTG 
TGGTATTGTC 
CCACCAGAAG 
TTGGCTGCCT 
GCAAGGGTGT 
TTTCTTCCAC 
CTCTGCCCAG 
AGG/U^CGA 
CAGCTAACAC 
ACATTTTTCA 

TCTTTCATAT 
CTCGTGGTAG 
AAGCTAGAAA 



11 
I 

TGCGGCAGCA 
GGAGAACCAG 
GAAATGGGCT 
AATTTCAATA 
GGCAATGGAT 
AAGTACAGGC 
GCAGTTGATG 
ATCTACACAG 
TACCTGGCCA 
GTGGTCTATG 
GCCAACGTCA 
TGGGTGGTTG 
ATCCTGTCCT 
C6CAAGGCCC 
TACTACATTG 
GAGTTTGAGA 
TGTTGTCTGA 
CACGCACTCA 
GGTGGACATT 
AGATGTAAAA 
GATATAAAAG 
TTTCTTTAGT 
TGATGTGTGT 
GACT6TAGAA 
TGATCCCCAG 



21 
I 

GGTAGCAAAG 
CGGTTACCAT 
CAGGGGACTA 
AAATCTTCCT 
TGGTCATCCT 
TGCACCTGTC 
CCGTGGCAAA 
TCAACCTCTA 
TCGTCCACGC 
TTGGCGTCTG 
GTGAGGCAGA 
TGTTCCAGTT 
GCTATTGCAT 
TCAA6ACCAC 
GGATCA6CAT 
ACACTGTGCA 
ACCCCATCCT 
CCTCTGTGAG 
CATCTGTTTC 
GACTTTTTTT 
ACTGACCAAT 
TTTTGTGAAG 
CTAGGCA6GA 
AAGGGAACTG 
CTGTTTATGC 



31 
I 

TGACGCCGAG 
GGAGGGGATC 
TGACTCCATG 
GCCCACCATC 
GGTCATGGGT 
AGTGGCCGAC 
CTGGTACTTT 
CAGCAGTGTC 
CACCAACAGT 
GATCCCTGCC 
TGACAGATAT 
TCAGCACATC 
TATCATCTCC 
AGTCATCCTC 
CGACTCCTTC 
CAAGTGGATT 
CTATGCTTTC 
CAGA6GGTCC 
CACTGAGTCT 
TATACX3ATAA 
ATT6TACAGT 
TTTAATTGAC 
CCTGTGGCCA 
AACATTCCAG 
ATA6ATAATC 



41 

I 

GGCCTGAGTG 
AGTATATACA 
AAGGAACCCT 
TACTCCATCA 
TACCAGAAGA 
CTCCTCTTTG 
GGGAACTTCC 
CTCATCCTGG 
CAGAGGCCAA 
CTCCTGCTGA 
ATCTGTGACC 
ATG6TTGGCC 
AAGCTGTCAC 
ATCCTGGCTT 
ATCCTCCTGG 
TCCATCACCG 
CTTGGAGCCA 
AGCCTCAAGA 
GAGTCTTCAA 
ATAACTTTTT 
TTTTATTGCT 
TTATTTATAT 
A6TTCTTAGT 
AGCGTGTAGT 
TCTCCATTCC 



51 
I 

CTCCAGTAGC 
CTTCAGATAA 
GTTTCCGTGA 
TCTTCTTAAC 
AACTGAGAAG 
TCATCACGCT 
TATGCAAGGC 
CCTTCATCAG 
GGAAGCTGTT 
CTATTCCCGA 
GCTTCTACCC 
TTATCCTGCC 
ACTCCAAGGG 
TCTTCGCCTG 
AAATCATCAA 
AGGCCCTAGC 
AATTTAAAAC 
TCCTCTCCAA 
GTTTTCACTC 
TTTAAGTTAC 
TGTTGGATTT 
AAATTTTTTT 
TGCTGTATGT 
GAATCACGTA 
CX3TGGAACGT 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
760 
640 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
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BNSOOCID: <WO 0207M92A2_L> 



wo 02/079492 



PCT/US02/04915 



TTTTCCTGTT CTTAAGACGT GATTTTGCTG TAGAAGATGG CACTTATAAC CAAAGCCCTiA 1560 
AGTGGTATAG AAATGCTGGT TTTTCAGTTT TCAGGAGTG6 GTTGATTTCA 6CACCTACAG 1620 
TGTACAGTCT TGTATTAAGT TGTTAATAAA AGTACATGTT AAACTTACTT AGTGTTATG 

5 Seq ID NO; 81 Protein sequence: 
Protein Accession #; NP_003458 



MEGISIYTSD NYTEEMGSGD YDSMKEPCFR 
LVMGYQKKLR SMTDKYRIiHL SVADLIiPVIT 
YSSVLILAFI SLDRYIiAIVH ATNSQRPRKL 
DDRYICDRPY PNDLWVWFQ PQHIMVGLIL 
15 TVILILAPFA CWIiPYYIGIS IDSFIIiLEII 
LYAFLGAKPK TSAQHALTSV SRGSSLKILS 



31 41 51 

I • I I 

EENANPNKIF LPTIYSIIPI* TGIVGNGLVl 60 
LPFWAVDAVA NWYFGNPIiCK AVHVIYTVNL 120 
liAEKWYVGV WIPAIiLLTIP DFIPANVSEA 180 
PGIVILSCYC IIISKIiSHSK GHQKRKALKT 240 
KQGCEFEIOTV HKWISITEAL AFPHCCIiNPl 300 
KGKRGGHSSV STESESSSFH SS 
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Seq ID NO: 82 Nucleotide sequence: 
Nucleic Acid Accession #: NM_014959 

Coding sequence: 3 14.. 16 09 (underlined sequences correspond to start and stop codons) 



CTGGTTCTCA 
GGGAAAOUVG 
GT6CAAGCAT 
ATTTGACAAA 
CCX3A6ACG6G 
TTTTTCCAAG 
TGTCAACTAT 
TTCCTCTTAT 
TCTGGGGCCT 
CGTTTGGTTC 
AAGGGATGAG 
CCTGCAGCAC 
AGAGGAGGCT 
CGTCTCCTGG 
CCGGGTGGAG 
GCTGCGGATC 
TTATCACCCC 
GCTAACAAAG 
GCCCCCAATG 
GAAAGTAAT6 
CTCAAAATTC 
ACAT6GGACT 
ATCA6CCCCT 
A6CCAG6AT6 
TGAGAAT6AG 
GCTGAGCATG 
TGAAA6GGAC 
TAGGTAGTCT 
TCATTGATTT 
AGGACAGACT 
AATGTCTGAA 
CACTATTTTA 
GTGTTTATGA 
CCTATGGACA 
AAGCTTCTCA 
GTTAGGACTT 
TGAAAGAGAG 
GTGTGTAGCC 
CTGAGATGAA 
CAGCTTATTC 
ACATTT6GAT 
ATGTTTTCGT 
TAGTA6ACAC 
AGTCTCTTTT 
ATTTATCCTC 
CCACTTATU^Q 
TAGGATATA6 
ATATATTTTC 
TTATGTTTTT 
GTTCTGCAAA 
TAGGTACA6A 
TCAGTGATGT 



11 
I 

ACTTCTTTTG 
CTCTCAGGAC 
CTGGGCCATC 
TGGAAAAAAA 
TATACAGGGA 
AA GATGA TGA 
CTGGGGGGGA 
GCTTCTAAAG 
G/^GGAAATG 
CCCACTGCTG 
GTCACAGTGA 
CATGAACAGT 
GTCGCCGAAA 
TTTCTCGTTG 
CCTTTCTATG 
GCCAGTGGGA 
CACCCCGAAG 
GCGATAGATG 
GAACCCCTGA 
CCCAAGGAGT 
TAT6CTGGGC 
TTG6TGTGG6 
CCTCCTTTCT 
GGGGACCTGA 
AAGGAGCTGG 
GTG6AGAAGA 
CCTTACCTCG 
6GAAGAGAGA 
CAGTGTTCAA 
TGTAACCTGG 
GAAGGTAGTA 
TCCATTGACA 
AGGATGGGGC 
CTCCGTTTGA 
GCTGAGGACA 
TAACACTTTA 
CACAAAAATG 
TGCTACAAGG 
ATGTGGTAAA 
TGAGACAGAC 
TCTACCATAG 
GGCACACATA 
CATCTGGTTG 
AACTG6AAGA 
AGTCAGCC716 
T6GAATCTAG 
TTATCTTCTA 
TACTTTTAAT 
CCAGATATTT 
TTTCCACTTC 
TTCTCTTATT 
TTTTTGCTTG 



21 
I 

AAATAATGTT 
TTCCGGTCGC 
TTCAATGGTA 
GGAGTGTCCA 
GCTACCCTGT 
GACAGA6GCA 
CATTCCCAGG 
TCTGTTTTGA 
TGGATGTTGA 
GCTGGTATCT 
CGATTGCGTT 
GGCTGGTGGG 
TCCACCTCCC 
CCCATTTTAA 
CTGTCCTGGA 
CTCGCCTCTC 
ATATTAAGTT 
ATGAGGAAGA 
ACTTTGGTTC 
T6AAATTGTC 
AGATGAAGGA 
ATACT6A6GT 
CAG6T6CA6C 
AAGGGGTGCT 
T66AGCAGGA 
AAGG6GACCT 
TGTCCTATCT 
ATCCAGCGTT 
GACAGAAGAA 
CATGTACCTA 
ATATTCCTTT 
TGATTCTTGA 
CTGGAAAGGC 
AGTATCACCT 
CTCAAGGCAT 
TCTATGGCTA 
GGAGAAAATG 
AGTTGTTGGG 
TCAACTCCAC 
ATTCCTGGCA 
ACTCTGTCAT 
TTTCCATCCT 
AGTCAGTTTT 
TTTCAATTAC 
TTTGTTATGT 
6CACTTTATC 
CATAATCTTT 
CACTCAGAAG 
ACCATTTCTG 
TTCTGATAGA 
TTTTGCTTCC 
TAGTATTTTT 



31 
I 

CATAGA6AA6 
CATGATG6CT 
AAAAAGATAC 
GAAAAGAGTA 
GTTTCTGAGA 
GAGCCATTAT 
AGACATTTGC 
GATCGMGAA 
GTTGATTGAT 
GTGGTCAGCC 
TGGTTCCTGG 
CGGCCCCTTG 
CCACTTCATC 
GAATGAAGGG 
AAGCCCCAGC 
CATCCCCATC 
CCACTTGTAC 
TCGCTTCCAT 
CAGTTATATT 
CTACAGGAGC 
ACCCATTCAA 
GAAGCCAGTG 
CTTTGTGAAG 
CGATGATCTC 
AAAGACACG6 
GGCCCTGGAC 
TAGACAGCAG 
CTCATT6GAA 
GACTGG6TAA 
TTGACTGTAT 
TAAATTTTTT 
AGACCCAGGA 
AACTTTTCCT 
TCTCATAACT 
ACATGATGAC 
CTGTTATTAG 
CAAACATGAG 
TTAAATGTTC 
AGAACCACCA 
ATGTACCATA 
TTTGTAGCCA 
TTTATGTTTA 
TTTTATG6TG 
TTACGTTAAT 
CTTTTCTATT 
ACCATTTA6A 
CTGTATCTTA 
ATTTAAAAAA 
TTGCTCTTCC 
CGTTTTTTAG 
TCTGAGGACA 
AGTTGACATT 



41 
I 

GAGGGCT6TC 
GTGGGCGGTA 
AGTAAAGACA 
GCAGCA6TGA 
CCCTTTGTGA 
TGTTCCGTGC 
TCAGAA6AGA 
GATTATAAAA 
AAGAGCACAA 
ACAGGCCTCG 
AGTCAGCACC 
TTTGATGTCA 
TCCCTCCAAG 
ATGGTCCTGG 
TTCTCTCTGA 
ACTTCCAACA 
CTTGTCCCCA 
GGTGTGCGCC 
GTGTCTAATT 
CCTGGAGAAA 
CTTGAGATTA 
GATCTCCAGC 
GAGAACCACC 
CAGGACAATG 
CAGA6CAAGA 
GTGCTCTTCA 
AATTTGTAAA 
ATGGATAAAC 
CATCTATCAC 
CCTCATGCAT 
CCAACCATTG 
TAAAGGACAT 
GATTAATGTG 
AAAAGCAGAA 
AGTCTTTTTT 
AACAATGTAA 
CAGAAAATAT 
ATGGTCAACT 
AAAAGAAAAT 
CAAAAAATAA 
TTTCAGCTGT 
ATCTGTTTAA 
TATTTTGAAC 
GTAATTATTA 
CTACTGTTAT 
TCCTATTACC 
AAACCCATCA 
CTCATGAGAA 
TTCATTATTT 
TTCTTTTAGA 
TCTTTTTCTC 
6TTTTCTGTT 



51 



TGAGATTCGA 


60 


AACGCGGTTA 


120 


TAAATACCAC 


180 


GGAAGAGCTG 


240 


CATCTCACAT 


300 


TGTTCCTGAG 


360 


ATCAAATAGT 


420 


ATCGTCAGTT 


480 


ACAGATACAG 


540 


GCTTCCTGGT 


600 


TGGCCCTGGA 


660 


CTGCAGAGCC 


720 


GTGAGGTGGA 


780 


AGCATCCAGC 


840 


TGGGCATCCT 


900 


CATTGATCTA 


960 


GCGACGCCTT 


1020 


TGCAGACTTC 


1080 


CTGCTAACCT 


1140 


TTCAGCACTT 


1200 


CTGAAAAAAG 


1260 


TTGTAGCTGC 


1320 


GGCAACTCCA 


1380 


AGGTTCTTAC 


1440 


AT6AGGCCTT 


1500 


6AAGCATTAG 


1560 


ATGAGTCAGT 


1620 


AGAAATGT6A 


1680 


ACAGGCTTTC 


1740 


TTTCCTCAAG 


1800 


CTTGATATAT 


1860 


CCGGATAGGT 


1920 


AAAAATAATT 


1980 


AAGCTAACAA 


2040 


TTTTTTGTAT 


2100 


ATGTATTTGC 


2160 


TTTCCCACTG 


2220 


CCAAGGAATA 


2280 


GAGGGTAATT 


2340 


GCCAACTCTG 


2400 


CTTTTGATTA 


2460 


AACAAGTTCC 


2520 


CCATTCTGAT 


2580 


ATATGTTAGG 


2640 


C3VCATTTGTA 


2700 


TTTTCTCATC 


2760 


ATAAATTATT 


2820 


GAGTAATCTG 


2880 


TCCAAATTTC 


2940 


GTGGTTCTGA 


3000 


ACCTTCATTC 


3060 


CAGCAGTTTC 


3120 
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3NSDOCI0: <WO 02079492A2J_> 



wo 02/079492 



PCT/US02/04915 



CTTTTAGCTT CCGTATTTCC TGATGAGAAA TCTGCAGTCT^ TTCAAATTGT TGTTTCCCTG 3180 

TATGTAGTGT GTCATTTTTC TGTCAGATTT CAAGGTATTT ATCTTTAGTT TTTA6CCATT 3240 

TCATTATGTT GGGGATGAGT TTCCTTGTTT TATTCCCTTT GGAATTTGCT CCAATTCATA 3300 

AATTTGCAGT TTTATGTCTT TTACCAAACT TAGAGGTTTT CAGCCTAATT TCTAAAAATA 3360 

CTTTTTATTA GCCTGATTTT CATCTTTATA GGAAATAGTT TAAGTGATGA CAAGTTCCAA 3420 

TAGCTTATAT GCCCAGAAGG CCTTCAAAAT AAGAATTTTG AAA6AATACA GAAAACAAAC 3480 

TTTTATATCC TTCTCATGTC TTCTACTGTA AAATTCATAT GCTTTGCTAC TCTAAACCTA 3540 

GTTTGAAATC AACAGTCTTG AGAATAGATG AAAATTTTGA TGAATAGTGG AATTCTTTTA 3600 

AATGGAAACC TCTTACATGT GATTTTCCTT GCCATCTAGA AATAAACCAT AGTATTTATG 3660 

TTGAATCAAT CAATATTATA TTTTGTTTTT TTCCTCCTCT TCTGAGACTC TTATTGTGGA 3 720 

AATGTTAGAC TTTTATGTTT TCCTAAATGT CCCTGATATT CTACTTATTT AGAACATCTT 3780 

TTCATTTTTT CCATTATTCT GATTGGGTAA TTTTAATTTG TCTATTTTCA AATTTGCTGG 3840 

AGTGTTCACC TGTTGTTGTC TGTGTCGTCC CACTGAGTGC ATTCACCACC TTTTAAATTT 3900 

TGGTCACTGT ATGTATCAGT TCTAAAATTT CCATTTTGTT CTCTATATTT TAAATTTCTT 3960 

GGCTTATATT CTATTTTCCT GCAAATGTGT CAGCATTTGC TTGTTTGAGC TTTTTTTTTT 4020 

TCAAGACAGG GTCTCAACTC TGTTACCCAG GCTGGAGTGC AGTGGTGCGA TCTCAGCTCA 4080 

CTGCAACCTC TGCCTCCTGG TTCAAGCGAT TATTGTGCCT CAGCCTCCTG AGTAGCTGGG 414 0 

ATTACAGGCA TGCACCACCA CAGCCCAGCT AATTTTTTGT ATTTTTAGTA GAGACAGAGT 4200 

TTTGCTATGT TGGCCAGGCT GGTTTTGAAC TCCTGGCCTC AAGTGATCCA CCCACCTCAG 4260 

CCTCCCAAAG TGCTGGGATT ACAGGCCACT ACACCTGGCA CATTTGAGTA TTTTTTTTTT 4320 

rpTTTTTTTTT TTGAGATGGA GTCTCGCTCT GTCATCTAGG CnX^GAGTGCA GTGGTGTGAT 4380 

CTCAGCTCAC TGCAGCCTCT GTCTCCCGGG CTCAAGCX3AT TCTCTTGCCT CAGCCTCCTG 4440 

AGTAGCTAGG ACTACAGGT6 CATGCCAACA CGCCCG6CTA ATTTTTTTAA AAAATATTTT 4500 

TAGTAGAGAC AGGGTTTCAC CATTTTGGCC AGGATGGTCT CGATCTCCTG ACCTCATGAT 4560 

CCACCCGCCT C6GCCTTCCA AAGTGCTGGG ATTACAGGCA TGAGCCACCG TGCCTG6CCT 4620 

CATTTGAGTA TTTTTATAAT GTCTCTTTTA AAGTCTTTGT CAGATAATTC CAGTGTACAT 4680 

GTTATTCAGT GTTTGGTGTC CACTGAGTTG TCATTTGCCA GACAAGTGGA GATTTTTGCA 4740 

GCTCATCCTT GTATTCTCAG TAGTTCCGAT ATGTACCCTC GACATGTGAA TGTTATCTTA 4800 

TGAGACTCTG TTTTATTTGT ATCCAACAGA AGATGTTTAT TATTTATTTG GCTTTCTGTG 4860 

AACTGAGGTC TTAATATCA6 CTCATTTTAA AAGTCTTTGC AGT6GTATTC GGATC TATC C 4920 

TGTGTGTGCC TATGAGATTG GGTGCA6TGT ATCCT6TTA6 CTCCATTCTC AGGGCGTTTG 4980 

AATGTGAATT AGGACCAGCG CAAT6AATGC TCAAGTTGGG 6TT6GGCGTT AGAATTCATA 5040 
AAAGTCTTTA TATGCTCAG 



Seq ID NO: 83 Protein sequence i 
Protein Accession NP_055774 



1 11 21 31 41 51 

I I I I I I 

MMRQRQSHYC SVLFLSVNYL GGTPPGDICS EENQIVSSYA SKVCFEIEED YKNRQFIiGPE 60 

GNVDVEIjIDK STNRYSVWFP TAGWYIiWSAT GLGFLVRDEV TVTIAFGSWS QHIiALDLQHH 120 

EQWLVGGPIiF DVTAEPEEAV AEIHLPHPIS LQGEVDVSWP LVAHPKNE6M VliBHPARVEP 180 

FYAVLESPSF SLMGIIiIiRIA SGTRLSIPIT SNTLIYYHPH PEDIKPHLYL VPSDAIiLTKA 240 

XDDEEDRFKG VRLQTSPPNB PLNFGSSYIV SNSANIiKVMP KBIiKIiSYRSP GEIQHFSKFY 300 

AGQMKEPIQL EITEKRHGTL VWDTEVKPVD LQLVAASAPP PFS6AAFVKB NHRQLQARMG 360 

DIiKGVIiDDIjQ DNEVLTENEK BIjVEQEKTRQ SKNEAItliSMV EKKGDIALDV IiFRSISBRDP 420 
YltVSYLRQQN !• 



Seq ID NO: 84 Nucleotide sequence; 
Nucleic Acid Accession #: NM_007036 

Coding sequence: 56-610 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

CTTCCCACCA GCAAAGACCA CGACTGGAGA GCCGAGCCGG AGGCAGCTGG GAAACATGAA 60 

GAGCGTCTTG CTGCTGACCA CGCTCCTCGT GCCTGCACAC CTGGTGGCCG CCTGGAGCAA 120 

TAATTATGCG GTGGACTGCC CTCAACACTG TGACAGCAGT GAGTGCAJ\AA GCAGCCCGCG 180 

CTGCAAGAGG ACAGTGCTCG ACGACTGTGG CTGCTGCCGA GTGTGCGCTG CAGGGCGGGG 240 

AGAAACTTGC TACCGCACAG TCTCAGGCAT GGATGGCATG AAGTGTGGCC CGGGGCTGAG 300 

GTGTCAGCCT TCTAATGGGG AGGATCCTTT TGGTGAAGAG TTTGGTATCT GCAAAGACTG 360 

TCCCTACGGC ACCTTCGGGA TGGATTGCAG AGAGACCTGC AACTGCCAGT CAGGCATCTQ 420 

TGACAGGGGG ACGGGAAAAT GCCTGAAATT CCCCTTCTTC CAATATTCAG TAACCAAGTC 480 

TTCCAACAGA TTTGTTTCTC TCAGGGAGCA TGACATGGCA TCTGGAGATG GCAATATTGT 540 

GAGAGAAGAA GTTGTGAAAG AGAATGCTGC CGGGTCTCCC GTAATGAGGA AATGGTTT^ 600 

TCCACG CTGA TCCCGGCTGT GATTTCTGAG AGAAGGCTCT ATTTTCGTGA TTGTTCAACA 660 

CACAGCCAAC ATTTTAGGAA CTTTCTAGAT ATAGCATAAG TACATGTAAT TTTTGAAGAT 720 

CCAAATTGTG ATGCATGGTG GATCCAGAAA ACAAAAAGTA GGATACTTAC AATCCATAAC 780 

ATCCATATGA CTGAACACTT GTATGTGTTT GTTAAATATT CGAATGCATG TAGATTTGTT 840 

AAATGTGTGT GTATAGTAAC ACTGAAGAAC TAAAAATGCA ATTTAGGTAA TCTTACATGG 900 

AGACAGGTCA ACCAAAGAGG GAGCTAGGCA AAGCTGAAGA CCGCAGTGAG TCAAATTAGT 960 

TCTTTGACTT TGATGTACAT TAATGTTGGG ATATGGAATG AAGACTTAAG AGCAGGAGAA 1020 

GATGGGGAGG GGGTGGGAGT GGGAAATAAA ATATTTAGCC CTTCCTTGGT AGGTAGCTTC 1080 

TCTAGAATTT AATTGTGCTT TTTTTTTTTT TTTGGCTTTG GGAAAAGTCA AAATAAAACA 1140 



219 



wo 02/079492 



PCT/US02/04915 



ACCAGAAAAC CCCT6AAGGA AGTAAGATGT 
AGCTTTGAAC TGAGAGCAAT TTCAAAAGGC 
T6AAG6ACG6 TTCTGGGGCA TAGGAAACAC 
CCACCTCAGA GATAAATCTA AGAAGTATTT 
5 GTAAATATTT ATATATTTTT ATAAATAAAT 
ArPTATCATC CTCTTGAGGA AAGAAATCTA 
AACCTATGAC TCTATAAGGT TTTCAAACAT 
TATAGGAGTC ACTCTGGATT TCS^AAAAATG 
TAAACATAAG TGCTGTGACT TCGGTGAATT 

10 GGAGGTTTGT AAAAGAAGAA TCAATTTTCA 
TG6AATTAGG A6TATATTTG AAAGAATCTT 
CTTAGGTUU^T ATCTCAGAAG TATTTTATTT 
AGTATTTACC TGTATTTTAT TCTTGAAGTT 
GCCTTTGAAT GTAAAGCTGC ATAAGCTGTT 

IS TTGTTCAATA AAAAAGAACA A6ATAC 



TTGAAGCTTA TGGAAATTTG AGTAACAAAC 1200 

TGCTGATGTA GTTCCCGGGT TACCTGTATC 1260 

ATACACTTCC ATAAATAGCT TTAACGTATG 1320 

TACCCACTGG TGGTTTGTGT GTGTATGAAG 1380 

GTGTTAGTGC AAGTCATCTT CCCTACCCAT 1440 

GTATTATTTG TTGAAAATG6 TTAGAATAAA 1500 

CTGAGGCATG ATAAATTTAT TATCCATAAT 1560 

TCAAAAAATG AGCAACA6AG 6GACCTTATT 1620 

TTCAATTTAA GGTATGAAAA TAAGTTTTTA 1680 

GCAGAAAACA TGTCAACTTT AAAATATA6G 1740 

AGCACTUUVCA 6GACT6TTGT ACTAGATGTT 1800 

GAAGTGAAGA ACTTATTTAA GAATTATTTC 1860 

GGCCAACAGA GTTGTGAATG TGTGTGGAAG 1920 

AGGTTTTGTT TTAAAAGGAC ATGTTTATTA 1980 



20 



25 



Seq ID NO: 85 Protein sequence: 
Protein Accession #: HP 008967.1 



11 



21 



31 



41 
I 



51 



MKSVLIiiTTL LVPAHLVARW SNNYAVDCPQ HCDSSECKSS PRCKRTVLDD CGCCRVCAAG SO 

RGETCYRTVS 6MDGMKCGPG IiRCQPSNGBD PFGEBPGICK DCPYGTFGMD CRETCNCQS6 120 

ICDRGT6KCL KFPFFQYSVT KSSHRFVSIiT EHDHASGD6N IVRBEWKEN AAGSPVMRKW 180 
IiNPR 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



Seq ID NO: 86 Nucleotide sequence t 
Nucleic Acid Accession #: D86983 

Coding sequence: 52-4491 (underlined sequences correspond to start and stop codons) 



AGCCGGCCGT 
CGCTCCAGGG 
ACGCTGGCCG 
CGCACCACCG 
ACCTCCATCC 
CGGCTGAGGA 
GGAGCATTTG 
TCAATTGACA 
AATCAGATAG 
TTTTTGCATA 
AT6AAGAGAT 
GCGGATTTGC 
TATCCCAGAC 
GAAAGGCCCC 
TACTTCACCT 
AATGAGCT6A 
ATCCAGAAC^A 
GGAGAGGTGA 
TTTGTAATCC 
AGCGCCACAG 
CCAGTTGACC 
CAGGG6GACA 
ACCGCTTTCA 
GTTATTGAGG 
■ ATCGCCTGGA 
TCGGGAACAC 
GCTGTCAACA 
ACCCCAGTGT 
CTCCCGTGCA 
CAGGTGACAG 
GTTGGCCCTG 
TCGGTGAGCA 
GTAGCTACCT 
ACACATTTGT 
CCGA6GGATC 
CAGCTCATTC 
CACTACAACG 
ACCGCCCACC 
CACGACGGCA 
GAGCGCCTGC 
CACCGACTGT 
6G6ACG6AGA 



11 

I 

GGTGGCTCCG 
GCCCCGGGCG 
TGGTGGCCCA 
TGCGCTGCAT 
TAGATCTTCG 
ACTTGAACAC 
AAGACTTGGA 
GGCAAGCATT 
TITIACTTTGGA 
ACAACCGGAT 
TGC6ACTG6A 
TGAAAACCTA 
GCATCCA6GG 
GGATCACCTC 
GCAGAGCCGA 
GCATGAAGAC 
CACAGGAGAC 
AGACGCAAGA 
AGCCACAGAA 
GCCACCCCCC 
CGCGGGTGAA 
GCGGAGAGTA 
TCATCGTCCA 
GCCAGACCGT 
CCAAGGGAGG 
TTAGAATCTC 
TCATCGGCTC 
TTGCCAGCAT 
GCTCCCAGGG 
AAAGTGGAAA 
CAGACGCAGG 
TGGTGCTCAG 
CCATCGTGGA 
TTGACAGCCG 
CTTACACA6T 
AGGA6CAT6T 
ACCT6GTGTC 
66CGCGTGAA 
CCTGTAACAA 
TGAAATCCGT 
ACAACGG6CA 
CCGTCACACC 



21 
I 

TGCGTCCGAG 
CCGCTGCCTG 
GAAGCCGGGC 
GCATCTGCTG 
CTTTAACAGA 
ATTGCTTCTC 
AAATTTAAAA 
TAAGGGACTT 
CCCAGATTCG 
TACACATTTA 
CTCAAACACA 
CGCGGAGTCG 
AC6CTCAGTG 
CGAGCCCCAG 
AGGCAACCCC 
A6ATTCCCGC 
A6ACCAGGGT 
GGT6ACCCTC 
TACA6AGGTG 
GCCGCGGATC 
CATCACGCCT 
TGCGTGCTCT 
GGCTCTTCCT 
GGATTTCCAG 
GAGCCAGCTC 
TGGTGTTGCC 
CCAGAAGGTC 
TCCCAGCGAC 
CGAGCCCGAG 
ATTTCACATC 
TCGCTATGAG 
TGTGAACGTT 
AGCGATTGCG 
TCCTCGTTCT 
TGAACAGGCA 
ACA6CATGGC 
TCCACA6TAC 
CAACTGCTC6 
CCTGCAGCAC 
GTACGAGAAT 
CGCCCTTCCC 
CGACGAGCAG 



31 
t 

CGTCCGTCCG 
TTGGCGCTCG 
GCAGGGTGTC 
CTGGAGGCCG 
ATCAGAGAGA 
AATAATAATC 
TATCTCTATC 
GCCTCTCTAG 
TTCCAGCATC 
GTTCCAGGGA 
CTTCACTGCG 
GGGAAC6CGC 
GCAACCATCA 
6AC6CAGATG 
AAGCCT6AGA 
CTAAACTTGC 
ATCTACCAGT 
AGGTACTTCG 
CTGGTTGGGG 
TCCTGGACGA 
TCTGGCGGGC 
GCGACCAACA 
CAGTTCACTG 
TGTGAAGCCA 
TCCGTGGACC 
CTCCACGACC 
GTGGCCCACC 
ACAACAGTGG 
CCAGCCATCA 
AGCCCTGAAG 
TGTGTGGCCC 
CCTGACGTCA 
ACTGTTGACA 
CCAAAT6ATT 
CGGGCGGGAG 
TTGATGGTCG 
CTGAACCTCA 
6ACATGTGCT 
CCCATGTGGG 
G6CTTCAACA 
ATGCCGCX3CC 
TTCACCCACA 



41 
I 

CGCCGTCGGC 
TGCTGTTCTG 
CGAGCCGCTG 
TGCCCGCCGT 
TCCAACCTGG 
AGATCAAGAG 
TGTACAAGAA 
AGCAACTATA 
TCCCGAAGCT 
CATTTAATCA 
ACTGTGAAAT 
AGGCA6CGGC 
CCCCGGAAGA 
TGACXrrCGGG 
TCATCTGGCT 
TGGACGATGG 
GCAT6GCAAA 
GGTCTCCAGC 
AGAGCGTCAC 
GAGGTGACCG 
TTTACATACA 
ACATTGACAG 
TGACGCCTCA 
AGGGCAACCC 
GGCGGCACCT 
AGGGCCAGTA 
TGACTGTGCA 
AGGTGGGCGC 
CCTGGAACAA 
GATTCTTQAC 
GGAACACCAT 
GTCGAAATGG 
GAGCTATAAA 
TGCTGGCCTT 
AAATCTTTGA 
ACCTCAACGG 
TCGCAAACCT 
TCCACCAGAA 
GCGCCTCGCT 
CCCCTCGGGG 
T6GTGTCCAC 
TGCTGATGCA 



51 
I 

CATGGCCAAG 
CGCCTGGGGG 
CCTGTGCTTC 
GGCGCCGCAG 
GGCATTCAGG 
GATACCTAGT 
TGAGATCCAG 
CCTGCACTTT 
CGAGAGGCTA 
CTTGGAATCT 
CCTGTGGTTG 
CATCTGTGAA 
GCTGAACTGT 
GAACACCGTG 
GCGAAACAAT 
GACCCTGATG 
GAACGTGGCC 
TCGACCCACT 
GCTGGAGTGC 
CACACCCTTG 
GAACGTCGTA 
CGTCCATGCC 
GGACAGAGTC 
GCCGCCCGTC 
GGTCCTGTCA 
CGAATGCCAG 
GCCCAGAGTC 
CAATGTGCAG 
GGATGGGGTT 
CATCAATGAC 
TGGGTCGGCC 
AGATCCGTTT 
CTCAACCCGA 
GTTCCGGTAT 
ACGGACATTG 
AACAAGTTAC 
GTCGG6CTGT 
6TACCGGACG 
GACCGCCTTC 
CATCAACCCC 
CACCCTGATC 
GTGGGGCCAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
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TTCCTGGACC 
6GACA6CACT 
CCCAATGACT 
6TGTGCG6CA 
AACCAGCTCA 
CGCAGCATCC 
TCCG6GAAGC 
AACGAGAGCC 
CTGACCAGCA 
AAGCTGAACC 
GCGGAGATCC 
ATGAGGACGC 
GCCTTCGCCA 
CTGGACGAGA 
TCTCCCTTCC 
GTGGCGGGGA 
TTCTCCATGG 
GACCACGGGA 
ACGTTCGAGG 
TTGTATGGCT 
CCTGGCAGCC 
CGAGATGGGG 
CAGATCAAGC 
GTGCAGAGCG 
ATCCCCAGGG 
CAGTTCAATG 
GAGGACAAGC 
GAACATCTCA 
GACTTCAGAG 
AAGAAACTTG 
GCCAACAACA 
GTCACCTGCT 
GGGGCCTGCT 
AGGCTCCTCA 
TGCGGACTGC 
GTGCTGTTAC 
ACAGCAGGTG 
TTTTATTTAA 
ATTTAGGCGC 
ACCTCTATAT 
AGGTGGGGTG 
CTATGTTTAA 
CACAGGGACA 
AACTCCTCCT 
GACAAACATT 
CTTCCACACC 
TCACTT6CAC 
TGTGTATCTG 
CCCTGGTT6C 
AGGAGCTOVA 



AC6ACCTCGA 
GCAGCAACGT 
CCCX3GGCCAG 
GCGGCATGAC 
CCTCCTACAT 
GCGACCTGGC 
CGCTGCTCCC 
CCATCCCCTG 
TGCACACGCT 
CGCACTGGGA 
AGCACATCAC 
TGGGAGAGTA 
CCGCGGCCTT 
ACTTCCAGCC 
GGATTGTGAA 
AAATGCGTGT 
CACACACGGT 
TCCCACCCTA 
ACCTGAAAAA 
CGACACTCAA 
GGCTGGGCCC 
ACAGGTTGTG 
AGACGTCGCT 
ACGTGTTCAG 
TGGACCTCCG 
CCTTTTCCTA 
CGACCAAGAA 
GCAACAGCAC 
AGTTTGTTCT 
AATCACGGCT 
CCAAGTGGAA 
TCGTGGAAGC 
GTCCAGTCTG 
GAGTTTGTCT 
A6ACCAGGAA 
AGAAGGCAGT 
CCTGAAGGGA 
TTCTTTTAAA 
CTAAATTGGT 
GTCAGCCTTG 
AGTCTCGGAG 
AAAGAAAATT 
CTGTCTGGGG 
TCCTCTGGGC 
CCCGCTGCTC 
TGATTAGAAC 
ACATACTGCC 
ATACCTGCCG 
GTCCACGTCC 
GTGTCGGGAA 



CTCCACGGTG 
GTGCAGCAAC 
GAGCGGGGCC 
TTCGCTGCTC 
CGACGCATCC 
CAGCCACCGC 
CTTCGCCACC 
CTTCCTGGCC 
GTGGTTCCGC 
CGGCGACACC 
CTACCAGCAC 
CCACGGCTAC 
CAGGTTTGGC 
CATTGCACAA 
TGAGGGCGGC 
GCCCTCGCAG 
GGCTCTGGAC 
CCACGACTAC 
TGAGATTAAA 
CATCGACCTG 
CACCCTGATG 
GTATGAGAAC 
GGCCAGGATC 
G6TG6CGGAG 
GGTGTG6CAG 
TCATTTCCJGA 
AACAAGACCA 
CTCAGCCTTC 
GGAAATGCAG 
CAGTACCACA 
AAAAGATGCA 
TT6CCCCCCT 
CTTACAGAAG 
GCTGTGCCAT 
ACACCCAGAA 
GCAGGAGGCT 
AGCAGGCAGG 
ATGAAAAATT 
TTTGCCTCCC 
CCTTGTTCAG 
CTGCCAGAGG 
GGTGTTTGGC 
GTGCAGTGCA 
TCTCT6TAAC 
GAAGCAGCTG 
ATTCATAAGC 
TAGTTGTGAA 
AGG6CCAAGG 
TGAACAAGAG 
CTGTCTAACT 



GTGGCCCTGA 
GACCCCCCCT 
CGCTGCATGrr 
ATGAACTCCX3 
AACGTGTACG 
GGCCTGCTGC 
GGGCCGCCCA 
GGGGACCACC 
GAGCACAACC 
ATCTACTATG 
TGGCTCCCGA 
GACCCCGGCA 
CACACGCTTG 
GATCACCTCC 
ATCGATCCGC 
CTGCTGAACA 
CTGGCGGCCA 
AGGGTCTACT 
AACCCTGAGA 
TTTCCGGCGC 
TGTCTTCTCA 
CCTGGGGTGT 
CTATGCGACA 
TTCCCTCACG 
GACTGCT6TG 
GGCAGAC6GT 
CGGAAAATAC 
AGCACACGCT 
AAGACCATCA 
GAGTGCX3TGG 
TGCACCATTT 
GCCACCTGTG 
AGGGCG6AGG 
CGTGAGATCG 
CTCGTGACAT 
TCCAACCAGA 
AGTCCTAGCT 
GGTGCTACTA 
AACACCATTT 
ATGCCAGGAG 
GGCrCACCGA 
AAACGGAACA 
AGCCCCCGGC 
ATTTCACC3VC 
TATAGCCTGT 
CACATTTAGA 
CCAAATGTGA 



CCGCTTCCGG 
TCAGGTTGTG 



GCCAGGCACG 
GCTTCTCTGT 
TCTTCGTGC6 
TGTACCCGC6 
GGAGCAC6GA 
GGCAGGGCAT 
CGGAGTGCAT 
GCGCCAACGA 
GCATTGCCAC 
AGACCAGGAA 
AGATCCTGGG 
TCAATGCTGG 
TCAACCCACT 
CCCTTCACAA 
TTCTCAGGGG 
CGGAGCTCAC 
TCAACATCCA 
GCAATCTATC 
TCCGGGAGAA 
TCGTGGTGGA 
GCACACMTT 
TCTCCCCGGC 
ACGCGGACAA 
GCTACGGCA6 
AA6ACTGTAG 
CTCTT6AGTT 
CCAGTGTTGG 
CAGATGCATC 
CAGACCTCAG 
ATGCCXX3GG6 
GTGAATGCAA 
CTGTCCCCGT 
AAAAGCCCTA 
GGTGGCCGAT 
TTCATGACAA 
GCATCTGCGG 
TCACGTTAGA 
TTAAATTGCA 
CTTTTTAAAT 
CCGGCMACC 
AATCGGGGTT 
GAACCTTTGA 
CTCTTCCCTG 
ACX3TCAGCAT 
GACTCTCCX5T 
AACAGATTTG 
AAAAACCTCC 
ACAAGGCCGC 
ATGGCTCTTC 
TGAGTGCGTT 



CTTCTCC6AC 
CATGATCCCC 
CTCCAGCCCT 
G6AGCAGATC 
GCATGAGGCC 
CGTGCAGCGG 
GCGGGACGAG 
GCAGCTG6GC 
GGAGCTGCTC 
GATCGTGGGT 
GGAGGTGGGC 
CATCTTCAAC 
GCTTTACCGG 
AGCTTTCTTC 
GCTGTTCGGG 
GGAGCGGCTG 
GCGGGGCCGG 
GGCGGCACAC 
ACTGAAAAGG 
GGACCTGGTG 
CAAGCGCCTG 
CCAGCTGACT 
CATCACCCGG 
CTGTGACX3AG 
GACCAGGGGG 
CAGCTACCAG 
GAGACAGG6G 
TGGGACAAAT 
AACACAGATA 
CGAATCTCAC 
AGACGGGCAG 
GAACATCCCA 
_GGCTCCTGGG 
GGCAGGGAGC 
CGTCCAGCTG 
AGAAGGAGGC 
CTTCTCAGGT 
CAGTTGAATC 
AAAGCAGGAT 
TGTCACCCGC 
CCATCACAAG 
TGAGAGCGTT 
GGAACCTCTG 
CTAATCCCTU^ 
GTGTCAGCTC 
CTTTCAGCTG 
TTCATCCCAT 
TCCCAGCC6G 
CCAAGGGA6G 



Seg ID NO: 87 Protein secmence; 



55 I 



Protein Accession #: 
1 11 



SRPWWIiRASB RPSAPSAMAK 
RTTVRCMHLL LEAVPAVAPQ 
GAFEDLENLK YLYIiYKNEIQ 
FLHNNRITHL VPGTFNHLES 
60 YPRRIQGRSV ATITPEELNC 
NELSMKTDSR LNLLDDGTLM 
FVIQPQNTEV LVGESVTLEC 
QGDSGEYACS ATNNIDSVHA 
lAWTKGGSQL SVDRRHLVIiS 
65 TPVPASIPSD TTVEVGANVQ 
VGPADAGRYE CVARNTI6SA 
THIiFDSRPRS PHDUJOtFRY 
HYNDLVSPQY LNLIANLSGC 
ERLLKSVYEN GFNTPRGINP 
70 FLDHDIjDSTV VALSQARFSD 
VCG5GMTSJLL MNSVYPREQZ 
SGKPLIjPPAT GPPTECMRDE 
KXiNPHWDGDT lYYETRKIVG 
AFATAAFRFG HTLVNPIiIiYR 
75 VAGKMRVPSQ LLNTELTERL 
TFEDLKNBZK NPEIREKIiKR 



BAA13219 
21 
I 

RSRGPGRRCL 
TSILDIiRFNR 
SIDRQAFKGIi 
MKRLRLDSNT 
ERPRITSEPQ 
IQNTQETDQG 
SATGHPPPRI 
TAFIIVQALP 
SGTLRXSGVA 
IiPCSSQGEPE 
SVSMVIiSVNV 
PRDPYTVEQA 
TAHRRVNNCS 
HRIiYNGHAIiP 
GQHCSNVCSN 
NQI*TSYIDAS 
NESPIPCFIiA 
AEIQHXTYQH 
IiDENFQPIAQ 
FSMAHTVAIiD 
LYGSTLNIDL 



31 
1 

liALVLFCAWG 
IREIQPGAPR 
ASLEQLYIiHF 
LHCDCEILWL 
DADVTSGNTV 
lYQCMAKNVA 
SWTRGDRTPL 
QFTVTPQDRV 
LHDQGQYECQ 
PAITHNKD6V 
PDVSRNGDPF 
RAGEXFERTIi 
DMCFHQKYRT 
MPRIiVSTTLI 
DPPCFSVHIP 
NVYGSTEHBA 
GDHRANEQLG 
WLPKILGEVG 
DHLPIiHKAFF 
ZiAAINXQRGR 
FPAItWEDXiV 



41 

I 

TIiAWAQKPG 
RliRNLNTLIiL 
NQIETLDPDS 
ADLLKTYAES 
YFTCRAEGNP 
GEVKTQEVTL 
PVDPRVNITP 
VIEGQTVDFQ 
AVNIIGSQKV 
QVTESGKFHI 
VATSXVEAXA 
QIiXQEHVQHG 
HDGTCNNLQH 
GTETVTPDBQ 
PNDSRARSGA 
RSXRDIiASHR 
LTSMHTLWFR 
MRTLGEYHGY 
SPFRXVNEGG 
DHGXPPYHDY 
PGSRZiGPTIjM 



51 
I 

AGCPSRCLCF 
NNNQIKRXPS 
FQHIiPKIjERL 
GNAQTUW^ICE 
KPEIIWLRNN 
RYFGSPARPT 
SGGIiYIQNW 
CEAKGNPPPV 
VAHIiTVQPRV 
SPEGFIiTXMD 
TVDRAINSTR 
LKVDliNGTSY 
PMWGASliTAF 
FTHMLMQWGQ 
RCMFPVRSSP 
GLLRQGXVQR 
EHNRXATELIi 
DPGXNAGXFN 
IDPLLRGLFG 
RVYCNZjSAAH 
CI*IiSTQPKRX, 



2580 
2640 
2700 
2760 
2820 
2680 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4600 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 



221 
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RDGDRLWYEN PGVFSPAQIiT QIKQTSLARI LCDNADNITR VQSDVFRVAE FPHGYGSCDE 1320 

IPRVDLRVWQ DCCEDCRTRG QFNAFSYHFR GRRSLEFSYQ EDKPTKKTRP RKIPSVGRQG 1380 

EHIiSNSTSAF STRSDASGTN DFREFVIiEMQ KTITDLRTQI KKLESRLSTT ECVDAGGESH 1440 
ANNTKWKKDA CTICECKDGQ VTCFVEACPP ATCAVPVNIP GACCPVCIjQK RAEEKP 
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Seq ID NO: 86 DNA sequence 

Nucleic Acid Accession #: NM_004834.1 

Coding sequence: 80-3577 (underlined sequences correspond to start and stop codons) 
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AATTCGAGGA 
TTGGTGGCAA 
CCTCTCCTCC 
CACCTATGGA 
AGTTATGGAT 
GAAATACTCT 
TCCAGGACAT 
AGACCTTGTG 
CAGAGAAATC 
CAAGGGCCAG 
GAGTGCTCAG 
GATGGCTCCT 
TGATCTTTGG 
TGACATGCAT 
GTCAAAAAAA 
CAT6CAGCG6 
TGAAAGGCAA 
CGA6AAAGAT 
T6AACAGGAA 
AGATTTCCTG 
GTTACTACAG 
GAGACAGAAG 
GAGAGAGCGG 
AT^GAGGCGT 
AGAA6AAGAA 
GGAGCAGCGG 
GCATGACCAT 
AAGGAGCAAG 
AGCGCGAGAG 
CCCACTGCAG 
TGAGCCCAGG 
CTCCTCAGGG 
CTCCGGGGAA 
CCTGGAAAAT 
T6CTGGCGAA 
GCCACCTCAC 
G6AGGACGAC 
CAGAGCAGCG 
TGTCCATGAT 
CGTCCGCCAG 
TACACCTTTT 
ATCTGTGGTG 
CCGGAAAGGC 
GGAGATTCGT 
AGTGAATTTG 
GAAGGTCTAT 
GAATGTCTTG 
GTTAAGAAAT 
CGTAGGGGAT 
TCTGGTGATT 
CAAATTTATG 
CACTGTTGAG 
TGTTGATGTG 
CCCACACTCT 
AGATGGAATG 
71A6GATCACC 
TCGATCCAAT 
TGGTCACTTG 
ACGCAATGAC 
CAT6ACCTTA 
T6GCCTCCAG 
CT6CACCGAG 
CTCCTTCCTG 



11 
I 

TCCGGGTACC 

AAAGGGAAAA. 

CTGCGGGATC 

CAAGTCTATA 

GTCACTGAGG 

CATCACAGAA 

GATGACCAAC 

AAGAACACCA 

CTGAGGGGAC 

AATGTGTTGC 

CTGGACAGGA 

GAGGTCATCG 

TCTTGTGGCA 

CCAATGAGAG 

TGGTCGAAGA 

CCCTCTACAG 

GTTAGAATCC 

6AAACTGAGT 

GGAGAGCCAA 

AGACTGCAGC 

GAGCAACAGC 

CGGATTGAGC 

GAGGCTAGAA 

CTAGAGGAGT 

AAGAGGAGAG 

CACTTGGAAG 

AGGAGGCCGC 

CCAAGCTTCC 

GTTCCTGTGA 

GGCAGTGGGC 

CTTCTGTGGG 

TCCAGCAACT 

CGCTTCAGAG 

GC7U3TGAAAA 

GTGGATCT6A 

AAAGTAAC6G 

GATGT66A6C 

TCATCTCTGA 

6ATGTAGAAA 

ACTCA6TCCG 

ATAGACCCCA 

GGATTTTCCT 

TCAGTGGTCA 

AAATACAAGA 

CTAGTGGGTA 

CCTCTTATCA 

GTGACAATAT 

AAAATACTTC 

TTGGAAGGAT 

GCTTTGAAGA 

GCCTTTAAGT 

GAAGGCCAGA 

GATTCAGGAT 

ATGATCCAGT 

GAGCTTCTGG 

AAGGATGTAG 

CAGACAATGG 

GATGGTGTGT 

AAGGTGTTCT 

GGCAG6ACTT 

AGTCTTCAAG 

GGCAACCAGG 

TTCCTCTTAT 



21 
I 

ATGGCACAGA 
TGGCGAACGA 
CTGCTGGGAT 
AGGGTC6ACA 
ATG/^GAGGA 
ACATTGCAAC 
TCTGGCTTGT 
AAGGGAACAC 
TGGCACATCT 
TGACTGAGAA 
CTGTGGGGCG 
CCTGTGATGA 
TTACAGCCAT 
CACTGTTTCT 
AGTTTTTTAG 
A6CAGCTTTT 
AGCTTAAGGA 
ATGAGTACAG 
GTTCCATTGT 
AGGAGAACAA 
TCCGGGAGCA 
AGCAGAAAGA 
GGCAGCAGGA 
TGGAGAGAAG 
TTGAAAGAGA 
TCCTTCAGCA 
ACCCGCAGCA 
ATGCTCCCGA 
GAACAACATC 
AGCAGAATAG 
AGAGAGTGGA 
CAGGATCCCA 
TGAGATCATC 
AACCTGT^GA 
CCGCACT6GC 
ACTACTCCTC 
A6GAAGGGGC 
ATTTGAGCAA 
GTGAGCOGGC 
CTAGTAGCAC 
6ATTACTACA 
GTGATGGGAT 
ATGTGAATCC 
AGAGGTTTAA 
CAGAGAGTGG 
ACCGAAGACG 
CTGGCAAAAA 
ACAATGATCC 
GTGTACATTA 
GTTCTGTGGA 
CATTTGGAGA 
GGTTG/^GT 
CAGTCTATGA 
GTAGCATCAA 
TGTGCTATGA 
TTCTACAGTG 
GCTGGGGAGA 
TCATGCACAA 
TTGCCTCTGT 
CTCTT CTGAG 
ATCCTGA6AA 
ACAGCTGT6T 
ATACCAGTTT 



31 
1 

GCGACAGAGA 
CTCCCCTGCA 
TTTTGAGCTG 
TGTTT^AAACG 
AGAAATCAAA 
ATATTATGGT 
TATGGAGTTC 
ACTCAAAGAA 
TCACATTCAT 
TGCAGAGGTG 
GAGAAATACG 
GAACCCAGAT 
TGAGATGGCA 
CATTCCCAGA 
TTTTATAGAA 
GAAACATCCT 
TCATATAGAT 
TGGGAGTGAG 
GAACGTGCCT 
GGAACGTTCC 
GGAAGAATAT 
ACAGAGGCGA 
ACGTGAACAG 
GCGCAAAGAA 
ACAGGAGTAT 
GCAGCTGCTC 
CTCGCAGCAG 
GCCCAAAGCC 
TCGCTCCCCT 
CCAGGCAGGA 
GAAGCTGGTG 
GCCCGGGTCT 
ATCXTAAGTCT 
TAAAAAGGAA 
CAAA6AGCTT 
ATCCAGTGAG 
TGACGAGTCC 
TGGTGAAACG 
CATGACCCCA 
ACTCCAGAAA 
GATTTCTCCA 
GA6ACCAGAA 
TACCAACACT 
CTCTGAGATT 
CCTGATGCTG 
ATTTCAACAA 
GGATAAGTTA 
AGAAGTTGAG 
TAAAGTTGTA 
AGTCTATGCG 
ATTGGTACAT 
GATCTATGGA 
CATTTATCTA 
ACCCCATGCA 
AGATGAGGGG 
G66AGAGATG 
GAAG6CCATA 
AAGGGCTCAA 
TCGGTCTGGT 
CTGGTAGAAG 
CTTGGAATTC 
6TGCAGACCT 
ATCXCCATTC 



41 
I 

CATTTATTGT 
AAAAGTCTGG 
GTGGAAGTGG 
GGTCAGTTGG 
CTGGAGATAA 
GCTTTCATCA 
TGTGGGGCTG 
GACTGGATCG 
CATGTGATTC 
AAACTTGTTG 
TTCATAGGCA 
GCCACCTATG 
GAAGGTGCTC 
AACCCTCCTC 
GGGTGCCTGG 
TTTATAAGGG 
CGTACCAGGA 
GAAGAAGA6G 
GGTGAGTCTA 
GAGGCTCTTC 
AAAAGGCAAC 
CGGCTAGAAG 
CGAAGGAGAG 
GAAGAGGAGA 
ATCAGGCGAC 
CAGGAGCAGG 
CCGCCACCAC 
CACTACGAGC 
GTTCTGTCCC 
CAGAGAAACT 
CCCAGACCTG 
CACCCTGGGT 
GAAGGCTCTC 
GTTTTCAGAC 
CGAGCAGTGG 
GAGTCGGGGA 
ACCTCAGGAC 
6AATCTGTGA 
TCCAAGGA6G 
CACAAATCTT 
TCTAGCG6AA 
6CCATAAG6C 
AGGCCACAGA 
CTGTGTGCTG 
CTGGACAGAA 
ATGGACGTAC 
CGTGTCTACT 
AAGAAGCAGG 
AAATATGAAA 
TGGGCACCAA 
AAGCCATTAC 
TCCTGTGCTG 
CCAACACATG 
ATCATCATCC 
GTTTATGTAA 
CCTACATCAG 
GAGATCCGAT 
A6ACTAAAAT 
GGCAGCAGTC 
CA6TGTGATC 
CTTGTAACTG 
CATGTGTTGG 
TTTTTTTTTT 



51 
I 

TATTTGTTTT 
TGGACATCGA 
TTGGAAATGG 
CAGCCATCAA 
ATATGCTAAA 
AAAAGAGCCC 
GGTCCATTAC 
CTTACATCTC 
ACCGGGATAT 
ACTTTGGTGT 
CTCCCTACTG 
ATTACAGAAG 
CCCCTCTCTG 
CCCGGCTGAA 
TGAAGAATTA 
ATCAGCCAAA 
AGAAGAGAG6 
AGGAAGT6CC 
CTCTTCGCCG 
GGAGACiUlCA 
TGCTGGCAGA 
AGCAACAAAG 
AACAAGAAGA 
G6AGACGGGC 
AGCTAGAAGA 
CCATGTTACT 
CGCAGCAGGA 
CTGCTGACCG 
GTCGAGATTC 
CCACCAGTAT 
GCAGTGGCAG 
CTCAGAGTGG 
CATCTCAGCG 
CCCTCAA6CC 
AAGATGTACG 
CGACGGATGA 
CAGAGGACAC 
AAACCATGAT 
GCACTCTAAT 
CCTCCTCCTT 
CAACAGTGAC 
AAGATCCTAC 
GTGACACCCC 
CCTTATGGGG 
6TGGCCAAGG 
TTGAGGGCTT 
ATTTGTCCTG 
GATGGACAAC 
GAATCAAATT 
AGCCATATCA 
TGGTGGATCT 
GATTCCATGC 
TAAGAAAGAA 
TCCCCAATAC 
ACACATATGG 
TAGCATATAT 
CTGTGGAT^C 
TCTTGTGTGA 
AGGTTTATTT 
CAGGGATTAC 
GAGCTCG6AG 
GTTCTGTCCC 
TCTTACTCCA 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1960 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2S20 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3760 
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AAATAAATCA AGGCTGCAAT GCAGCTGGTG CTGTTCAGAT TCCAAAAAAA AAAAAAT^CC 3840 
ATGGTACCCG GATCCTCGAA TTCC 



5 Seq ID No: 89 Protein sequence: 
Protein Accession #: NP_004825.1 



1 11 21 31 41 51 

10 I I I I I 1 

MANDSPAKSI* VDIDIiSSLRD PAGIFELVEV VGNGTYGQVY KGRHVKTGQL AAIKVMDVTE 
DBEEEIKLEI NMLKKYSHHR NIATYYGAPI KKSPPGHDDQ LWIjVMEFCGA GSITDLVKNT 
KGNTLKEDWI AYISREIIiRG lAHLHIHHVI HRDIK6QNVL LTENAEVKLV DFGVSAQLDR 
TVGRRNTFIG TPYWMAPEVI ACDENPDATY DYRSDLWSCG ITAIEMAEGA PPLCDMHPMR 

15 ALFLIPRNPP PRLKSKKWSK KFFSFIEGCL VKNYMQRPST EQItLKHPPIR DQPNERQVRI 
QLKDHIDRTR KKRGEKDETE YEYSGSEEEE EEVPEQEGEP SSIVNVPGBS TIiRRDFLRLQ 
QENKERSEAli RRQQIjIiQEQQ LREQEEYKRQ LIiAERQKRIE QQKEQRRRLE EQQRREREAR 
RQQEREQRRR EQEEKRRLEE LERRRKEEEE RRRAEEEKRR VBREQEYIRR QIiEEEQRHIiE 
VLQQQIiLQEQ AMLLHDHRRP HPQHSQQPPP PQQERSKPSP HAPEPKAHYE PADRAREVPV 

20 RTTSRSPVLS RRDSPLQGSG QQNSQAGQRN STSIBPRIXU ERVEKIiVPRP GSGSSSGSSN 
SGSQPGSHPG SQSGSGERFR VRSSSKSEGS PSQRIiENAVK KPEDKKEVFR PIiKPAGEVDL 
TAIiAKBLRAV EDVRPPHKVT DYSSSSEESG TTDEEDDDVB QEGADESTSG PEDTRA7VSSL 
NLSNGETESV KTMIVHDDVE SEPAMTPSKE GTLIVRQTQS ASSTLQKHKS SSSFTPFIDP 
RLLQISPSSG TTVTSWGFS CDGMRPEAIR QDPTRKGSW NVNPTNTRPQ SDTPEIRKYK 

25 KRFNSBIZiCA AXiWGVNIjLVG TBSGUIIiliDR SGQGKVYPIiI NRRRFQQMDV IiEGU9VI.VTI 
SGKKDKLRVY YLSWUINKIXj HNDPEVEKKQ GHTTVGDIjEG CVHYKWKYE RIKFLVIALK 
SSVEVYAWAP KPYHKFMJVFK SFGBLVHKPL LVDLTVEEGQ RLKVIYGSCA GFHAVDVDSG 
SVYDIYIiPTH VRKNPHSMIQ CSIKPHAIII LPNTDGMEIJ, VCYEDEGVYV NTYGRITKDV 
VLQWGEMPTS VAYIRSNQTM GWGBKAIEIR SVETGHIiDGV FMHKRAQRIiK FLCBRNDKVF 

30 FASVRSGGSS QVYPFPTIiGRT SLI.SW 



Seq ID NO: 90 DNA sequence 
Nucleic Acid Accession #: none found 
35 Coding sequence: 2-71 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I • ' ' ' 

40 TTACA CTTCA ATTCCTTACA CGGTATTTCA AACAAACAGT TTTGCTGAGA GGAGCTTTTG 

TCTCTCC TTA AG AAAATGTT TATAAAGCTG AAAGGAAATC ATU^CAGTAAT CTTAAAAATG 

AAAACAAAAC AACCCAACAA CCTAGATAAC TACAGTGATC AGGGAGCACA 6TTCAACTCC 

TTGTTATGTT TTAGTCATAT GGCCTACTCA AACAGCTAAA TAACAACACC AGTGGCAGAT 

AAAAATCACC ATTTATCTTT CAGCTATTAA TCTTTTGAAT GAATAAACTG TGACAAACS^ 

45 ATTAACATTT TTGAACATGA AAGGCAACTT CTGCACAATC CT6TATCCAA G CAAA CTTTA 
AATTATCCAC TTAATTATTA CTTAATCTTA AAAAAAATTA GAACCCAGAA CTTTTCAATG 
AAGCATTTGA AAGTTGAAGT 6GAATTTAGG AAAGCCATAA AAATATAAAT ACTGTTATCA 
CAGCACCAGC AAGCCAT/UIT CTTTATACCT ATCAGTTCTA TTTCTATTAA CAGTAAAAAC 
ATTAAGCAAG ATATAAGACT ACCTGCCCAA GAATTCAGTC TTTTTTCATT TTTGTTTTTC 

50 TCAGTTCTGA GGATGTTAAT CGTCAAATTT TCTTTGGACT 6CATTCCTCA CTACTTTTTG 
CACAATGGTC TCACGTTCTC ACATTTGTTC TCGCGAATAA ATTGATAAT^ GGTGTTAAGT 
TCTGTGAATG TCTTTTTAAT TATGGGCATA ATTGTGCTTG ACTGGATAAA AACTTAAGTC 
CACCCTTATG TTTATAATAA TTTCTTGAGA ACAGCAAACT GCATTTACCA TCGTAAAACA 
ACATCTGACT TACGGGAGCT GCAGGGAAGT GGTGAGACAG TTCGAACGGC TCCTCAGAAA 

55 TCCAGTGACC CAATTCTAAA GACCATAGCA CCTGCAAGTG ACACAACAAG CAGATTTATT 
ATACATTTAT TAGCCTTAGC AGGCAATAAA CCAAGAATCA CTTTGAAGAC ACAGCAAAAA 
GTGATACACT CCGCAGATCT GAAATAGATG TGTTCTCAGA CAACAAAGTC CCTTCAGAAT 
CTTCATGTTG CATAAATGTT ATGAATATTA ATAAAAAGTT GATTGAGA 

60 

Seq ID No: 91 Protein sequence; 
Protein Accession Ut none found 

1 11 21 31 41 51 

65 I I I I I 1 

YTSIPYTVFQ TNSFAERSFC LSL 



Seq ID NO: 92. DNA sequence 
70 Nucleic Acid Accession #: HM_003706.1 

Coding sequence: 310-1935 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

75 I I I I I I 

CAC6AG6CA6 GGGCO^TTTT ACCTCCAG6T TGQCCCT6CT CAG6ACCAG6 A6QAAACACC 60 
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BNSDOCID: <WO ^02079492A2_I_> 



60 
120 
180 
240 
300 
360 
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480 
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60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



wo 02/079492 



TCCAGCCCGC GACCTCCTCC CACAGGGGGA 
GGCACCGAGG ATCCCCGCAG TCTTCACCCG 
GACTACACCG CTAAGCGCAG GGAGCCCAAG 
CACCGCGCAT GCGCACACGC CCCAGACCCA 
5 CAGTGCACCA_TGGGAAGCTC TGAAGTTTCC 
GCGGCCGTGG AGAGACGAAG ACTTCATGTG 
GCTGATGAGG CCCCAGTTGT TGCTGTGCTG 
GCCTGCCTTG GG6TCCTGAG TGAGAT6AAA 
CTCGCAGGGG TCTCTGGATC CACTTGGGCA 

10 ATGGAAGCTC TCX3AGGCTGA CCTGAAACAT 
AAGAGCCTAC AGAAAACCAT CCAAGCAGCG 
TGGGCCTACA TGGTTATCTC TAAGCAAACC 
AT6AAGAAGC CCGT66AAGA AGGGACACTA 
GACCT6CAAC CTTCCTG6CA 66AGGCAAGA 

15 CACCACGCTG GCTTCTCTGC ACTGGGGGCC 
TTCAAGAAGG GAAGACTGGT CAGAACTCAC 
TTATGGGGAA GTGCTCTTGG T;U\CACTGAA 
AGGAATCTGA CCCTGAAAGG TTTATGGAGA 
CACCTTATTT TTGCCCGATT ACTGAGGCTG 

20 CCAGAAGATG AAGGCGGTGA GCCTGAACAC 
ACCAGGACCT CCCTGGAAAA GCAGGAGCAG 
CTCAGTAACT TGATGGATTT TGTGAAGAAA 
GGGACCACTC ACAACTTCCT GTACAAACAC 
CGGAAGCACC TCCACCTGGT GGATGCTGGT 

25 CTGCCCCCGA CGCGGGAGGT TCACCTCATC 
TTCGAGACCA TCCGGGCTAC CACTGACTAC 
GTAGAAGAGG CTGAGCTGGA TTTGTGGTCC 
GGAGAAACTG GACCAGTGGT GATACATTTT 
GATATTGAGG CATGGA6TGA CACATACGAC 

30 GATGTGGTGG TGCTACTCTT GGCATTAGCC 
ATCCTTAGAG AGTTGATGAA CGTGGCCX3GG 
TGCTGCTTGG CATAGATGAG CCTCAGCTTC 
GGCCCTGAAG TCCACCTGGC CTTCCTGTTC 
CTTGAGTTCA CCTTG6CTGT CCTAACAGGG 

35 TTTGATAGCG TCATTCAGAA GAAGGTGTCC 
CAG6TCCCTC GGGAGATCCT GGAGCTG6AG 
TCCAATGTCC AGATGGCCAG AAT6AATGTG 
TTTATGACTG CACTTCTAGC CAGTAGCTCT 
TGGGCTTAAA TCATGGGCTA TCTCTCCACA 

40 GCTCAATAAA TGCTTGCTGA TTGACTGATG 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 



PCT/IIS02/04915 



AAAGGAAAGC AGGAGGACCA CAGAAGCTTT 120 

CGGAGATTCC GGCTGAAGGA GCTGTCCAGC 180 

CCTCCGCACC GGATTCCGGA GCACAAGCTC 24 0 

GGCTCAGGAG GACTGAGAAT TTTCTGACCG 300 

ATAATTCCTG GGCTCCAGAA AGAAGAT^AAG 360 

CTGAAAGCTC TGAAGAAGCT AAGGATTGAG 420 

GGCTCAGGCG GAGGACTGCG GGCTCACATT 480 

GAACAGGGCC TGTT6GATGC CGTCACGTAC 540 

ATATCTTCTC TCTACACCAA TGATG6TGAC 600 

CGATTTACCC GACAG6AGT6 G6ACTT6GCT 660 

AGGTCTGAGA ATTACTCTCT GACCGACTTC 720 

AGAGAACTGC CGGAGTCTCA TTT6TCCAAT 780 

CCCTACCCAA TATTTGCAGC CATTGACAAT 840 

6CACCAGA6A CCT6GTTCGA GTTCACCCCT 900 

TTTGTTTCCA TAACCCACTT CGGAAGCA7VA 960 

CCTGAGAGAG ACCTGACTTT CCTGAGAGGT 1020 

GTCATTAGGG AATACATTTT TGACCAGTTA 1080 

AGGGCTGTTG CTAATGCTAA AAGCATTGGA 114 0 

CAAGAAAGTT CACAAGGGGA ACATCCTCCC 1200 

ACCTGGCTGA CTGAGATGCT CGAGAATTGG 1260 

CCCCATGAGG ACCCCGAAAG GAAAGGCTCA 1320 

ACAGGCATTT GCGCTTCAAA GTGGGAATGG 1380 

GGTGGCATCC GGGACAAGAT AATGAGCAGC 144 0 

TTAGCCATCA ACACTCCCTT CCCACTCGTG 1500 

CTCTCCTTCG ACTTCAGTGC GGGAGATCCT 1560 

TGCCGCCGCC ACAAGATCCC CTTTCCCCAA 1620 

AAGGCCCCCG CCAGCTGCTA CATCCTGAAA 1680 

CCCCTGTTCA ACATAGATGC CTGTGGAGGT 1740 

ACATTCAAGC TTGCTGACAC CTACACTCTA 1800 

AAGAAGAAT6 TCAGGGAAAA CAAGAAGAAG 1860 

CTCTACTACC CGAAGGATAG TGCCCGAAGT 1920 

CAGGGCACTG TGGGCCTGTT GGTCTACTAG 1980 

TTCACTCCCT TCAGCCACAC GCTTCATGGC 2040 

CCAATCACCA GTGACCAGCT AGACTGTGAT 2100 

AA6GAGCTGA AGGTGGTGAA ATTTGTCCTG 2160 

CAT6AGTGTC TGACAATCAG AAGCATCATG 2220 

ATAGTTCAGA CCAATGCCTT CCACTGCTCC 2280 

6CACAA6TTA GCTCTGTAGA AGTAAGAACT 2340 

GCCAAGTGGA GCTCTGA6AA TACAACAAGT 2400 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 2460 
AAAA 



Seq ID No: 93 Protein sequence: 
45 Protein Accession #: NP_003697.l 



1 11 21 31 41 51 

<n ' ' ' l ' • 

5U M6SSEVSIIP GliQKEEKAAV ERRRI^HVLKA IiKKIiRIEADE APWAVL6SG GGZiRAHIACL 60 

GVLSBMKEQG LLDAVTYIiAG VS6STWAXSS LYTNDGDMEA IiEADLKHRFT RQEWDIiAKSIi 120 

QKTZQAARSE NYSLTDFWAY MVISKQTREIi PBSHLSNMKK PVEEGTLPYP IFAAIDNDLQ 180 

PSWQEARAPE TWFEFTPHHA 6FSALGAFVS ITHF6SKFKK GRLVRTKPER DIiTFLRGLWG 240 

SALGNTEVIR EYIFDQLRNIi TIiKGLWRRAV ANAKSZGHIiI FARLLRLQES SQ6EHPPPED 300 

55 EGGEPEHTWL TEMLENWTRT SI«EKQEQPHE DPERKGSLSH LMDFVKKTGI CASKWEWGTT 360 

HNFLYKH6GI RDKIMSSRKH UiLVDAGU^Z NTPFPLVLPP TREVHLIIjSF DFSAGDPFET 420 

IRATTDYCRR HKIPFPQVEE AEIjDIjWSKAP ASCYZLKGET GPWZHFPXiF NZDACGGDZE 480 

AWSDTYDTFK liADTYTIiDW VIiIjIiALAKKN VRENKKKZIjR EIiMNVAGLYY PKDSARSCCZi .540 
A 



60 



65 



Seq ID NO: 94 DNA sequence 
Nucleic Acid Accession #: AK027351 

Coding sequence: 1-642 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

AGGGAAAAAA ACTCCATTAA AAAGCCCAGC TTTCCTCCAT GTTAGATGTG ACTTGGAAAA 60 

TGAGAAAGAT TTAGCAAAAT TCCACCGTAT CTTTTGCCAG GCTAGAGACA GGGAGAGCAG 120 

70 AGTAAAACCC TCAGGCTGCT GAAATTTCTA GGCTGTTAGG AAGCCCCTCG AATTCTGTGA .180 

AAATGAGGGT TTCTTAACTC ACACTGAGAG CGGAAAGGGG CAGACCCTTT TCATAACTCC 240 

CTCAAGTGTG TGTTACCTTT CTTTACCAGC ATGGTAAGCA ACAGGACATA TCCCAGCCTC 300 

GGACATGTCT GTATGATCCA AGGTACCCAA AGTCAGACAG AGTAAACTCA AGCCTGGCAC 360 

TGGCTTTCTG CCGCTTCATG TGCTTTGGAA AAAGCAGGAG AAGCAATAGC AGCAGGAGTC 420 

75 CCCAGCAGCT GGAGCCGCAA GAATGAACTG CAAAGAGGGA ACTGACAGCA GCTGCGGCTG 480 

CAGGGGCAAC 6ACGAGAAGA AGATGTTGAA GTGTGTGGT6 GTGG6GGACG GT6CCGTGGG 540 
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GAAAACCTGC CTGCTGATGA GCTACGCCAA 
TGTC5TTT6AC CACTATGCAG TTACTGTGAC 
6TATGACACC GC6GGACA6G AGGACTACAA 
GGATGTGTTT TT6ATCTGCT TCTCTGTCGT 
5 GGAATG6GTC CCCGAGCTCA AGGACTGCAT 
CCAGATTGAT CTCCGT6ATG ACCCAAAAAC 
ACCTCTCACT TACGAGCATG GTGTGAAGCT 
GGAATGTTCA GCTCTGACTC AG7UUVGGTCT 
CATTTTCCAC CCCAAGAAAA AGAAGAAACG 

10 TATCTGAGGT TGTCTGGGAC CTGCCTCCAC 
TCTGTGGCCA AGCTCCAGCC AAAAAGGA6G 
GAGGCTTGCC CCATCACCCT CTGAGCCCTC 
ACGACCTCCC TGCCAGCCAG AAGCATCCGT 
GATTGCAGAC AGTGCCGCTG CTGATCGCAT 

15 TTACAAATCC CCAGCTCATG AACGTGAAGC 
AAGAAACTTG ATTCCTCTAT TGCTGGCCTT 
CAATACTAAC CTTTTTTTCT GAATCTGCTG 
TTATTTCAAG AMTGTACTA ATTTCCAGTT 
AGCCTAAAGA CAAGGCATTT TATATTCATT 

20 TTAGAACCAA CACGTACCTC TGAATGCCCG 
GTTTTGGJ\AA TTTACAGAGC CATGATTTTT 
GTTGCAGGTC CACATTTTTG CCAAAGATAC 
TTTTTCCATG TATTGCCACG ACAAACTAAA 
TGTTTTTCAT CCAAGTTGAT TGGGGGAAGA 

25 TGTATTCAGT AAAGTGGACA TTCCTGCTCC 
• • CCTTGATTTC ACTTTCTCTC ATGCCCX3GAT 
ATAAAAGAAA GATCTGAGCA TAAAGATACX3 
TGCCTTAATT TTTCTATTTG CTTCAACTGA 
CTCATTCCAT GTAAATGACA TTTTCCA6TT 

30 TTCCTTACTC ATCCTCCCAA ATGTCTTTGT 
TATCCATTTG TCCCCTGCCC TCCACAATGT 
AAAGCAAAAG CTCCCAGTAA G6AATCCTGT 
CAGGAATTTT TGTATCATAG AGCGAATTAC 
CTTCTAATTA GTCTTAGTTG CTTATAAGTG 

35 TTTTTTTCAG TTGCATGAGC AAAGTGCTTC 
ACTTTCCAGA TTCAAGCTCC CACTGTTGGA 
CTGGAAT7UVG CACTTAAGAA TTGCGTGATA 
CAACACTTAG GGAGGCTGAG 6TGGGTGGGC 
GGGTAATATA GTGAGATCCT GTGTCTCTAT 

40 ACACATACCT GTAGTCCCAG CTACTCAGGA 
AAGGTAAGGC TGCAGTGAGC TGTGACTGTG 
AGAACCTGTC AAAAAAAAAA AAAAAACAAC 
CCATCTAATT GCTAAAGATT TTCTTTCATA 
AGTTTTCAGT CAAAGCTTGA CATTTAGAGA 

45 AATCAACTGT GTATGAACTA TAACTCTGCA 
AATGAACTTT TAGTCCTGTA ATAAATGAAA 
ATAGTTATAT CTTGCTAAGG 6GCCACTCAT 
CAGTATGTTT CCAGGTGT6T GCACTCAACA 
TAATGGGCCA CAGGAAGTAA GTTGATCTT6 

50 ACTGGATAGA GACT6TTGTT AGTGTCTGGG 
CTAATTACTG 7VATAAAACAA TCTAGAACAA 



CGACGCCTTC CC^VGAGGAAT ACGTGCCCAC 600 

TGTGGGAG6C AAGCAACACT TGCTCGGACT 660 

CCAGCTGAGG CCACTCTCCT ACCCCAACAC 720 

AAACCCT6CC TCTTACCACA ATGTCCA6GA 780 

GCCTCACGTG CCTTATGTCC TCATAGGGAC 840 

CTTGGCCCGT TTGCTGTATA TGAAAGAGAA 900 

CGCAAAAGCG ATCG6AGCAC AGTGCTACTT 960 

CAAAGCGGTT TTT6AT6AAG CAATCCTCAC 1020 

CTGTTCTGA6 GGTCACAGCT GCTGTTCAAT 1080 

CCCATCCAGG GATGAGAATG GCAGCCAATC 114 0 

GCACGACCAG AAAGGAACTC CCTTTGCACG 1200 

CCAACACAGC ACACTAGTCA GCCCACTGCC 1260 

ACTGCACGCT GTCTGAGAAT GCTGGGCCTG 1320 

CAAAAACAAA GTCAAAGGCC ATCTCACATT 13 80 

TGATAGGAAA TCACCCCAGG GAACCCGAAA 144 0 

ACTTGATGTC TTTTATAAAA CTTGGGACTA 1500 

TTCTACCCAT GTGTCTCACA TTCATTTGTA 1560 

CACTCAGGCC TTACTAATCC ATACCAAATT 1620 

TCTATTTTCA GCATGTTTCT ACCAAAGCTA 1680 

ATTATAAGAA GACATGAGAA GACTTTAAAA 1740 

GAACCTAATT GAAAGAAAAC CATCTGAATT 1800 

ACTCTATAGA TGCTTAGTAG TGGCCTGATT 1860 

AATGAACTGT GTTTAAGAAT GTAGTATTTC 1920 

ATATGGCAGG ATCCATCTTT TACAGTATTT 1980 

TCCCTTCCCC CATTGCATGC CCTCTTCCTC 2040 

CCTTTTATTC TCCTCCAGTTA TAACCCAGTT 2100 

TGTTTAAAAA TAACTAAAAG TAAAGGAAAG 2160 

AAGTGCTTCT CAGCTCGCCC CATGTAAGTT 2220 

ACAACTGGTA CTGAGATTTT GCCTCTCTCT 2280 

GGGA6CCATA TCAGTGGATA CCAAGCTCTG 2340 

GTGACATAGA ACAGGGACTT TGGCCCTGGG 2400 

GCCCAATGAT GTAAAACAAT TCCAAACATC 2460 

TTCCTATCTT TTCATTAGAG GCTATGAGGA 2520 

CCCTGGAATC ACCCAGGTAG GCACTTAATT 2580 

TTAGTAGTGT GAAATTACAA CAACTTTAAG 2640 

AAAAGCCAGC CTTTCTAATC TCTTCTGCTA 2700 

GCCAGGCACC GTGGCTCATG CCTGTAATCC 2760 

CGCTTGAGCT CAGGAGTTCA AGACCAGCCT 2820 

AAAAAAATTA T^TkAATTAGTC AGTTGTAGTG 2880 

GGCTGAGGTG GAAGGATCAC TTGAGCCCAG 2940 

CCACTACACT CCAGCCTGAG TGACAGAGAA 3 000 

CTACATTTCA AGTACTATTT CCCTTCTCTC 3060 

CGCACACACT CCAGTGACTG GAAAAACGGG 3120 

AAACAAGGAC TTTCTGCCTT TATAAATG6A 3180 

GAGGTTATGA ATTCATCCTT TACAAACAAT 3240 

TGTTATTAGG CAGCTTTGTT GCATGATTGC 3300 

TTCTCACTGA TGTG6ATGAA AAAATG2U3AG 3360 

6GCAAATAGC TCCCGAGGTC ACCACTTCCC 3420 

AT6GGGAGAT CACGTCACCC AGAACCAGCA 3480 

TA6AGCACAG 6CTCCCAGGG GTCTTAAGAG 3540 
AGCAA 



Seq ID No: 95 Protein sequence ; 
Protein Accession #: CAC06611.1 

55 



1 11 21 31 41 51 

I I I I I I 

MNCKEGTDSS CGCRGNDEKK MLKCWVGDG AVGKTCIiLMS Y7UMDAFPEEY VPTVPDHYAV 
60 TVTVGGKQHL LGLYDTAGQE DYNQLRPLSY PNTDVFLICP SWNPASYHN VQEEWVPELK 
DCMPHVPYVL IGTQIDIiRDD PKTLARLLYM KEKPLTYEHG VKIAKAIGAQ CYIiECSALTQ 
KGI.KAVFDEA ILTIPHPKKK KKRCSBGHSC CSII 



60 
120 
180 



65 .Seq ID NO: 96 DNA sequence 

Nucleic Acid Accession #: NM_003654.1 

Coding sequence: 367-1602 (underlined sequences correspond to start and stop codons) 



70 



75 



1 
I 

GGGGAGGGCG 
GTCCCCX3GCG 
CCCGGOGCGT 
TCCCCAGCTG 
GGCTGCC6CA 
CCAGCTTGGA 



11 
I 

CGGGAGGCG6 
ACCCTACTCC 
CCCCGACCAG 
CATTCCCGGA 
CT6GCTGGGA 
6CAGTCCCTC 



21 
I 

AGGATGCCGC 
AGACCCX3AGG 
GTAGCTGGTG 
6GC6CCCTTT 
CT6CCA6CTG 
TTTGACCTCA 



31 
1 

OGCGGCTGCT 
ATGGAGCCGG 
TCACTTCGGT 
CGACCTGGAG 
GGCCTGGAGA 
CCCCTTGGA6 



41 
I 

GCCGCCGCCX3 
CGCTG6GC6C 
6TGGTTGGAA 
GCCGGGTCTG 
CGCTGGT6GC 
AAGCAGCCCC 



51 
I 

CCACCCGCGG 
TGCAGCTGCT 
GAAGACTTTC 
CTGGCCACAG 
TGTGGACTCC 
ATGAAGGTGC 



60 
120 
180 
240 
300 
360 
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10 



15 



20 



25 



30 



35 



CCAGCC ATG C 
CAGTACACGG 
GAGGCCGGGC 
CGCAAGACCC 
CTCTTCAACC 
ACGCTCATCC 
GCCAGCCGCG 
ATCAAGCCGC 
GTCCTCTGCT 
GGGGACTGTG 
GAGC6CAGCC 
CTGGTGGAAG 
ATTCTGGCTT 
GGCACCGGGA 
TTCTCCAACT 
TTGGTGCGCT 
TTCCTGGGCA 
GACCCCACCC 
AAGTGGCGCT 
GTGCTGGCCC 
GTCAGCCTGG 
GGGCGGGAGG 
T7VACCCCTCC 
CTTCTGCCCC 
AGAGGGCGCC 
TCTCTGTGCG 
CGGGCACTCG 
TTTGCACTGT 
TTTATGAATG 
CCXTTTAGAGC 
TTACTGTGAG 
GTCT6AGTCT 
TTGATCTCGG 
CAGTTTGCTA 
A6T6CAATAA 



AATGTTCCTG 
CCATCCGCAC 
TGGCCGAGCG 
ACATCCTCAT 
AGCACCTGGA 
CCCGCTTCAC 
ACCTCCTGCG 
CGCCGGTCAA 
CCCGGCCTGT 
TGCGCAAGTG 
ACGTGGCCAT 
ACCCGCXSATT 
CGCGCAGCGA 
GGAAACCCTA 
CCGTGTCCAC 
ACGAGGACCT 
TCCCGCTGGA 
TGGGCAAGCA 
TCCGCCTCTC 
AGCTGGGCTA 
TGGAGGAGCG 
CGCAAGGTGT 
CTCTCCCACC 
TTTTTTGTCT 
TGAAGTAGGG 
GACGGTGACA 
C6A66CGACT 
CTTACTATTC 
GTGTCCATCC 
A6CGAAACTG 
GTGAACGTG6 
CGTGGCCGCC 
GGTCCATCTG 
AATGAACATT 
TCACC 



GAAGGCCGTC 
CTTCACCGCC 
ACTGTGCGAG 
CCTGGCCACC 
CGTCTTCTAC 
CCAGGGCAAG 
GAGCCTCTAC 
CCACACCACC 
GTGCGACCCT 
CGGGCTACTC 
CAA6ACX3GTG 
AAACCTCAAG 
GACCTTCCGC 
CT^CCTGGAC 
CGGCCTCATG 
GGCTCGGAAC 
CAGCCACGTG 
CT^TACGGC 
CTACGACATC 
CAAGATCGCC 
GGACTTCCGC 
CGGTTTTGAT 
TCATCTTCGT 
CTGAAATTTG 
TCCCGCCCCC 
ATGTTTACAA 
TCTCAAGCTT 
AAGGTAAGAG 
TTTCCCCATC 
CCCCCTCCTG 
ACCTGTTTCT 
CCTGGACCAG 
TGATATTTCT 
GAAATTGAAA 



CTCCTCCTTG 

AAGTCCTTTC 

GAGAGCCCCA 

ACGCGCAGCG 

CTGTTTGAGC 

AGCXCXSGCCG 

GACTGCGACC 

GACAGGATCT 

CCGG6GCCAG 

AACCT6ACCG 

CGCGTGCCCG 

6TCATCCAGC 

GACAC6TACC 

GTGACGCA6C 

CXSGCCCCCGT 

CCTATGAAGA 

GCCCGCTGGA 

ACCGTGCGAA 

GTGGCCTTTG 

GCCTCGGAGG 

CCCTTCTCGT, 

AAAATGGACC 

GTCCTTCCTG 

CACTACGTCT 

CCCACCCCAT 

GCACCACATT 

TTGAATGGGT 

GATACAAACA 

CCTGCCTCCT 

CCCGCCCTTG 

GTTTCCAGTC 

TGATGACTGA 

TTGTGCCAAA 

TGCTTTATCT 



CCCTGGCCTC 
ACACCTGCCC 
CCTTCGCCTA 
GCTCCTCCTT 
CCCTCTACCA 
ACCGGCGGGT 
TCTACTTCCT 
TCC6CCGCGG 
CCGACCTGGT 
TGGCG6CCGA 
AGGT6AACGA 
TGGTCCX3AGA 
GGCTCTGGCG 
TGACCACGGT 
G6CTCAAGGG 
AGACCGAGGA 
TCCAGAACAA 
ACTCGGCGGC 
CCCAGAACGC 
AGGAGCTGAA 
GACCCGGGCG 
GTTTTTAACT 
CCCCCAGCTC 
TGGACGGGAA 
TCAGACACAT 
TACACATCCA 
GAGTGGTCGG 
AGAGGACCAC 
GCCCCT6ACG 
CCTGTCGGTG 
TGTG6TGATG 
TGAATCTTAT 
AAGAAAAAAA 
6TGTTTTCTG 



CATTGCCATC 
CGGGCTGGCA 
CAACCTCTCC 
CGTGGGCCAG 
CGTCCAGAAC 
CATGCTAGGC 
GGAGAACTAC 
GGCCAGCCG6 
CCTGGAGGAG 
GGCGTGCCGC 
CCTGCGCGCC 
CCCCCGCG6C 
GCTCTGGTAC 
GTGCGAGGAC 
CAAGTACATG 
GATCTACGGG 
CACGCGGGGC 
CACGGCCGAG 
CTGCCAGCAG 
GAACCCCTCG 
GTGCGGGTGG 
GTTGCCTTAT 
ACCCCACTCC 
TCACTGGGGC 
GGATGTTGGG 
CACACGCACA 
GTATCTAGTT 
TTGTCTCTAA 
CCCATTTCCC 
AGGCAGGTTT 
CTGTCTGTCT 
GAGCTTCTGA 
AA6AGTGGAT 
TAAATAAAAG 



420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 



Seq ID No: 97 Protein sequence; 
Protein Accession NP 003645.1 



40 



45 



50 



1 

I 

MQCSWKAVLI, 
THILILATTR 
RDLLRSLYDC 
CVRKCGIiLNL 
ASRSETFRDT 
RYEDLARNPM 
RFRLSYDIVA 



11 

I 

LALASIAIQY 
SGSSFVGQLF 
DLYFLENYIK 
TVAAEACRER 
YRLWRliWYGT 
KKTEEIYGFI, 
FAQNACQQVIi 



21 
I 

TAIRTFTAKS 
NQHIiDVFYIiF 
PPPVNHTTDR 
SHVAIKTVRV 
GRKPYNIiDVT 
GIPIiDSHVAR 
AQI3GYKIAAS 



31 
I 

FHTCPGIiAEA 
EPLYHVQNTL 
IFRRGASRVli 
PEVNDLRAIjV 
QLTTVCEDFS 
WXQNNTRGDP 
BBELKNPSVS 



41 

I 

GIiAERLCEES 
IPRFTQGKSP 
CSRPVCDPPG 
EDPRIiNLKVI 
NSVSTGLMRP 
TLGKHKY6TV 
IiVEERDFRPF 



51 
I 

PTFAYNLSRK 
ADRRVMLGAS 
PADliVLEEGD 
QLVRDPRGIIi 
PWLKGKYMLV 
RNSAATAEKW 
S 



60 
120 
180 
240 
300 
360 



55 



60 



65 



70 



75 



Seq ID NO: 98 DNA sequence 

Nucleic Acid Accession #: NM_002852.1 

Coding sequence: 68-1213 (underlined sequences correspond to start and stop codons) 



1 
I 

CTCAAACTCA 
TCCAGCAATG 
GAACTCGGAT 
CCATCCCACT 
GCTCTTCATC 
CGACGTCCTG 
CCTGGCGAGG 
CGAGCTGCTG 
GGCGCAGCGC 
GACGCGAGCC 
TTGTGAAACA 
AGTGAGACCA 
ATTAAACAAA 
GTATCTCAGC 
TGAAGCCATG 
AGGGCTCACA 
AG6TCACATT 
T6TGGGTGGT 
CTGG6ATAGT 



11 
I 

GCTCACTTGA 
CATCTCCTTG 
GATTATGATC 
GAGGACCCCA 
ATGCTGGAGA 
CGGGGCGAGC 
CCGTGCGCGC 
CAGGCGACCC 
CCAGAGGAGG 
GACCTGCACG 
GCTATTTTAT 
ATGAGGCTTG 
ACCATCCTGT 
TACCAATCCA 
GTTTCCCTGG 
TCCTTGTGGG 
GTTCCTGAGG 
GGCTTTGATG 
GTTCTTAGCA 



21 

I 

GAGTCTCCTC 
CGATTCTGTT 
TCATGTATGT 
CGCCGTGCGA 
ACTCGCAGAT 
TGCAGAGGCT 
CGGGGGCTCC 
GCGACGCGGG 
CGGGGCGCGC 
CGGTGCAGGG 
TCCCAATGCG 
AGTCTTTTA6 
TTTCCTATGG 
TAGTGTTTGT 
GAAGGT6GAC 
TAAATGGTGA 
GAG6AATCCT 
AAACATTA6C 
ATGAAGAGAT 



31 
I 

CCGCCAGCTG 
TTGTGCTCTC 
GAATTTGGAC 
CTGCGGTCAG 
GA6AGAGCGC 
GCGGGAGGAG 
CGCAGAGGCC 
CCGCAGGCTG 
CCTGGCCGCG 
CTGGGCTGCC 
TTCCAAGAAG 
TGCCTGCATT 
CACAAAGAG6 
G6TGG6TGGA 
CCACCTGTGC 
ACTGGCX3GCT 
GCAGATTGGC 
CTTCTCTGGG 
AAGAGAGACC 



41 

I 

TGGAAAGAAC 
TGGTCTGCAG 
AACGAAATAG 
GAGCACTCGG 
ATGCTGCTGC 
CTGGGCCGGC 
AGGCTGACCA 
GCGCGTATGG 
GTGCTAGAGG 
CGGAGCTGGC 
ATTTTTGGAA 
TGGGTCAAAG 
AATCCATAT6 
6AGGAGAACA 
66CACCT6GA 
ACCACTGTTG 
CAAGAAAAGA 
AGACTCACAG 
GGAGGA6CAG 



51 
I 

TTTGCGTCTC 
TGTTGGCCGA 
ACAATGGACT 
AATGGGACAA 
AAGCCACGGA 
TCGCGGAAAG 
GTGCTCTGGA 
AGGGCGCGGA 
AGCTGCGGCA 
TGCCGGCAGG 
GCGTGCATCC 
CCACAGATGT 
AAATCCAGCT 
AACTGGTTGC 
ATTCAGAGGA 
AGATQGCCAC 
ATGGCTGCTG 
GCTTCAATAT 
AGTCTTGTCA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



226 
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CATCCGGGGG 7VATATTGTTG GGTGGGGAGT 
GTATGTTTCA TAAA TGTTGT GAAACTCCAC 
AACACATGCC AGTT6GGAA6 GTCTGAAAAC 
TGAAAGA6AG AGTTGAGACC AATCTTTATT 
5 AAGGAAAGAC ATTG6AAAAA QCTTTTGAGG 
TTTCAGTTTA ATGCTGTGTC TCTGTCAGAT 
TGTTGAACAG AG6GACAATT GTTTTACTTT 
G/iATTTTACA TTGGAAGAAT AACAAAATAA 
TATGTACCTT ATTACAAAAA AAAT6ATGAA 
10 ACTATAAATG TAGTTTATGT GTTATAATCG 
AA6TTATATT GCAAAAGGGA TTTGTATTAA 
AAATAAAATA TTTTATAAAA CTAAAAAAAA 



CACAGAGATC CAGCCACATG GAGGAGCTCA 1200 

TTGAAGCCAA AGAAAGAAAC TCACACTTAA 1260 

TCAGTGCATA ATAGGAACAC TTGAGACTAA 1320 

TGTACTGGCC AAATACTGAA TAAACAGTTG 1380 

ATAATGTTAC TAGACTTTAT GCCATGGTGC 1440 

AAACTCTCAA ATAATTAAAA AGGACTGTAT 1500 

TCTTTG6TTA ATTTTGTTTT GGCCAGAGAT 1S60 

GATTTGTTGT CCATTGTTCA TTGTTATTGG 1620 

AACATATTTA TACTACAAG6 TGACTTAACA 1680 

AATGTCACGT TTTTGAGAAG ATAGTCATAT 1740 

TTTAAGACTA TTTTTGTAAA GCTCTACTGT 1800 
AAAAAAA 



15 



Seq ID No: 99 Protein sequence: 
Protein Accession #: NP 002843.1 



20 



25 



MHLLAILFCA 
XMIiEMSQMRE 
LQATRDAGRR 
TAIIiFPMRSK 
SYQSIVFWG 
IVPEGGILQI 
GNIVGWGVTE 



11 
I 

LWSAVIiAENS 
RMLLQATDDV 
LARMEGAEAQ 
KIFGSVHPVR 
GEENKLVAEA 
GQEKNGCCVG 
IQPHGGAQYV 



21 

I 

DDYDIiMYVNL 
LRGELQRLRE 
RPEEAGRAIiA 
PMRIjESFSAC 
MVSLGRWTHI* 
GGFDBTLAFS 
S 



31 

I 

DNEIDNGIiHP 
ELGRIiAESIiA 
AVLEEIjRQTR 
IWVKATDVLN 
CGTWNSEEGL 
GRIiTGFNIWD 



41 
I 

TEDPTPCDCG 
RPCAPGAPAE 
ADLHAVQGWA 
KTILFSYGTK 
TSLWVNGELA 
SVIiSNBEIRE 



51 

I 

QEHSEWDKIiF 
7UUiTSAIJ)EL 
ARSWIjPAGCE 
RNPYEIQLYL 
ATTVEMATGH 
TGGAESCHIR 



60 
120 
180 
240 
300 
360 



30 



Seq ID NO: 100 DNA sequence 

Nucleic Acid Accession #: NM_007351.1 

Coding sec[uence: 72-3758 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

35 1 I I 1 I I 

CTGCTATCAA AAAGGCCATA AGGATTTTGT CCCCAAATTT CACATGAGCT ACCTTGCTTC 60 

AAACTACTGA G ATGA AGGGG GCAAGATTAT TTGTCCTTCT TTCTAGTTTA TGGAGTGGGG 120 

GCATTGGGCT TAACAACAGT AAGCATTCTT GGACTATACC TGAGGATGGG AACTCTCAGA 180 

AGACTATGCC TTCTGCTTCA GTTCCTCCAA ATAAAATACA AAGTTTGCAA ATACTGCCAA 240 

40 CCACTCGGGT CATGTCX3GCG GAGATAGCTA CAACTCCAGA GGCAAGAACT TCTGAAGACA 30 0 

GTCTTCTTAA ATCAACACTG CCTCCCTCAG AAACAAGTGC ACCTGCTGAG GGTGTGAGAA 360 

ATCAAACTCT CACATCCACA GAGAAAGCAG AAGGAGTGGT CAAGTTACAG AATCTTACCC 42 0 

TCCCAACCAA CGCTAGCATC AAGTTCAATC CTGGAGCAGA ATCAGTGGTC CTTTCCAATT 4 80 

CTACACTGAA ATTTCTTCAG AGCTTTGCCA GAAAGTCAAA TGAACAAGCA ACTTCTCTAA 54 0 

45 ACACAGTTGG AGGCACTGGA GGCATTGGAG GCGTTGGAGG CACTGGAGGC GTGGGAAATC 600 

GAGCCCCACG GQAAACATAC CTCAGCCGGG GTGACAGCAG TTCCAGCCAA AGAACTGACT 660 

ACCAAAAATC AAATTTCGAA ACAACTAGAG GAAAGAATTG GTGTGCTTAT GTACATACCA 720 

G6TTATCTCC CACAGTGACA TTGGACAACC AG6TCACTTA T6TCCCAGGT GGGAAAGGAC 780 

CTTGT6GCTG GACCGGTG6A TCCTGTCCTC AGAGATCTCA GAAGATATCC AATCCTGTCT 840 

50 ATAGGATGCA ACATAAAATT GTCACCTCAT TGGATTGGAG GTGCTGTCCT GGATACAGTG 900 

GGCCGAAATG TCAACTAAGA GCCCAGGAAC AGCAAAGTTT GATACACACC AACCAG6CTG 960 

AAAGTCATAC A6CTGTTGGC AGAGGAGTAG CTGAGCA6CA GCA6CAGCAA GGCT6T6GTG 1020 

ACCCA6AAGT 6ATGCAAAAA ATGACTGATC AGGTGAACTA CCAG6CAAT6 AAACTGACTC 1080 

TTCTGCA6AA 6AA6ATTGAC AATATTTCTT TGACT6T6AA TGATGTAA6G AACACTTACT 1140 

55 CCTCCCTAGA A6GAAAAGTC AGCXSAAGATA AAAGCAGAGA ATTTCAATCT CTTCTAAAAG 1200 

GTCTAAAATC CAAAAGCATT AATGTACTGA TAAGAGACAT AGTAAGAGAA CAATTTAAAA 1260 

TTTTTCAAAA TGACATGCAA GAGACTGTAG CACAGCTCTT CAAGACTGTA TCAAGTCTAT 1320 

CAGAGGACCT CGAAAGCACC AGGCAAATAA TTCAAAAAGT TAATGAATCT GTGGTTTCAA 1380 

TAGCAGCCCA GCAAAAGTTT GTTTTGGTGC AAGAGAATCG GCCCACTTTG ACTGATATAG 144 0 

60 TGGAACTAAG GAATCACATT GTGAATGTAA GGCAAGAAAT GACTCTTACA TGTGAGAAGC 1500 

CTATTAAAGA ACTAGAAGTA AAGCAGACTC ATTTAGAAGG TGCTCTAGAA CAGGAACACT 1560 

CAAGAAGCAT TCTGTATTAT GAATCCCTCA ATAAAACTCT TTCTAAATTG AAGGAAGTAC 1620 

ATGAGCAGCT TTTATCAACT GAACAGGTAT CAGACCAGAA GAATGCTCCA GCTGCTGAGT 1680 

CAGTTAGCAA TAATGTCACT GAGTACATGT CTACTTTACA TGAAAATATA AAGAAGCAGA 174 0 

65 GTTTGATGAT GCTGCAAATG TTTGAAGATT TGCACATTCA AGAAAGCAAG ATTAACAATC 1800 

TCACCGTCTC TTTGGAGATG GAGAAAGAGT CTCTCAGAGG TGAATGTGAA GACATGTTAT 1860 

CCAAATGCAG AAATGATTTT AAATTTCAAC TTAAGGACAC AGAAGAGAAT TTACATGTGT 1920 

TAAATCAAAC ATTGGCTGAA GTTCTCTTTC CAATGGACAA TAAGATGGAC AAAATGAGTG 1980 

AGCAACTAAA TGATTT6ACT TATGATATGG A6ATCCTTCA ACCCTTGCTT 6A6CAGGGAG 2040 

70 CATCACTCAG ACAGACAATG ACATATGAAC AACCAAA6GA A6CAATAGTG ATAAG6AAAA 2100 

AGATAGAAAA TCTGACTA6T 6CT6TCAATA GTCTAAATTT TATTATCAAA GAACTTACTU^ 2160 

AAAGACACAA CTTACTTAGA AATGAAGTAC AGGGTC6TGA TGAT6CCTTA GAAAGACGTA 2220 

TCAATGAATA TGCCTTAGAA ATGGAAGAT6 6CCTCAATAA GACAATGACT ATTATAAATA 2280 

ATGCTATTGA TTTCATTCAA GATAACTATG CCCTAAAAGA GACTTTAAGT ACTATTAAG6 2340 

75 ATAATAGTGA GATCCATCAT AAATGTACCT CCGATATGGA AACTATTTTG ACATTTATTC 2400 

CTCAGTTCCA CCGTCTGAAT GATTCTATTC AGACTTT6GT CAAT6ACAAT CAGAGATATA 2460 



227 
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10 



15 



20 



25 



30 



ACTTTGTTTT 
AGTCCAACTT 
ACCAGC7VAAA 
ATTTTGAGAC 
ATATTTCAGT 
AAGTATTAAA 
TCTTTTCGCT 
GTGTGTCAGA 
AACTTCTTCA 
CTGCCCTATC 
TTGTCAAGTC 
CAACGGTAAA 
TATATCCTGA 
TAAATGGAAG 
CTATCAAGCT 
ATGCACCCAT 
TCCTGTTTAA 
TTAGAATTCC 
ATATTTCTGG 
TTAACAGTGA 
ATGGGCAGGA 
TTACTACATT 
CACCTTTATT 
TTGGTTTTTC 
TCTATTTTAT 
CTAAAG7UU\T 
TTATTTCTTC 
ATATTATCAG 
TAAATATATA 



GCAAGTCGCC 
CCAAAAGATG 
TATGAGTCAT 
TCGGTTGCAA 
TAAAAAAG6C 
TTCCAGATTT 
TAACAAAACT 
ACTGAATGCT 
GAAAGGTCTA 
TAATTCAACT 
TCAGAAGCAA 
TCTTACCACA 
GGAGTATTCA 
T^CTAGCTTT 
TGTGGAAGAA 
GGTGGCATTT 
TAACTTGGAT 
GTATCTTGGA 
ATTTTTAGTG 
MTACACTGT 
AGTCTGGTTA 
TAGTGGCTAT 
GAGAAACAGC 
TACAGGAT^T 
AAAATTATTT 
TTAGTGGCAC 
ATTTTAAGTC 
TCACAGTTTT 
ACACACATTT 



AAGACCCTTG 
TATCAAATGT 
TTGGAAGAAA 
GACATTGAGT 
AGTGTAGTTA 
AA6GCGTT66 
CTCCAC6AAG 
ACCATCCCTA 
ACAGAATTTG 
TGTTGTATAG 
GTAAAATCAT 
GTCCTGATAG 
AGCTGTAGTC 
ACCTGTGCCT 
AATGCTTTAG 
TTTGCATCTC 
GTCAATTATG 
GTATATGTTT 
GTTGATGGAA 
GATAGGGTTT 
CGACTTGCAA 
TTATTATATC 
CAGTGTTTTC 
GAAAATCAAC 
GAATATTGTT 
AGAAAACAAA 
ATTGCAAT6G 
CTTTCCAATT 
TCTAGATTCA 



CAGGTATTCC 
TCAATGAAAC 
AACTACTCTT 
CTAAA6TTAC 
CAAAT6A6AG 
AA6CA7VAATC 
TTTTAACAAT 
A6TG6ATAAA 
TGGAACCAAT 
ATCGATC6TT 
TGCCAAA6AA 
GCCGGACTCA 
GGCATCCGTG 
GCAGACATCC 
CTCCAGATTT 
ATACGTATGG 
GAGCTTCATA 
TCAAGTACAC 
TAGACAAGCT 
TAACTGGGGA 
AAGGAACAAT 
GTACATAAGT 
ATTTATCTTT 
TTGTTTTTTT 
TAATGTCTGA 
GTGAATTTGT 
AAA6TAATAT 
AAACACTTAA 
CAAATTTAAA 



CAGAGATGAG 
CACTTCCCAA 
AACTACCAAG 
CCAGACGCTC 
AGATCAGGCT 
TATCCATCTT 
GTGTCACAAT 
ACATTCCCT6 
AATTCAAATA 
GCCTGGTAGT 
AATTAACGCA 
AAGAAACACG 
CCAAT^TGGG 
TTTTACTGGT 
TTCCAAAGGA 
AATGACTATA 
TACCCCAAGA 
CATCGAGTCA 
TGCATTTGAG 
TGCCTTATTA 
TCCAGCCAAG 
TAGTATGAAA 
GCTTGCACAT 
AATATGAGTA 
ATATGAAAGA 
TAGCATAATT 
7ATAAAACG6 
CTTTTGTTAT 
TAAATTACTC 



AAACTAAATC 
GTGAGAAAAT 
ATTTCCAAAA 
ATACCTTATT 
CTTCAACTGC 
TCAATTAACT 
GCTTCTACAA 
CCAGATATTC 
AAAACTCAA6 
CT6GCAAAT6 
CTTAAGATUIC 
GACAACATAA 
6GCAC6TGCA 
GACAACTGCA 
TCTTACAGAT 
CCTGGTCCTA 
ACTGGAAAAT 
TTTAGTGCTC 
TCTGAAAATA 
GAATTAT^TT 
TTTCCCCCTG 
AACAGACTAT 
CTGCTCTGTT 
AACTTGTATG 
GTTCTTGATC 
ATTCCTATTC 
TAATTACAAC 
TCCCT6TATA 
AAAAAATG 



2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 



Seq ID No: 101 Protein sequence! 
Protein Accession #: NP 031377.1 



35 



40 



45 



50 



55 



1 

I 

MKGARLFVL,!, 
MSAEIATTPE 
ASIKFNPGAE 
ETYLSRGDSS 
TGGSCPQRSQ 
AVGRGVAEQQ 
GKVSEDKSRE 
ESTRQIIQKV 
LEVKQTHIjEG 
NVTEYMSTIiH 
NDFKFQIiKDT 
QTMTYEQPKE 
ALEMEDGIiNK 
RLNDSIQTIjV 
MSHI^EEKLLL 
SRFKALEAKS 
KGIjTEFVEPI 
LTTVblGRTQ 
VEENAIiAPDF 
YLGVYVFKYT 
VWLRLAKGTI 



11 

I 

SSLWSGGIGIi 
ARTSEDSLLK 
SWLSNSTLK 
SSQRTDYQKS 
KISNPVYRMQ 
QQQGCGDPEV 
FQSIiIiKGIiKS 
NESWSIAAQ 
ALEQEHSRSI 
ENIKKQSLMM 
EENLHVIiNQT 
AIVIRKKIEN 
TMTIINNAID 
NDNQRVMFVIi 
TTKISKNFET 
IHIiSINFFSL 
IQIKTQAAIjS 
RNTDNIIYPE 
SKGSYRYAPM 
lESFSAHISG 
PAKFPPVTTF 



21 

1 

NNSKHSWTIP 
STIiPPSETSA 
FLQSFARKSN 
NFETTRGKNW 
HKIVTSLDWR 
MQKMTDQVNY 
KSINVLIRDI 
QKFVLVQENR 
LYYESLNKTL 
LQMFEDIiHIQ 
LAEVLFPMDN 
LTSAVNSLNP 
FZQDNYALKE 
QVAKTLAGIP 
RIiQDIESKVT 
NKTLHEVLTM 
NSTCCIDRSL 
BYSSCSRHPC 
VAFFASHTYG 
FLWDGIDKL 
SGYtiLYRT 



31 
I 

EDGNSQKTHP 
PAEGVRNQTL 
EQATSIiNTVG 
CAYVHTRLSP 
CCPGYSGPKC 
QAMKLTLLQK 
VREQFKIFQN' 
PTLTDIVEIiR 
SKIiKEVHEQL 
ESKINNIiTVS 
KMDKMSEQIJI 
IIKEIiTKRHN 
TLSTIKDNSE 
RDEKIiNQSNF 
QTIflPYYISV 
CHNASTSVSE 
PGSIiANWKS 
QNG6TCINGR 
MTIPGPII^ 
AFESENZNSE 



41 
I 

SASVPPNKZQ 
TSTEKAEGW 
GTGGIGGVGG 
TVTIiDNQVTY 
QIiRAQEQQSI* 
KIDNISLTVN 
DMQETVAQLP 
NHIVNVRQEM 
LSTBQVSDQK 
liEMEKESXiRG 
DliTYDMEILQ 
LLRNEVQGRD 
IHHKCTSDME 
QKMYQMFNET 
KKGSWTNER 
UiATZPKWIK 
QKQVKSLPKK 
TSPTCACRHP 
NLDVNYGASY 
IHCDRVLTGD 



51 
I 

SI#QILPTTRV 
KLQNLTIiPTN 
TGGVGNRAPR 
VPGGKGPCGW 
IHTNQAESHT 
DVRNTYSSLB 
KTVSSLSEDL 
TliTCEKPIKE 
NAPAABSVSN 
ECEDMIiSKCR 
PIiIiEQGASIiR 
DALERRINEY 
TIIiTFIPQFH 
TSQVRKYQQN 
DQALQLQVLN 
HSLPDIQIjLQ 
INALKKPTVN 
FTGDNCTIKIj 
TPRTGKFRIP 
AIiIiELNYGQE 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 



Seq ID NO: 102 DNA sequence 
Nucleic Acid Accession #: NM_000873.2 
60 Coding sequence: 57-884 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

03 ATCTCCCTCC AGGCAGCCCT TGGCTGGTCC CTGCGAGCCC GTGGAGACTG CCAGAG ATGT 60 

CCTCTTTCGG TTACAGGACC CTGACTGTGG CCCTCTTCAC CCTGATCTGC TGTCCAGGAT 120 

C6GATGAGAA GGTATTCGAG 6TACACGTGA GGCCAAAGAA GCTGGCGGTT GAGCCCAAAG 180 

GGTCCCTCGA GGTCAACTGC AGCACCACCT GTAACCAGCC TGAAGTG66T GGTCTGGA6A 240 

CCTCTCTAAA TAAGATTCTG CTG6ACGAAC A6GCTCAGTG GAAACATTAC TTGGTCTCAA 300 

70 ACATCTCCCA TGACAC6GTC CTCCAATGCC ACTTCACCTG CTCCGGGAAG CAG6AGTCAA 360 

TGAATTCCAA CGTCAGCGTG TACCA6CCTC CAAGGCAGGT CZATCCTGACA CTGCAACCCA 420 

CTTT6GT6GC TGTG66CAAG TCCTTCACCA TT6AGT6CAG GGTGCCCACC 6TGGAGCCCC 480 

TGGACAGCCT CACCCTCTTC CTGTTCCGTG GCAATGAGAC TCTGCACTAT GAGACCTTCG 540 

6GAAGGC71GC CCCTGCTCCQ CAGGA6GCCA CAGCCACATT CAACA6CAC6 GCTGACAGA6 600 

75 AGGATGGCCA CC6CAACTTC TCCTGCXrTGG CTGTGCTGGA CTT6ATGTCT CGCGGTGGCA 660 

ACATCTTTCA CAAACACTCA GCCCC6AAGA T6TT6GA6AT CTATGAGCCT 6TGTCG6ACA 720 
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GCCAGAT66T 
TCCTGCTCTG 
TGCGAGCGGC 
GGCCACCACC 
TGGCTGAAGG 
CAAACACCTG 



CATCATAGTC 
CTTCATCTTC 
TTGGAGGAGG 
ACG6T6GTCA 
ACTGTGACAG 
GACTT 



ACGGTGGTGT CGGTGTTGCT GTCCCTGTTC GTGACATCTG 780 

GGCCAGCACT TGCGCCAGCA GCX3GATGGGC ACCTACGGG6 840 

CTGCCCCAGG CCTTCCX3GCC ATAG CAACCA TGAGTGGCAT 900 

CTGGAACTCA GTGTGACTCC TCAGGGTTGA GGTCCAGCCC 960 

GCAGCAGAGA CTTGGGACAT TGCCTTTTCT AGCCCGAATA 1020 



10 



Seq ID No: 103 Protein aecatuence; 
Protein Accession U: NP 000864.1 



15 



20 



1 11 

I I 

MSSFGYRTLT VALFTLICCP 

ETSXiNKILLD EQAQWKHYIiV 

PTLVAVGKSF TIECRVPTVE 

REDGHRNFSC IiAVIiDIiMSRG 

SVIiLCFIFGQ HLRQQRMGTY 



21 31 41 51 

I I I i 

GSDEKVFEVH VRPKKLAVEP KGSLEVNCST TC^TQPEVGGIj 60 
SNISHDTVIiQ CHFTCSGKQE SMNSNVSVYQ PPRQVILTLQ 120 
PLDSLTIiFIiF RGNETI*HYET FGKAAPAPQE ATATFNSTAD 180 
GNIPHKHSAP KMLEIYEPVS DSQMVIIVTV VSVUjSLPVT 240 
6VRAAWRRLP QAFRP 



Seq ID NO: 104 DMA sequence 

Nucleic Acid Accession #: NM_001795.2 

Coding sequence: 121-2475 (underlined sequences correspond to start and stop codons) 



25 1 11 21 31 41 51 

I I i I I i 

GACGGTCGGC TGACAGGCTC CACAGT^CTC CACTCACGCT CAGGCCCTG6 ACGGACAG6C 60 

AGTCCAAOGG AACAGAAACA TCCCTCAGCC CCACAGGCAC GATCTGTTCC TCCTGGGAAG 120 

ATGC AGAGGC TCATGATGCT CCTCGCCACA TCGGGCGCCT GCCTGGGCCT GCTGGCAGTG 180 

30 GCAGCAGTGG CAGCAGCAGG TGCTAACCCT GCCCAACGGG ACACCCACAG CCTGCTGCCC 240 

ACCCACCGGC GCCAAAAGAG AGATTGGATT TGGAACCAGA TGCACATTGA TGAAGAGAAA 300 

AACACCTCAC TTCCCCATCA TGTAGGCAAG ATCAAGTCAA GCGTGAGTCG CAAGAATGCC 360 

AAGTACCTGC TCAAAGGAGA ATATGTGGGC AAGGTCTTCC GGGTCGATGC AGAGACAGGA 420 

GACGTGTTCG CCATTGAGAG GCTGGACCGG GAGAATATCT CAGAGTACCA CCTCACTGCT 4 80 

35 GTCATTGTGG ACAAGGACAC TGGTGAAAAC CTGGAGACTC CTTCCAGCTT CACCATCAAA 540 

GTTCATGACG TGAACGACAA CTGGCCTGTG TTCACGCATC GGTTGTTCAA TGCGTCCGTG 600 

CCTGAGTCGT CGGCTGTGGG GACCTCAGTC ATCTCTGTGA CAGCAGTGGA TGCAGACGAC 660 

CCCACTGTGG GAGACCACGC CTCTGTCATG TACCAAATCC TGAAGGGGAA AGAGTATTTT 720 

GCCATCGATA ATTCTGGACG TATTATCACA AT7VACGAAAA GCTTGGACCG AGAGAAGCAG 780 

40 GCCAGGTATG AGATCGTGGT GGAAGCGCGA GATGCCCAGG GCCTCCGGGG GGACTCGGGC 840 

ACGGCCACCG TGCTGGTCAC TCTGCAAGAC ATCAATGACA ACTTCCCCTT CTTCACCCAG 900 

ACCAAGTACA CATTTGTCGT GCCTGAAGAC ACCCOTGTGG GCACCTCTGT GGGCTCTCTG 960 

TTTGTTGAGG ACCCAGATGA GCCCC3VGAAC 066ATGACCA AGTACAGCAT CTTGCGGGGC 1020 

GACTACCAGG ACGCTTTCAC CATT6AGACA AACCCCGCCC ACAACGA66G CATCATCAAG 1080 

45 CCCATGMGC CTCTGGATTA TGAATACATC CAGCAATACA GCTTCATCGT CGAGGCCACA 1140 

6ACCCCACCA TCGACCTCCG ATACAT6A6C CCTCCCX3CG6 GAAACAGAGC CCAGGTCATT 1200 

ATCAACATCA CAGATGTGGA CGA6CCCCCC ATTTTCCAGC AGCCTTTCTA CCACTTCCAG 1260 

CTGAAGGAAA ACCAGAAGAA GCCTCTGATT GGCACAGTQC TGGCCATG6A CCCTGATGCG 1320 

GCTAGGCATA GCATTGGATA CTCCATCCGC AGGACCAGTG ACAAGGGCCA GTTCTTCCGA 1380 

50 GTCACAAAAA AGGGGGACAT TTACAATGAG AAAGAACTGG ACAGAGAAGT CTACCCCTGG 1440 

TATAACCTGA CTGTGGAGGC CAAAGAACTG GATTCCACTG GAACCCCCAC AGGAAAAGAA 1500 

TCCATTGTGC AAGTCCACAT TGAAGTTTTG GATGAGAATG ACAATGCCCC GGAGTTTGCC 1560 

AAGCCCTACC AGCCCAAAGT GTGTGAGAAC GCTGTCCATG 6CCAGCTGGT CCTGCAGATC 1620 

TCCGCAATAG ACAAGGACAT AACACCACGA AACGTGAAGT TCAAATTCAC CTTGAATACT 1680 

55 GAGAACAACT TTACCCTCAC GGATAATCAC GATAACACGG CCAACATCAC AGTCAAGTAT 174 0 

GGGCAGTTTG ACCGGGAGCA TACCAAGGTC CACTTCCTAC CCGTGGTCAT CTCAGACAAT 1800 

" GGGATGCCAA GTCGCACGGG CACCAGCACG CTGACCGTGG CCGTGTGCAA GTGCAACGAG 1860 

CAGGGCGAGT TCACCTTCTG CGAG6ATATG GCCGCCCAG6 TGGGCGTGAG CATCCAGGCA 1920 

GTGGTAGCCA TCTTACTCTG CATCCTCACC ATCACAGTGA TCACCCTGCT CATCTTCCTG 1980 

60 CGGCGGCGGC TCCGGAAGCA GGCCCGCGCG CACGGCAAGA GCGTGCCGGA GATCCACGAG 204 0 

CAGCTGGTCA CCTACGACGA GGAGGGCGGC GGCGAGATGG ACACCACCAG CTACGATGTG 2100 

TCGGTGCTCA ACTCGGTGCG CCGCGGCGGG GCCAAGCCCC CGCGGCCCGC GCTGGACGCC 2160 

C6GCCTTCCC TCTAT6CGCA GGTGCAGAAG CCACCGAGGC ACGCGCCTGG GGCACACGGA 2220 

GGGCCCGGG6 AGATGGCAGC CAT6ATCGAG GT6AAGAAG6 ACGAGGCGGA CCACX3ACGGC 2280 

65 GACGGCCCCC CCTACGACAC GCTGCACATC TACGGCTACG AGGGCTCX:6A GTCCATAGCX! 2340 

GAGTCCCTCA GCTCCCTGGG CACCGACTCA TCCGACTCTG ACGTGGATTA CGACTTCCTT 2400 

AAC6ACTGGG GACXrCAGGTT TAA6ATGCTG GCTGAGCTGT AC66CTCGGA CCCCCX3GGAG 2460 

GAGCTGCTGT ATJAgpCGGC CGAGGTCACT CTGGGCCTGG. GGACCCAAAC CCCCTGCAGC 2520 

CCAGGCCAGT CAGACGCCAG GCACCACAGC CTCCAAAAAT GGCAGTGACT CCCCAGCCCA 2580 

70 GCACCCCTTC CTCGTGGGTC CCAGAGACCT CATCAGCCTT GGGATAGCAA ACTCCAGGTT 2640 

CCTGAAATAT CCAGGAATAT ATGTCAGTGA TGACTATTCT CAAATGCTGG CAAATCCAGG 2700 

CTGGTGTTCT. 6TCTGGGCTC AGACATCCAC ATAACCCT6T CACCCACAGA CCGCCGTCTA 2760 

ACTCAAAGAC TTCCTCTGGC TCCCCAAGGC TGCAAAGCAA AACAGACTGT GTTTAACTGC 2 820 

TGCAGGGTCT TTTTCTAGGG TCCCTGAACG CCCTGGTAAG GCTGGTGAGG TCCTGGTGCC 2880 

75 TATCTGCCTG GAGGCAAAGG CCTGGACAGC TTGACTTGTG GGGCAG6ATT CTCTGCAGCC 2940 

CATTCCCAAG GGAGACTGAC CATCATGCCC TCTCTCGGGA GCCCTAGCCC T6CTCCAACT 3000 
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CCATACTCCA CTCCAAGTGC CCCACCACTC 
AGGAAGGGGC CCCATGGCAG CTCCTGACCT 
ATGCCAGTAA CTGTGCTGTA CTGAGCACTG 
TTTGAAGCAA CTGTGAATTC ATTCTGGAGG 
5 GGGTGAGGGC CACCTCCACA CCCACCCCCT 
GCTTTGAGAC TCCTCAGCAC CCCTCCAGTT 
CAGAAGACGT CTCCCCTTCT CTGCCTCACC 
TCTGTCTACT CCTTATCCCT TGGTTTAGAG 
GACAATGTCC AAACCCACTC ATGACTGCAT 

10 CGCTGTTGTC ACATCTCAGG GAACTGACCC 
GCCCTGCCCA ACCTCTGTGG TCACCCATGC 
CACCTTGGAG AAGTGGCATC AGTCAACAGA 
CCTTCX3TCAT GGACC6AGGT TCCCACTCTG 
AGATAACACT GACTTGTTTG TTTTAACCAA 

15 AATGATACTT AC/^GTTTCT AGCTCTCACA 
AGCAGGTTGT TATTTAGGTT 7UVCAATATTA 
CTGTAACCTT CTATTTTCTA TAATTGTAGT 
GCCAAACTGG TGCATGACAA GTACTGTATT 
GCCTGGGCAA CAAAAAAA 

20 

Seq ID No; 105 Protein geouence t 
Protein Accession #: NP__001786.1 



CCCAACCCCT CTCCAGGCCT GTCAAGAGGG 3060 

TGGGTCCTGA AGTGACCTCA CTGGCCTGCC 3120 

AACCACATTC AGGGAAATGG CTTATTAAAC 3180 

GGCAGTGGAG ATCAGGAGTG ACAGATCACA 3240 

CTGGAGAAGG CCTGGAAGAG CT6AGACCTT 3300 

TTGCCTGAGA AGGGGCAGAT GTTCCCGGAG 3360 

TGGTCGCCAA TCCATGCTCT CTTTCTTTTC 3420 

GAACCCAAGA TGT6GCCTTT AGCAAAACTG 3480 

GACGGAGCCG AGCCATGTGT CTTTACACCT 3540 

TCAGGCACAC CTT6CAGAA6 6CAAG6CCCT 3600 

ATCTTCCACT GGAACGTTTC ACTGCAAACA 3660 

GA6GG6CAGG GAA6GAGACA CCAAGCTCAC 3720 

GGCAAAGCCC CTCACACTGC AAGGGATTGT 3780 

TAACTAGCTT CTTATAATGA TTTTTTTACT 3840 

GACATATAGA ATAAGGGTTT TTGCATAATA 3900 

ATTCAGGTTT TTTAGTTGGA AAAACAATTC 3960 

AATTGCTCTA CAGATAATGT CTATATATTG 4020 

TTTTTATACC TAAATT^AAGA AAAATCTTTA 4080 



25 1 11 21 31 41 51 

I I I I I I 

MQRLMMIiLAT SGACUSIiLiAV AAVAAAGANP AQRDTHSLLP THRRQKRDWI WNQMHIDEEK 60 

NTSliPHHVGK IKSSVSRKNA KYLUCGEYVG KVFRVDAETG DVFAIERIiDR ENISEYHLTA 120 

VIVDKDTGEN LETPSSFTIK VHDVNDNWPV FTHRLFNASV PESSAVGTSV ISVTAVDADD 180 

30 PTVGDHASVM YQIIiKGKEYF AIDNSGRIIT ITKSLDREKQ ARYEIWEAR DAQGIiRGDSG 240 

TATVLVTLQD INDNFPFFTQ TKYTFWPED TRVGTSVGSIt FVEDPDBPQN RMTKYSILRG 300 

DYQDAFTIET NPAHNEGIIK PMKPLDYEYI QQYSFIVEAT DPTIDLRYNS PPAGNRAQVI 360 

INZTDVDEPP IFQQPFYHFQ LKENQKKPLI GTVLAMDPDA ARHSIGYSIR RTSDKGQFFR 420 

VTKKGDIYNE KELDREVYPH YNLTVEAKEIi DSTGTPTGKE SIVQVHIEVIj DENDMAPEFA 480 

35 KPYQPKVCEN AVHGQI.VI,QI SAIDKDITPR NVKFKFTLNT ENNFTLTDNH DNTANITVKY 540 

GQFDREHTKV HFIiPWZSDN GMPSRTGTST LTVAVCKCNE QGEFTFCBDM AAQVGVSIQA 600 

WAILLCILT ITVITIiIilFL RRRIjRKQARA HGKSVPEIHE QLVTYDBEGG GEMDTTSYDV 660 

SVIiNSVRRGG AKPPRPALDA RPSLYAQVQK PPRHAPGAHG GPGEMAAMIB VKKDBADHDG 720 

DGPPYDTLHI YGYE6SESIA ESLSSLGTDS SDSDVDYDFL HDWGPRFKMIj AELYGSDPRE 780 

40 EIiLY 



45 



Seq ID NO: 106 DNA fleoruence 

Nucleic Acid Accession # : none found 

Coding sequence: 1-474 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

ACAGTACTCT GTGCAAAAAA CCTGGTGAAA AAGGATTTTT TCCGACTTCC TGATCCATTT 60 

GCTAAGGTGG TGGTTGATGG ATCTGGGCAA TGCCATTCTA CAGATACTGT 6AAGAATACG 120 

50 CTTGATCCAA AGTGGAATCA GCATTATGAC CTGTATATTG GAAAGTCTQA TTCAGTTACG 180 

ATCAGTGTAT GGAATCACAA GAAGATCCAT AA6AAACAA6 GT6CTG6ATT TCTCGGTTGT 240 

GTTCGTCTTC TTTCCAATGC CATCAACCGC CTCAAAGACA CTGGTTATCA 6AGGTTGGAT 300 

TTATGCAAAC TCGGGCCAAA TGACAATGAT ACA6TTAGA6 GACA6ATAGT AGTAAGTCTT 360 

CAGTCCAGAG ACCGAATAGG CACAGGAGGA CAAGTTGTGG ACTGCS^GTCG TTTATTTGAT 420 

55 AACGATTTAC CAGAC6GAGC TCATTATTTG TG6ACTTGGA AAGATA6ATG TTAAT GACTG 480 

6AAG6TAAAC ACCCGGTTAA AACACTGTAC ACCA6ACAGC AACATTGTCA AATGGTTCTG 540 

GA7VA6CTGTG GAGTTTTTTG ATGAAGAGC6 ACGAGCAAGA TTGCTTCA6T TTGTGACAGG 600 

ATCCTCTCGA GTGCCTCTGC AGGGCTTCAA AGCATTGCAA GGTGCTGCAG GCCCGAGACT 660 

CTTTACCATA CACCAGATTG ATGCCTGCAC TAACAACCTG CGGAAAGCCC ACACTTGCTT 720 

60 CAATCGAATA GACATTCCAC CCTATGAAAG CTATGAAAAG CTATATGAAA AGCTGCTAAC 780 

AGCCATTG7U\ GAAACATGTG GATTT6CT6T GGAATGACAA GCTTCAAGGA TTTACCCAGG 840 
AC 

Seq ID No: 107 Protein sequence; 
65 Protein Accession #: none found 



1 11 21 31 41 51 

«^ I I I I I I 

/U TVIjCAKNLVK KDPFRIjPDPF AKWVDGSGQ CHSTDTVKNT IiOPKMNQHYD IiYIGKSDSVT 60 
ISVWNHKKIH KKQGAGFLGC VRUiSNAINR IiKDTGYQRI«D LCKLGPNDND TVR6QIWSL 120 
QSRDRI6TGG QWDCSRLFD NDLPDGAHYL WTWKDRC 

Seq ID NO: 108 DNA sequence 
75 Nucleic Acid Accession #: NM_002318.1 

Coding sequence: 248-2572 (tinderlined sequences correspond to start and stop codons) 
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5 ACTCCAGCGC GCGGCTACCT ACGCTTGGTG CTT6CTTTCT CCAGCCATCG GAGACCAGAG 60 

CCGCCCCCTC TGCTCGAGAA AGGGGCTCAG CGGCGGCGGA AGCGGAGGGG GACCACCGTG 120 

GAGAGCGCGG TCCCAGCCCG GCCACTGCGG ATCCCTGAAA CCAAAAAGCT CCTGCTGCTT 180 

CTGTACCCCG CCTGTCCCTC CCAGCTGCGC AGG6CCCCTT CGTGGGATCA TCA6CCCGAA 240 

GACAGGG ATG GAGAGGCCTC TGTGCTCCCA CCTCTGCAGC TGCCTGGCTA TGCTGGCCCT 300 

10 CCTGTCCCCC CTGAGCCTGG CACAGTATGA CAGCTGGCCC CATTACCCCG AGTACTTCCA 360 

GCAACCGGCT CCTGAGTATC ACCAGCCCCA GGCCCCCGCC AACGTGGCCA AGATTCAGCT 420 

GCGCCTGGCT GGGCAGAAGA GGAA6CACAG CGAGGGCCGG GTGGAGGTGT ACTATGATGG 480 

CCAGTGGGGC ACCGTGTGCG AT6ACGACTT CTCCATCCAC GCTGCCCACG TCGTCTGCCG 540 

GGAGCTGGGC TATGTGGAGG CCAAGTCCTG GACTGCCAGC TCCTCCTACG GCAAGGGAGA 600 

15 AGGGCCCATC TGGTTAGACA ATCTCCACTG TACTGGC7AC GAGGCGACCC TTGCAGCATG 660 

CACCTCCAAT GGCTGGGGCG TCACTGACTG CAAGCACACG GAGGATGTCG GTGTGGTGTG 720 

CAGCGACAAA AGGATTCCTG GGTTCAAATT TGACAATTCG TTGATCAACC AGATAGAGAA 780 

CCTGAATATC CAGGTGGAGG ACATTCGGAT TCGAGCCATC CTCTCAACCT ACCGCAAGCG 840 

CACCCCAGTG . ATGGAGGGCT ACGTGGAGGT GAAGGAGGGC AAGACCTGGA AGCAGATCTG 900 

20 TGACAAGCAC TGGACGGCCA AGAATTCCCG CGTGGTCTGC GGCATGTTTG GCTTCCCTGG 960 

GGAGAGGACA TACAATACCA AAGTGTACAA AATGTTTGCC TCACGGAGGA AGCAGCGCTA 1020 

CTGGCCATTC TCCATGGACT GCACCGGCAC AGAGGCCCAC ATCTCCAGCT GCAAGCTGGG 1080 

CCCCCAGGTG TCACTGGACC CCATGAAGAA TGTCACCTGC GAGAATGGGC TGCCGGCCGT 1140 

GGTGAGTTGT GTGCCTGGGC AGGTCTTCAG CCCTGACGGA CCCTCGAGAT TCCGGAAAGC 1200 

25 ATACAAGCCA GAGCAACCCC TGGTGCGACT GAGAGGCGGT GCCTACATOG GGGAGGGCCG 1260 

CGTGGAGGTG CTCAAAAATG GAGAATGGGG GACCGTCTGC GACGACAAGT GGGACCTGGT 1320 

GTCGGCCAGT GTGGTCTGCA GAGAGCTGGG CTTTGGGAGT GCCAAAGAGG CAGTCACTGG 1380 

CTCCCGACTG GGGCAAGGGA TCGGACCCAT CCACCTCAAC GAGATCCAGT GCACAGGCAA 1440 

TGAGAAGTCC ATTATAGACT GCAAGTTCAA TGCCGAGTCT CAGGGCTGCA ACCACGAGGA 1500 

30 GGATGCTGGT GTGAGATGCA ACACCCCTGC CATGGGCTTG CAGAAGAAGC TGCGCCT6AA 1560 

CGGCGGCCX3C AATCCCTACG AGGGCOGAGT GGAG6TGCTG GTG6AGAGAA ACGGGTCCCT 1620 

TGTGTGGGGG ATGGTGTGTG GCCAAAACTG GGGCATCGTG 6A6GCCATGG TGGTCTGCCG 1680 

CCAGCTCG6C CTGG6ATTCG CCAGCAACGC CTTCCA6GAG ACCTGGTATT GGCACGGAGA 1740 

TGTCAACAGC AACAAAGTGG TCATGAGTGG AGTGAAGTGC TCGGGAACGG AGCTGTCCCT 1800 

35 GGCGCACTGC CGCCACGACG GGGAGGACGT GGCCTGCCCC CAGGGCGGAG TGCAGTACGG 1860 

GGCCX3GAGTT GCCTGCTCAG AAACCGCCCC TGACCTGGTC CTCAATGCGG AGATGGTGCA 1920 

GCAGACCACC TACCTGGAGG ACCGGCCCAT GTTCATGCTG CAGTGTGCCA TGGAGGAGAA 1980 

CTGCCTCTCG GCCTCAGCCG CGCAGACCGA CCCCACCACG GGCTACCGCC GGCTCCTGCG 2040 

CTTCTCCTCC CAGATCCACA ACAATGGCCA GTCCGACTTC CGGCCCAAGA ACGGCCGCCA 2100 

40 CGCGTGGATC TGGCACGACT GTCACAGGCA CTACCACAGC ATGGAGGTGT TCACCCACTA 2160 

TGACCTGCTG AACCTCAATG GCACCAAGGT GGCAGAGGGC CACAAGGCCA GCTTCTGCTT 2220 

GGAGGACACA GAATGTGAAG GAGACATCCA GAAGAATTAC GAGTGTGCCA ACTTCGGCGA 2280 

tCAGGGCATC ACCATGGGCT GCTGGGACAT GTACCGCCAT GACATCGACT GCCAGTGGGT 234 0 

TGACATCACT GACGTGCCCC CTGGAGACTA CCTGTTCCAG GTTGTTATTA ACCCCAACTT 2400 

45 CGAGGTTGCA GAATCCGATT ACTCCAACAA CATCATGAAA TGCAGGAGCC GCTATGACGG 2460 

CCACCGCATC TGGATGTACA ACTGCCACAT AGGTGGTTCC TTCAGCGAAG AGACGGAAAA 2520 

AAAGTTTGAG CACTTCAGCX3 GGCTCTTAAA CAACCAGCTG TCCCCGCAGJL^AAAGAAGCCT 2580 

GCGTGGTCAA CTCCTGTCTT CAG6CCACAC CACATCTTCC ATGGGACTTC CCCCCAACAA 2640 

CTGAGTCTGA ACGAATGCCA CGT6CCCTCA CCCAGCCCGG CCCCCACCCT GTCCAGACCC 2700 

50 CTACAGCTGT GTCTAAGCTC AGGAGGAAAG GGACCX!TCCC ATCATTCATG GGGGGCTGCT 2760 

ACCTGACCCT TGGGGCCTGA GAAGGCCTTG GGGGGGTGG6 GTTTGTCCAC AGAGCTGCTG 2820 

GAGCAGCACC AAGAGCCAGT CTTGACCGGG ATGAGGCCCA CAGACA6GTT OTCATCAGCT 2880 

TGTCCCATTC AAGCCACCGA GCTCACCACA GACACAGTG6 AGCCGCGCTC TTCTCCAGTG 2940 

ACACGTGGAC AAATGCGGGC TCATCAGCCC CCCCAGAGAG GGTCAGGCCG AACCCCATTT 3000 

55 CTCCTCCTCT TAGGTCATTT TCAGCAAACT TGAATATCTA GACCTCTCTT CCAATGAAAC 3060 

CCTCCAGTCT ATTATAGTCA CATAGATAAT GGTGCCACGT GTTTTCTGAT TTGGTGAGCT 3120 

CAGACTTGGT GCTTCCCTCT CCACAACCCC CACCCCTTGT TTTTCAAGAT ACTATTATTA 3180 

TATTTTCACA GACTTTTGAA GCACAAATTT ATTGGCATTT AATATTGGAC ATCTGGGCCC 3240 

TTGGAAGTAC AAATCTAAGG AAAAACCAAC CCACTGTGTA AGTGACTCAT CTTCCTGTTG 3300 

60 TTCCAATTCT GTGGGTTTTT GATTCAACGG TGCTATAACC AGGGTCCTGG GTGACAGGGC 3360 

GCTCACTGAG CACCATGTGT CATCACAGAC ACTTACACAT ACTTGAAACT TGGAATJUU^ 3420 
GAAAGATTTA TG 

Seq ID No; 109 Protein aeauence_L 
65 Protein Accession #: NP_002309.1 



1 11 21 31 41 51 

I I I 1 I ' 

70 MERPIiCSHLC SCIiAMTiALIiS PLSIiAQYDSW PHVPEYPQQP APEYHQPQAP ANVAKIQLRI. 60 

AGQKRKHSEG RVEVYYDGC2W GTVCDDDFSI HAAHWCREL GYVBAKSWTA SSSYGKGBGP 120 

IWIiDNIiHCTG NEATtiAACTS NGVK3VTDCKH TEDVGWCSD KRIP6PKPDN SLINQIENIiN 180 

IQVEDIRIRA ILSTYRKRTP VMBGYVEVKE GKTWKQICDK HWTAKNSRW C6MFGPPGER 240 

TYNTICVYKMF ASRRKQRYWP PSMDCTGTEA HISSCKLGPQ VSLDPMKNVT CENGLPAWS 300 

75 CVPGQVFSPD 6PSRFRKAYK PEQPLVRliRG 6AYIGEGRVB VLKNGBWGTV CDDKWDLVSA 360 

SWCRELGFG SAKBAVTGSR LGQGIGPIHL NEIQCTGNEK SIZDCKFNAE SQGCNHBEDA 420 
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GVRCNTPAMG LQKKLRIINGG RNPYEGRVEV LVERNGSLVW GMVCGQNWGI VEAMWCRQL 480 

GLGFASNAFQ ETWYWHGDVN SNKWMSGVK CSGTELSU^H CRHDGEDVAC PQGGVQYGAG 540 

VACSETAPDL VLNAEMVQQT TYIiEDRPMFM LQCAMEENCL SASAAQTDPT TGYRRLLRFS 600 

SQIHNNGQSD FRPKNGRHAW IWHDCHRHYH SMEVPTHYDL tiNLNGTKVAE GHKASFCLED 660 

TECEGDIQKN YECANFGDQG ITMGCWDMYR HDIDCQWVDI TDVPPGDYLF QWINPNFEV 720 
AESDYSNNIM KCRSRYDGHR IWMYKCHI6G SFSBETEKKF EHFSGLIiNNQ LSPQ 



10 



Seq ID NO: 110 DNA sequence 

Nucleic Acid Accession #: none found, CAT_73007_3 

Coding sequence: 1--495 (underlined sequences correspond to start and stop codons) 



15 



20 



25 



30 



CGGACGCGTG 
GCTGCTGCCT 
CTGGGTGATC 
ACCTCTAGCT 
ACCCAGGATA 
CCCAGGCGCA 
ACTGTTCCCC 
CCAGCCCCAC 
GCCCACTCTG 
CACTTTGCTC 
GTGGTGATCA 
TCTGAAGATC 
GCX3AGAACGA 
GCACTCAGCA 
GGGC 



11 
I 

GGTCGACCCA 
GCCCGCCACA 
CTGGGCTTCC 
CTCAGGGAGG 
CATCCCTTCC 
GGTGTCCCCA 
CCAATTCAAC 
GTCAGAGACT 
GC TAG GGGGC 
CTGGAGCCTG 
TCCTAGTTGG 
CCAGAACCTG 
CAGCATCGCC 
GC6A6CATCA 



21 
I 

CGCGTCCGCC 
TACCCAGCTG 
TCCTGTTCCG 
CCTTGGCGGT 
TCAGAGGCTA 
GCAGTGGAAG 
CACCATGAGC 
GTGCTCACTG 
GGCAGGCTGA 
TCTACCACAG 
TGTGGTCAGC 
GGAGTACAGG 
CATTATCACA 
AA6GAGCCTA 



31 
I 

CACGCGTCC6 
ACATGGGCAC 
AGGCCACAAC 
CTAAGTCTGA 
ACAGGCCAAG 
AGACGGAGGC 
CTGAGCATGA. 
TGGCTGCATT 
GAGCTCACCT 
TGTTATCAGC 
CTGAGGGTTC 
GCGTGTCTGA 
TGGAA6ACAT 
CGCATGGCCC 



41 
I 

TATG6ACAGA 
CGCA6GA6CC 
TCCCAGCCCA 
CCACAGAGCC 
CCATCTGTCC 
ACAAGCAGAG 
GGGAAGATGC 
TGGGATGGAG 
GTTCAGCAGA 
TTCATTGTCA 
AGTGTCGGAA 
CAAGCTGGTC 
CACACGACTT 
AGACTGA6A6 



51 
I 

GCCTCCACTG 
ATGCAGCTGT 
CAATGACCCA 
AGTTTCTTCC 
AGCACTGGTA 
ACACATTTCA 
GACCATCCTG 
TCGGGTGGAG 
GAAGTGGAAC 
TCCTGGTGGT 
GAGCAAGGAG 
ACAGACCATG 
AGGGCAACAC 
CAAGCACAAA 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



Seq ID NO: 111 Protein sequence; 

Protein Accession #: none found, CAT_73007_3 



35 



40 



45 



50 



55 



60 



65 



70 



75 
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I I I I I I 

RTRGSTHASA HASVWTEPPL AAACPPHTQL TWAPQEPCSC LGDPGIiPPVP RPQLPAHNDP 

TSSSQGGLGG LSLTTEPVSS TQDTSLPQRIi TGQAICPALV PRRRCPQQHK RRRHKQRHIS 

TVPPNSTTMS IiSMRBDATIIj PAPRQRIiCSXi WIiHLGWSRVE AHS6 



60 
120 



Seq ID NO: 112 DNA sequence 

Nucleic Acid Accession #; NM_005424.1 

Coding sequence: 37-3453 (underlined sequences correspond to start and stop codons) 



CGCTCGTCCT 
TTGCTCCCCA 
GCCAACCTGC 
GGGGC6GGGA 
ATCGTGCGCA 
ACGCTTCGCG 
GCTGGG6CGC 
CCAGACAAGG 
CACAAGGAGA 
GACTGGCATG 
TCGAGCGGCA 
CGGCTCATCG 
CCAGGTTGCC 
GGCTTCACTG 
CAGGAGCAGT 
TATGGCTGCT 
GGTCATTTTG 
CGGTTCAGTG 
CGGATCCCCC 
CGGATCAACT 
AAGCCA6AC6 
GCTGAGTTCG 
TCCACATCTG 
CCCCT6GCTG 
GTCTCGTTCT 
A6TACCATGG 
CT6AGGCCAA 
GAG6GG6CCT 
CCGTGGTTGG 
CCCTTGGTGC 



11 
I 

GGCTGGCCTG 
TCCTCTTCTT 
66CTCACGGA 
GGGGCTC66A 
CCCCGCCCGG 
6CTTCTCCAA 
GGCGCACGCG 
TCACACACAC 
AGCAGACAGA 
AAGCCCAGGA 
TCTACAGTGC 
TGCGGGGTTG 
TACATGGAGG 
GCACCCGCTG 
GCCCAGGCAT 
CTTGTGGATC 
GGGCTGATTG 
GTTGTGTCTG 
AGATCCTCAA 
GTGCAGCTGC 
GCACTGTGCT 
AGGTGCCCCG 
GCGGCCAAGA 
CACCTCGGCT 
CTGGGGATGG 
ACTGGTCGAC 
AGACAGGATA 
G6GGGCCTCC 
AGGGCTGGCA 
CCGGGCCACT 



21 

I 

GGTCGGCCTC 
GGCTTCTCAT 
CCCCCAGCGC 
CGCCTGGGGC 
GCCACCCCTG 
GCCCTCGGAC 
CGTCATCTAC 
TGTGAACAAA 
CGT6ATCTGG 
TGGGCGGTTC 
CACTTACCTG 
TGGGGCTGGG 
TGTCTGCCAC 
TGT^CAGGCC 
ATCAGGCTGC 
TGGCTGGAGA 
CCGACTCCAG 
CCCCTCTGGG 
CATGGCCTCA 
AGGGAACCCC 
CCTGTCCACC 
CTTGGTTCTT 
CAGCCGGCGC 
CCTGACCAAG 
ACCCATCTCC 
CATTGTGGTG 
CAGTGTTCGT 
CACCCTCATG 
TGTGGIUVGGC 
GGTGGGCGAC 



31 
I 

TGGAG TATG G 
GT6G6CGCGG 
TTCTTCCT6A' 
CC6CCCCT6C 
CGCCTGGCGC 
CTC6TG6GCG 
GTGCACAACA 
GGTGACACCG 
AAGAGCAACG 
CTGCTGCAGC 
GAAGCCAGCC 
CGCTGGGGGC 
GACCATGACG 
TGCAGAGAGG 
CGGGGCCTCA 
GGAAGCCAGT 
TGCCAGTGTC 
TGGGATGGAG 
GAACTGGAGT 
TTCCCCGTGC 
AAGGCCATTG 
GCGGACAGTG 
TTCAAGGTCA 
CAGA6CCGCC 
ACTGTCCGCC 
GACCCCAGTG 
6TGCAGCTGA 
ACCACAGACT 
ACTGACC6GC 
GGTTTCCTGC 



41 
I 

TCTGGCGG6T 
CGGTGGACCT 
CTTGCGTGTC 
T6CTGGA6AA 
GCAACGGTTC 
TCTTCTCCTG 
GCCCTGGAGC 
CTGTACTTTC 
GATCCTACTT 
TCCCT^AATGT 
CCCTGGGCAG 
CAGGCTGTAC 
GCGAATGTGT 
GCCGTTTTGG 
CCTTCTGCCT 
GCCAAGAAGC 
AGAATGGTGG 
TGCACTGTGA 
TCAACTTAGA 
GGGGCAGCAT 
TGGAGCCAGA 
GGTTCTGGGA 
ATGT6AAAGT 
A6CTTGTGGT 
TGCACTACCG 
AGAACGTGAC 
GCC6GCCAGG 
GTCCTGAGCC 
TGCGAGTGAG 
TGCGCCTGTG 



51 
I 

GCCCCCTTTC 
GACGCTGCTG 
T6GGGAGGCC 
GGAC6ACC6T 
6CACCAGGTC 
CGTGGGCGGT 
CCACCTGCTT 
TGCACGTGTG 
CTACACCCTG 
GCAGCCACCA 
CGCCTTCTTT 
CTUIGGAGTGC 
ATGCCCCCCT 
GCAGAGCTGC 
CCCAGACCCC 
TTGTGCCCCT 
CACTTGTGAC 
GAAGTCAGAC 
GACGATGCCC 
AGAGCTACGC 
GAAGACCACA 
GTGCCGTGTG 
GCCCCCCGTG 
CTCCCC6CTG 
6CCCCAG6AC 
6TTAATGAAC 
GGAA66AGGA 
TTTGTTGCAG 
CTGGTCCTTG 
GGACGGGACA 
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CGGGGGCAGG 
GGACTCACGC 
GGCCCGGCCT 
CACCTCCACG 
GCTCTGCCTG 
GACCCACTGT 
AACGCCA6CA 
AGCAACACAG 
GAGAGCCX366 
GTGTCTGCCA 
AGCTGCCTGC 
CTGCAGTTCA 
CTGA6CTACC 
AACTTCGGCC 
ATCAAAATGC 
GAAGTTCTGT 
AACCGAGGTT 
CTGCGGAAAA 
TCTACCCTTA 
TACCTGAGTG 
GAGAACCTAG 
AAG7VAGACGA 
GTCTATACCA 
CTTGGAGGTA 
GGCTACCGCA 
TGCTGGCGGG 
CGCATGCTGG 
GCGGGCATTG 
GCTGGCCGGA 
CTTAAGCTGC 
GGTGGGCTTA 

CAAACCCCCA 
CAGCTACTCC 
AAAAA 



AGCGGCGGGA 

CTGGCACCCA 

CGCCCCCTGC 

CCCAQGCCCT 

GGCCT^TATC 

GGATAGACGT 

CGCGCTACCT 

TAGAA6AGTC 

CA6CTGAAGA 

CCTGCCTCAC 

ATCG6AGACG 

GCTCAGGGAC 

CA6TGCTAGA 

AGGTCATCCG 

TGAAAGAGTA 

GCAAATTGGG 

ACTTGTATAT 

GCCGG6TCCT 

GCTCCCGGCA 

AGAAGCAGTT 

CCTCCAAGAT 

TGGGGCGTCT 

CCAAGAGTGA 

CACCCTACTG 

TGGAGCAGCC 

ACCGTCCCTA . 

AAGCCAGGAA 

ATGCCACAGC 

GCAAACTCTG 

CTCAAGGAAT 

6G66AACT6G 

TAGTTCAGCT 

CTCCAGCTCC 

CACTCCCGGC 



GAACGTCTCA 
CTACCAGCTG 
ACACGTGCTT 
CTCAGACTCC 
CAAGTACGTT 
GGACAG6CCT 
CTTCCGCATG 
CACCCTG6GC 
GGGCCTGGAT 
CATCCTG6CC 
CACCTTCACC 
CTTGACACTT 
GTGGGAG6AC 
GGCCATGATC 
TGCCTCTGAA 
GCATCACCCC 
CGCTATTGAA 
AGAGACTGAC 
GCTGCTGCGT 
CATCCACAGG 
TGCAGACTTC 
CCCTGTGCGC 
TGTCTGGTCC 
TGGCATGACC 
TCGAAACTGT 
TGAGCGACCC 
GGCCTATGTG 
TGAGGA6GCC 
CT6TCTAACC 
TTTTTTAACT 
GTTCCCATGC 
GCCCCACA6G 
TTCGCTTAAG 
CTGTCATTCA 



TCCCCCCAGG 
GATGTGCAGC 
CTGCCCCCCA 
GAGATCCAGC 
6T6GAGGTGC 
GA6GAGACAA 
CGGGCCA6CA 
AACGGGCTGC 
CAGCAGCTGA 
GCCCTTTTAA 
TACCAGTCAG 
ACCCG6C6GC 
ATCACCTTTG 
AAGAAGGACG 
AATGACCATC 
AACATCATCA 
TATGCCCCCT 
CCAGCTTTTG 
TTCGCCAGTG 
GACCTGGCTG 
GGCCTTTCTC 
TGGATGGCCA 
TTTGGAGTCC 
TGTGCCGAGC 
GACGATGAAG 
CCCTTTGCCC 
AACATGTCGC 
IG^GCTGCCA 
TGT6ACCAGT 
TAAGGGA6AA 
TTTGTAGGTG 
TGTGTTTCCC 
CCAGCACTCA 
GAAAAA7UVTA 



CCCGCACTGC 
TCTACCACTG 
GTGGGCCTCC 
TGACATGGAA 
AG6TGGCTGG 
GCACCATCAT 
TTCAGGGGCT 
AG6CTGAGGG 
TCCT6GCGGT 
CCCTGGTGTG 
GCTCGGGCGA 
CAAAACTGCA 
AGGACCTCAT 
GGCTGAAGAT 
GTGACTTTGC 
ACCTCCTGGG 
ACGGGAACCT 
CTCGAGAGCA 
ATGCGGCCAA 
CCCGGAATGT 
GGGGAGAGGA 
TTGAGTCCCT 
TTCTTTGGGA 
TCTATGAAAA 
TGTACGAGCT 
AGATTGCGCT- 
TGTTTGAGAA 
TCCAGCCAGA 
CT6ACCCTTA 
AAAAAGG6AT 
TCTCATA6CT 
ATCCCT^CTGC 
CACCACTAAC 
AATGTTCTAA 



CCTCCTGACG 
CACCCTCCTG 
AGCCCCCCGA 
6CACCCGGAG 
GG6TGCAG6A 
CCGTGGCCTC 
CGGGGACTG6 
CCCAGTCCAA 
GGTGGGCTCC 
CATCC6CAGA 
GGAGACCATC 
GCCCGAGCCC 
CGGGGAGGGG 
GAACGCAGCC 
GGGAGAACTG 
GGCCTGTAAG 
GCTAGATTTT 
TGGGACAGCC 
TGGCATGCAG 
GCTGGTCGGA 
GGTTTATGTG 
G7VACTACAGT 
GATA6TGAGC 
GCTGCCCCAG 
GATGCGTCAG 
ACAGCTAGGC 
CTTCACTTAC 
ACGTGGCTCT 
CA6CCTCTGA 
CTGGGGAT6G 
ATCCTGG6CA 
TCCCCCAACA 
ATGCCCTGTT 
TAAGCTCCAA 
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Seg ID No: 113 Protein secmence! 
Protein Accession #: NP 005415.1 



40 



45 



50 



55 



60 



MVWRVPPPLL 
LLLEKDDRIV 
NSPGAHLLPD 
QLPNVQPPSS 
DGECVCPPGF 
QCQBACAPGH 
EFNI.ETMPRI 
SGFWECRVST 
RLHYRPQDST 
DCPBPLLQPW 
QARTALLTGIi 
QltTWKHPETOj 
SIQGXiGDWSN 
LTLVCIRRSC 
FEOIiIGBGNF 
INLIiGACKNR 
SDAANGMQYL 
AIESLNYSVY 
EVYBIiMRQCW 



11 
I 

PILFLASHVG 
RTPPGPPLRL 
KVTHTVNKGD 
GIYSATYLEA 
TGTRCEQACR 
FGADCRLQCQ 
NCAAAGNPFP 
SGGQDSRRFK 
MDWSTIWDP 
IjEGWHVEGTD 
TPGTHYQIiDV 
PGPISKYWE 
TVEBSTIjGNG 
LHRRRTFTYQ 
6QVIRAMIKK 
GYLYIAIEYA 
SEKQPIHRDL 
TTKSDVWSFG 
RDRPYERPPF 



21 
I 

AAVDIiTLLAN 
ARNGSHQVTL 
TAVLSARVHK 
SPIiGSAFPRL 
EGRFGQSCQE 
CQNGGTCDRF 
VRGSIELRKP 
VNVKVPPVPL 
SENVTLMNLR 
RLRVSWSIiPI* 
QLYHCTLLQP 
VQVAG6AGDP 
LQAEGPVQES 
SGSGEETILQ 
OGLKHNAAIK 
PYGNIiLDFIiR 
AARNVLVGEN 
VLLWEIVSIiG 
AQIALQLGRM 



31 
I 

IiRIiTDPQRFF 
RGFSKPSDIjV 
EKQTDVIWKS 
IVRGCGAGRW 
QCPGISGCRG 
SGCVCPSGWH 
DGTVLIiSTKA 
AAPRLLTKQS 
PKTGYSVRVQ 
VPGPIiVGDGF 
ASPPAHVIiLP 
IiWIDVDRPEE 
RAABEGIiDQQ 
FSSGTLTIiTR 
MIifCBYASEND 
KSRVLETDPA 
liASKIADFGIi 
GTPYCGMTCA 
liEARKAYVNM 



41 

I 

LTCVSGEAGA 
GVFSCVGGAG 
NGSYFYTLDW 
GPGCTKECPG 
LTFCLPDPYG 
GVHCEKSDRI 
IVEPEKTTAB 
RQLWSPLVS 
LSRPGEG6EG 
LIiRIjWDGTRG 
PSGPPAPRHL 
TSTIIRGI^ 
LIIiAWGSVS 
RPKIiQPBPLS 
HRDFAGEIiEV" 
FAREHGTAST 
SRGEEVYVKK 
ELYEKLPQGY 
SLFENFTYAG 



51 
I 

GRGSDAWGPP 
ARRTRVIYVH 
HEAQDGRFLIi 
CLHGGVCHDH 
CSCX3SGWRGS 
PQlLNMASEIi 
FEVPRLVIiAD 
FSGDGPISTV 
AWGPPTIiMTT 
QERRENVSSP 
HAQAIiSDSEI 
STRYLFRMRA 
ATCLTILAAL 
YPVLEWEDIT 
IiCfCLGHHPNI 
IiSSRQIXRPA 
TMGRIjPVRWM 
RMEQPRNCDD 
IDATAEEA 
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Seq ID NO: 114 DNA sequence 
Nucleic Acid Accession #: NM_002632.l 

65 Coding sequence: 322-771 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

GGGATTCX3GG CCGCCCAGCT ACGGGAGGAC CTGGAGTGGC ACTGGGCGCC CGACGGACCA 60 

70 TCCCCGGGAC CCGCCTGCCC CTCGGCGCCC CGCCCCGCCG GGCCGCTCCC CGTCGGGTTC 120 

CCCAGCCACA GCCTTACCTA CGGGCTCCTG ACTCCGCAAG GCTTCCAGAA 6ATGCTCX3AA 180 

CCACCX3GCCG GGGCCTCGGG GCAGCAGTGA GGGAGGCGTC CAGCCCCCCA CTCAGCTCTT 240 

CTCCTCCTGT 6CCA6GGGCT CCCCGGGGGA TGAGCATGGT GGTTTTCCCT CGGAGCCCCC 300 

TGGCTCGOGA CGTCTGAGAA GATGC CGGTC ATGAGGCTGT TCCCTTGCTT CCTGCAQCTC 360 

75 CTGGCCGGGC TG6CX5CTGCC TGCTGTGCCC CCCCAGCAGT GGGCCTTGTC T6CTGGGAAC 420 

G6CTCGTCAG A06TGGAA6T GGTACCCTTC CA6GAAGTGT GG6GCCGCAG CTACTGCCGG 480 
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GCGCTGGAGA 
AGCCCATCCT 
TGTGTGCCGG 
CGGCCCTCCT 
CGG6AGAAGA 
TGGAGGAGAG 
TCCTGCT6GT 
CCTTCAAGAC 
TGAGAGAAAG 
ACACGTGGCC 
GCAGAA6GAA 
GCAGCCCTTG 
AC6GCCTGGT 
CTTCTGAAGA 
TCCTTGTCCC 
TTTCCGGCCG 
GGCTGGAGAA 
GGGAGGAGCC 
CTGGCACCCC 
ATAAAGTATT 



GGCTGGTGGA 
GTGTCTCCCT 
TGGAGACGGC 
ACGTGGAGCT 
TGAAGCCGGA 
AGACCCCGCA 
ACCTGCCCTC 
GAGGG6CAGG 
AGAGAAGCCA 
TCX3TGAGGGG 
AGAA6GG6GC 
CTTTC66A6C 
G6TGGGAAG6 
TCAGAACATT 
CCGTGATCTC 
AGGTGCCACC 
AGAGCTGCCT 
TGTGCGTCCC 
CACAAGCTGT 
CTAGTGTGGA 



CGTCGTGTCC 
GCTGCGCTGC 
CAATGTCACC 
GACGTTCTCT 
AAG6TGCGGC 
CCCXSGCTCXST 
TATTTATTA6 
6AA6GACAGG 
GCCACAGACC 
CAAGCTAG6C 
CCTGCTACCT 
TCCTGTCCAA 
CCGGCAGC6G 
CAGCTCTGGA 
CCCTCACACT 
ACCCTGCCCC 
GGATGAGAAA 
AGCTGAAGGC 
CCCTGCAGGG 
AACGC , 



GAGTACCCCA 
ACCGGCTGCT 
ATGCAGCTCC 
CAGCACGTTC 
GATGCTGTTC 
GTATTTATTA 
CCAACTGTTT 
ACCCTCAGGA 
CCTGG6AGCT 
CCCA6AGGCC 
6TTCTTGGGC 
AGTA6GGATG 
GC6GA6GGGA 
GAACAGTGGT 
TTGCCATTTG 
CACTAAGAGA 
CAGCTCAGCC 
AGTGGCAGGG 
CCATCTGACT 



GCGAGGTGGA 
GCGGCGATGA 
TAAAGATCCG 
GCTGCGAATG 
CCCGGAGGTA. 
CCX3TCACACT 
CCCT6CTGAA 
ATTCAGTGCC 
TCCGCTTTQA 
CTGGAGGTCT 
CTCAGGCTCT 
CGGATTCTGC 
TTCA6CCACT 
T6CCTGG6GG 
CTTGTACTGG 
CACATACAGA 
AGTGGGGATG 
GAGCAGGTTC 
GCCAAGCCAG 



GCACATGTTC 
GAATCTGCAC 
TTCTGGGGAC 
CCGGCCTCTG 
ACCCACCCCT 
CTTCAGTGAC 
TGCCTCGCTC 
TTCAACAACG 
AAGAA6CAAG 
CCA6GG6CCT 
GCACAGACAA 
TGGGGCC6CC 
TCCCCCTCTT 
CTTTTGCCAC 
GACATTGTTC 
GTGGGCCCCG 
AGGTCACCAG 
CCCAAGGGCC 
ATTCTCTTGA 
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600 
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Seq ID No: 115 Protein sequence; 
Protein Accession NP 002623.1 
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MPVMRLFPCF LQWiAGLALP AVPPQQWALS AGNGSSEVEV VPFQEVWGRS YCRALERLVD 60 
WSEYPSEVE HMPSPSCVSIj LRCTGCCGDE NLHCVPVBTA NVTMQLIiKIR SGDRPSYVEL 120 
TFSQHVRCEC RPZiREKMKPB RCGDAVPRR 

Seq ID NO: 116 DNA sequence 

Nucleic Acid Accession #: KM_007361.1 

Coding sequence: 1-4131 (underlined sequences correspond to start and stop codons) 



1 
I 

ATGG AGGGGG 
CTGCAGTTGC 
GAGTCGTGGT 
GCTGGCGAAT 
ACGGCATCAT 
CCACCGACTT 
GCCGAGTCCT 
TGCX3CGCTGG 
CTGGGAGCAG 
AACACTTTCC 
CCTGCCAACG 
CA6CTTCCAG 
CCATATTTCA 
CTGGGGATCC 
A66CCA6CT6 
TTCAGCCATG 
GTGAATGAGG 
AGCAGCATTG 
ACCTTGGATC 
AAAGGCCAAG 
GACA6AGATT 
ATCCAGCCCT 
CCTGAAGAAG 
CGAGGGACGT 
TATAATGCTG 
TTCTGCACGG 
GGGAAGCACT 
CTCCACGTGG 
GGCAATGATG 
CTCCTCCCCC 
GGCTCTGAGA 
TTCTACCCGQ 
AACTACCTGA 
ACAGCCCACA 
ACAAGTTCCA 
ATCCACCAGA 
ACCACCCAGC 
CTTAGATTTG 
GTGAATCCTT 
ACA6GTGTAG 



11 
I 

ACCGGGTGGC 
TAATGTTGCG 
GGGACCAGCT 
CCCCTGCACT 
CTCCACTCAG 
CCCGGCCATC 
GTACCGAGAG 
CTTCCCGCGC 
GTAGGCGCTT 
AGGCAGTTTT 
GCCTGCAGTT 
CTCG6GTGGG 
GCTTGACTAG 
CTG6AGTGTG 
CAGTTGGAGA 
CTACAGCCCT 
AG6AAGCTGA 
ATGTTTCCTT 
CTCACACCAA 
TTGAGCCCTG 
CACTGGCTCC 
ACCCAGATGG 
AAATTGTTCT 
ATGAGGTGGG 
CCAACAAGGA 
ACTATGCCAC 
GTCTGCCTGA 
GCCATACACC 
GCAGAGCCTA 
TCACACCAAT 
AC6GCTTCA6 
GAGAG6AGAC 
GCATTAA6AC 
TCTCTCCCTA 
GAGACTACTC 
ACATCACTTA 
A6CTGAACGT 
CTGTGACCAA 
GCTATGATG6 
ATTACACCTG 



21 
I 

CGGGCGGCCX5 
GGCCGCGGCG 
CCTGCAGGAA 
TCTTACGAAG 
GACTTCCCCA 
GCCCCTTTTC 
6ACACCTCCC 
TCTGCGCGCT 
ACGAGGAGGT 
GGCATCTGAT 
CCTTGGAACC 
CTTCTGCCGA 
CACTGAACAG 
GGCTTTCCAT 
CXTTTTCCGCT 
GGAAAGTGAC 
ATACCTTCCG 
CCAATCCAAA 
AGAAGGAACA 
GGATGAGAGA 
TTCCTGGGAA 
AGGGCCAGTG 
TCGAAGTTAC 
ACTGGAAGAC 
AACCTGTGAA 
TGGCTTCTGC 
GGGGGCACCT 
CGTGCACTTC 
CACGGCCATC 
TGGAGGCCTG 
CCTCGCAGGT 
6GTTCGTATC 
CAACATTCAA 
CAAGGAGCT6 
TCTGACTTTT 
CCAG6TGTGC 
6GACCG6GTC 
TCAAATTGGC 
GAGCCACATG 
TGA6TGCGCA 



31 
1 

GTGCTGTCGT 
CTGCACCCAG 
GGCGACGACG 
CCCGATTCAG 
GGGAAACGCA 
TGGCGGACAT 
CCGCAGTGCT 
TTTTACCCCC 
CAAACGCGGG 
GGGTCTGATA 
CGCCCCAAAG 
GGGGAGGCT6 
TCTGTGAAAA 
ATCGGCAGCA 
GCCCACTCTT 
TATAATGAGG 
G6TGAACCAG 
6TGGATACAA 
TCTCTGGGAG 
GAGACCAGAA 
ACCCCACCAC 
CCTTCGGAAA 
CCTGCTTCAG 
AACATAGGTT 
CACAACCACA 
TGCCACTGCC 
CACCGAGTGA 
ACTGATGTGG 
AGCCACATCC 
TTTGGCTGGC 
GCTGCCTTTA 
ACTCAAACTG 
GGCCAGGTGC 
TACCACTACT 
GGTGCAATCA 
AGGCACGCCC 
TTTGCCTTGT 
CCGGTCAAAG 
TGTGACACAA 
TCT6GGTACC 



41 

I 

CGTTACCAGT 
ACGAGCTCTT 
TAAAGCTCAG 
CAACCTCTAC 
GTATGTGGAC 
CGACACGAGC 
GGGCCTGGCC 
ACCCACGCCT 
CGCTGCCCTC 
GCTACGCCCT 
AGTCTTACAA 
ATGATCT6AA 
ATCTCTATCA 
CTTCCCCGTT 
CTGTTCCCCT 
ACAATTTGGA 
AGGAGGCATT 
AGCCTTTAGA 
AGGTAGGGGG 
GCCCAGCTCC 
CGTACCCCGA 
TGGATGTTCC 
GTCACACTAC 
CCAACACCGA 
GACAATGCTC 
AATCCAAGTT 
ATGGGAAAGT 
ACCTGCATGC 
CACAGCCAGC 
TCTTTGCTTT 
CCCATGACAT 
CTGAGGGACT 
CTTACGTCCC 
CCGACTCCAC 
ACCAAACATG 
CCAGACACCC 
ATAATGAT6A 
AAGATTCAGA 
CAGCACGGTG 
AG6GAGATG6 



51 
I 

GCTACTGCTG 
CCCACACGGG 
CCGTGGTGAA 
GTGGGCACCA 
TATGATTTCC 
CACGGCAGAG 
GCCCGCTATG 
TCCTGGCCAC 
GGGAGAGCTG 
CTTTCTTTAT 
TGTCCAGCTT 
GTCAGAAGGA 
ACTAAGCAAC 
GGACAATGTC 
GGGACGTTCC 
TTACTACGAT 
6AATGGCCAC 
GGAATCTTCC 
CCCAGATTTA 
ACCAGAGGTA 
AAACGGAAGC 
CCCAGCTCAT 
ACCCTTT^GT 
GGTCTTCACG 
CCGGCATGCC 
TTATGGAAAT 
GAGTGGCCAC 
GTATATCGTG 
AGCCCAGGCC 
AGAAAAACCT 
GGAAGTTACA 
TGACCCAGAG 
AGCAAATTTC 
TGT6ACCTCT 
GTCCTACCGC 
GTCCTTCCCC 
AGAAA6AGTG 
CCCCACTCCG 
CCATCCAGGG 
ACGGAACTGT 
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GTGGATGAAA ATGAATGTGC AACTGGCTTT 
AACTTGCCTG GAAGCTACAG GTGT6A6T6C 
CATACTTGCA TCTT6ATCAC CCCACCTGCC 
GCTCCTGCTG GGCAGGCCCG GTGTGTTCAC 
5 CTGCCTGGTT ATGCCGGCGA TGG6CACCAG 
AGATGTCACC CTGCAGCTAC CTGCTACAAT 
CCCGGATATT ATGGGGATGG ATTTCAGTGC 
TGTGAACAAC A6CA6CGCCA TGCCCAGGCC 
ATCCCCCAAT GCGACGAGCA GGGCAACTTC 

10 TTCT6CTGGT 6CGTGGACCC TGATGGTCAT 
TCCACCCCX3C CTCACTGTGG ACCATCACCA 
GAGCGCTGGA GGGAAAACCT GCTGGAGCAC 
GTGCCCCAGT GCGATGACCT GGGCCACTTC 
TTCTGCTGGT GTGTGGACAA AGATGGCAGA 

15 ACCACCCCTG CGTGTATACC CACCGTCGCT 
GATGTGACCC CTCCATCTGT GGGCACCTTC 
TACTTACCCC TCAATGGCAC CAGGCTTCAG 
CATGGCTCCA TAATCGTGGG AATTGATTAC 
GATGTTGCTG GACGGACAAT CAGCCGTGCC 

20 ATCGTGAATT CAGGTCTGAT AAGCCCTGAA 
ATGTACTGGA CGGACAGTGT CCTGGATAAG 
CGCAAGGTCC TCTTCTACAC AGATCTGGTG 
CGAGGCAACT TGTACTGGAC AGACTGGAAT 
TTAGATGGAG AAAACAGAAG AATTCTGATC 

25 ACCTTTGACC CTTTCTCTAA ACTGCTCTGC 
TGTACACTAC CTGATGGAAC TGGACGGCGT 
AGCATCGTAA GCTATGCAGA TCACTTCTAC 
TCAGTAAATA AACATAGTGG CCAGTTTACT 
CTCTACGG6A TAACTGCAGT CTACCCCTAC 

30 TGTAAAG6AA GACTTGGAGT TTACAATCAG 
AGGCAAAGAA AGTAAAAAAG GAATTG6CCA 
TTTTGTAGTG CAAAAAGACT TTTGTGAAAA 
TTAAAAATGA AGGTTGTTAT TGCAAGTTTA 
TTAAAGCAAC TTCTTGTAAA CATTTATCAT 

35 AAGAATTAGA GTTTAAGACT CTAAACCTGA 
ATTAAAGCAC ATGTGATCAA TATAACAATA 
AATGCAGAAC TGATGGGAAA GGACCAATTA 
TTTTTACCTC TGCATCAGTG CATTTCTATT 
TGCATTTTCT GATCCTGTAG TGCCTCTATA 

40 TTATAAATAC CAAAGATGTA ACAATTTTAA 
TAA6TTTAAA AAAAAAAAAA AAAAAAAAA 



CATCGCTGTG GCCCCAACTC TGTATGTATC 2460 

C6GAGT6GTT ATGAGTTTGC AGATGACCGG 2520 

AACCCCTGTG AGGATGGCAG TCATACCTGT 2580 

CATGGAG6CA GCACGTTCA6 CTGT6CCTGC 2640 

TGCACT6ATG TAGATGAAT6 CTCA6AAAAC 2700 

ACTCCTGGTT CCTTCTCCTG CCGTTGTCAA 2760 

ATACCTGACT CCACCTCAAG CCT6ACACCC 2820 

CAGTATGCCT ACCCTGGGGC CCGGTTCCAC 2880 

CTGCCCCTAC AGTGTCATGG CAGCACTGGT 2940 

GT^GTTCCTG GTACCCAGAC TCCACCTGGC 3000 

GAGCCCACCC AGAGGCCCCC GACCATCTGT 3060 

TACGGTGGCA CCCCCCGAGA TGACCAGTAC 3120 

ATCCCCCTGC AGTGCCACGG AAAGAGCGAC 3180 

GAGGTGCAGG GCACCCGCTC CCAGCCAGGC 3240 

CCACCCATGG TCCGGCCCAC GCCCCGGCCA 3300 

CTGCTCTATA CTCAGGGCCA GCAGATTGGC 3360 

AAGGATGCAG CTAAGACCCT GCTGTCTCTG 3420 

GACTGCCGGG AGAGGATGGT GTACTGGACA 3480 

GGTCTGGAAC TGGGAGCAGA GCCTGAGACG 3540 

GGACTTGCCA TAGACCACAT CCGCAGAACA 3600 

ATAGAGAGCG CCCTGCTGGA TGGCTCTGAG 3660 

AATCCCCGTG CCATCGCTGT GGATCCAATC 3720 

AGAGAAGCTC CTAAAATTGA AACGTCATCT 3780 

AATACAGACA TTGGATTGCC CAATGGCTTA 384 0 

TGGGCAGATG CAGGAACCAA AAAACT6GA6 3900 

GTCATTCAAA ACAACCTCAA 6TACCCCTTC 3960 

CACACAQACT 6GAGGAGGGA T6GTGTTGTA 4020 

GATGAGTATC TCCCAGAACA AC6ATCTCAC 4080 

TGCCCAACAG GAAGAAAG2rA_AGTACAGTAA 4140 

AACCTTGGACC CTAAAGAACA GTGACTGCAA 4200 

TTAGACGTTC CTGAGCATCC AAGATGAACA 4260 

GCTGATACCT CTU^TCTTTAC TACTGTATTT 4320 

AAAAGGTAAC AGAATTTTAA CTGTTGCTTA 4380 

TAATATTTAA AAGATCAAAT TCATTCAACT 444 0 

TTTTTGCCAT GGATTCCTTC TGGCCAAGAA 4500 

TAATCCTAAA CCTTGACAGT TGGAGAAGCC 4 560 

TTTATAGTTT CCCAACAAAA GTTCTAAGAT 4620 

TATATCAAAA GGTGCTAAAA TGATTCTU^TT 4680 

G7VAGTACCCA CAGAAAGTAA AGTATCACAT 4740 

AATTTTCTA6 ATTACTCCAA TAAAGTGTTT 4800 
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Seq ID No: 117 Protein sequence; 
Protein Accession #: NP 031387.1 



50 



55 



60 



65 



70 



MEGDRVAGRP 
AGESPAIiLTK 
AESCTERTPP 
NTFQAVLASD 
PYFSLTSTEQ 
FSHATALESD 
TIiDPHTKEGT 
IQPYPDGGPV 
YNAANKETCE 
liHVGHTPVHF 
GSENGFSLAG 
TAHISPYKEL 
TTQQLNVDRV 
TGVDYTCECA 
HTCILITPPA 
RCHPAATCYN 
IPQCDEQGNP 
ERWRENLLEH 
TTPACIPTVA 
HGSIIVGIDY 
MYWTDSVWK 
UXSENRRILI 
SIVSYADHFY 



11 
I 

VIiSSLPVLLL 
PDSATSTWAP 
PQCWAWPPAM 
GSDSYALFIiY 
SVKNLYQLSN 
YNEDNIiDYYD 
SLGEVGGPDXi 
PSEMDVPPAH 
HNHRQCSRHA 
TDVDLHAYIV 
AAFTHDMEVT 
YHYSDSTVTS 
FALYNDEERV 
SGYQGDGRNC 
NPCEDGSHTC 
TPGSFSCRCQ 
liPLQCHGSTG 
Y6GTPRDDQY 
PPMVRPTPRP 
DCRERMVYWT 
ZESTOiIiDGSB 
NTDZGIiPHGIi 
HTDWRRDGW 



21 
I 

LQLLMIiRAAA 
TASSPIiRTSP 
CALASRALRA 
PANGLQFIiGT 
liGIPGVWAFH 
VNEEEAEYLP 
KGQVEPWDER 
PEEBIVIiRSY 
FCTDYATGPC 
GNDGRAYTAI 
PYPGEETVRI 
TSSRDYSIiTF 
liRFAVTNQIG 
VDENECATGP 
APAGQARCVH 
PGYYGDGFQC 
FCWCVDPDGH 
VPQCDDIiGHF 
DVTPPSVGTF 
DVAGRTISRA 
RKVIiFYTDIiV 
TFDPFSKbLC 
SVNKHS6QFT 



31 
I 

LHPDEIiFPHG 
GKRSMWTMZS 
FYPHPRIiPGH 
RPKESYNVQI* 
IGSTSPIiDNV 
GEPEEAIiNGH 
ETRSPAPPEV 
PASGHTTPLS 
CHCQSKFYGN 
SHIPQPAAQA 
TQTAEGLDPE 
GAINQTWSYR 
PVKEDSDPTP 
HRCGPNSVCI 
HGGSTFSCAC 
IPDSTSSLTP 
EVPGTQTPPG 
IPLQCHGKSD 
liliYTQGQQIG 
GIiELGAEPET 
NPRAIAVDPI 
WADAGTKfOiE 
DEYLPEQRSH 



41 
I 

ESWWDQIiIiQE 
PPTSRPSPLF 
IiGAGRRIiRGG 
QLPARVGFCR 
RPAAVGDLSA 
SSIDVSFQSK 
DROSIiAPSWB 
RGTYBVGIiED 
GKHCLPBGAP 
LLPLTPIGGL 
WYLSIKTNIQ 
IHQNITYQVC 
VNPCYDGSHM 
NLPGSYRCEC 
LPGYAGDGHQ 
CEQQQRHAQA 
STPPHCGPSP 
FCWCVDKDGR 
YLPIiNGTRLQ 
IVNSGLISPE 
RGNLYWTDWN 
CTLPDGTGRR 
LYGITAVYPY 



51 
I 

GDDVKIiSRGE 
WRTSTRATAE 
QTRALPSGEIi 
GEADDLKSEG 
AHSSVPLGRS 
VDTKPLEESS 
TPPPYPENGS 
NIGSNTEVFT 
HRVNGKVSGH 
FGWLFALEKP 
GQVPYVPANF 
RHAPRHPSFP 
CDTTARCHPG 
RSGYEFADDR 
CTDVDECSEN 
QYAYPGARFH 
EPTQRPPTIC 
EVQGTRSQPG 
KDAAKTItLSIi 
GLAIDHZRRT 
REAPKZBTSS 
VZQNNLKYPF 
CPTGRK 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
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Seq ZD HO: 118 DNA fleouence 

Nucleic Acid Accession #: KM_0030e8.1 

Coding sequence: 112-1593 (underlined sequences correspond to start and stop codona) 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



GCGGAGGGTG 
CCCGCCACCC 
AACG6CACAG 
CTGACG6CCG 
CAGATCTGGA 
AGCCACCTGG 
GTGCCCGGTC 
CA6TCCGAGG 
CAGACGGTGT 
ATCTACAGTG 
GCCGTGGACC 
CAGCGCTACA 
GCGCGCCCCG 
CGCGACTGCG 
AAGGCCACCA 
GTGCTGCAGG 
AATCAGGACG 
AAGTGTGCCT 
CAGTCCACCG 
CGCATCACAC 
GCCGCCTCGG 
CCCATCATCG 
CTGGACGCCA 
AACATCAAAG 
AGCGGCGACA 
AA6GTGGGCG 
ACCGTGGACC 
CCACATGGCG 
GGCGGGAGGC 
CCTGTCGCCC 
TCAGCGGCTG 
CG6GGCGAGT 
GAAGC6GCTA 
TTTGCCTCTC 
CTGTCAGTGG 
CGGGAGGGCT 
CTCCCACGTG 
ACAGGGTCTG 
GGGCCGTCTT 
CAAATCAGTA 
GTAGTAGCGA 
CCCCCTCTTT 
GCCAGAGCCC 
CGCCCCCTCC 
TCCCCAACAT 
TATAACTCTA 
AGTCTGC 



11 
I 

CGTGCGGGCC 
ACCTCCCGGG 
CCX3A6GCGGT 
AG6C6TTCGG 
C6CTGGAGCA 
GCCGCTACCT 
CC6ACTGCCG 
CGCACCGGCG 
CCCCCGCCGA 
TCACCCGTAA 
GCGACGTGCC 
GCGTGCAGAC 
AGCCGGCCAC 
AGGGCCGTTA 
AGGTGGGCAA 
CGGCCAACGA 
AGGAGACCGA 
TCCGTACCCA 
CCTCCAGCAA 
TGAGGGCGTC 
TGGAGACAGC 
TGTTCCGCGG 
ACCGCTCCAG 
ACTCCACAGG 
CTCCTGTGGA 
GGCGCTACCT 
CCX3CCTCGCT 
GCTCCTGCCA 
AAGCCCCCTT 
CTATGGACTC 
CGGCCTGGCC 
CT6GCACCTC 
AGGGACGGTT 
CCAGCCACCT 
CCCTCCCTGG 
AGGACTGACC 
GGAGAGGCTC 
CCCGCTGCAC 
CCTCCTGTCT 
TTTTTTTTAA 
GTGATCTGGC 
CCGTCCTTCC 
CTGCTGTGAT 
GGGAGCCCTG 
GCATCTCACT 
AACGCCCATG 



21 
I 

GCGGCAGCCG 
GCCGCGCAGC 
GCAGATCCAG 
GTTCAAGGTG 
GCCCCCTGAC 
G6CGGCG6AC 
TTTCCTCATC 
CTACTTCGGC 
GAAGT6GAGC 
6CGCTAC6CG 
CTGGGGCGTC 
CGCCGACCAC 
TGGCTACACG 
CCTGGCGCCG 
GGACGAGCTC 
GAGGAACGTG 
CCAGGAGACC 
CACGGGCAAG 
GAATGCCAGC 
CAATGGCAAG 
AGGGGACTCA 
GGAGCATGGC 
CTATGACGTC 
CAAATACTGG 
CTTCTTCTTC 
6AAGGGCGAC 
CTGGGAGTAC 
ACCCTCCCTG 
6CCTTTCAAA 
CCCACTCTCC 
CTGG6AGGGA 
TTTCTTCT6A 
GGGG6CT6GG 
CCTCCCAGCC 
TGCACTGTCC 
CTTGTGGTGT 
AGCCTGGCTC 
GTTCTGCCAA 
CTTTCCTTTC 
TGAAATATTA 
GGGGGGCGTC 
CGTCCAGCCC 
TGGTGCTCCC 
GGGTGAGCCG 
CTGGGTGTCT 
ATAGTAGCTT 



31 
I 

AACAAAGGAG 
GGCCTCTCGT 
TTCGGCCTCA 
AACGCGTCCG 
GAGGCGGGCA 
T^GGACGGCA 
GTGGCGCACG 
GGCACCX3AG6 
GTGCACATOG 
CACCTGAGCG 
GACTCGCTCA 
CGCTTCCTGC 
CTGGAGTTCC 
TCGGGGCCCA 
TTTGCTCTGG 
TCCACGCGCC 
TTCCAGCTGG 
TACTGGACGC 
TGCTACTTTG 
TTTGTGACCT 
GAGCTCTTCC 
TTCATCGGCT 
TTCCAGCTGG 
ACGGTGGGCA 
GAGTTCTGCG 
CACGCAGGCG 
TAGGGCCGGC 
CTAACCCCTT 
CTGGAAACCC 
CCTCCGCCCG 
TTTCAGATGC 
CCTCAGAC6G 
AGCCCTGGGC 
CCCCAGGAGA 
CCGAAACCCC 
TTTTTTGGGT 
CCTTCCCTGG 
GGTGGTGGTG 
ACCCTAGCCT 
TTGCTGGAGG 
TCAGCACCCT 
CAGCCCTGGG 
TGGGCCTCCC 
CCGGGGCCCC 
TGGTCTTTTA 
CAAACTGGAA 



41 
I 

CAGGGGCGCC 
CTACTGCCAC 
TCAACTGCGG 
CCAGCAGCCT 
GCGCGGCCGT 
ACGTQACCTG 
ACGACGGTCG 
ACCGCCTGTC 
CCATGCACCC 
C6CGGCCGGC 
TCACCCTCGC 
GCCACGAC6G 
GCTCCGGCAA 
GCGGCACGCT 
AGCAGAGCTG 
AGGGTATGGA 
AGATCGACCG 
TGACGGCCAC 
ACATCGAGTG 
CCAAGAAGAA 
TCATGAAGCT 
GCCGCAAGGT 
AGTTCAACGA 
GTGACTCCGC 
ACTATAACAA 
TCCTGAAGGC 
CCGTCCTTCC 
CTCCGCCAGG 
CAGAGAJ^AAC 
GGTTCCCTAC 
CCCTGCCCTC 
CTCTGAGCCT 
GTGTAGTGTA 
GCTGGGCACA 
TGCTTGGGAA 
GGTGGCTGGA 
AGCGGCAGGG 
GCGGGCGGGT 
GACTGGAAGC 
CGTCCCAGGC 
CCCCAGGGGG 
CCTGGGCTGC 
GGGTGGATGA 
CCTGCTGCCA 
TTTTTTGTAA 
ATAGCGAAAT 



51 
I 

GCCGCAGGGA 
CATGACCGCC 
CAACAAGTAC 
GAAGAAGAAG 
GTGCCTGCGC 
CGAGCGCGAG 
CTGGTCGCTG 
CTGCTTCGCG 
TCAGGTCAAC 
CGAC6AGATC 
CTTCCAGGAC 
GCGCCTGGTG 
GGTGGCCTTC 
CAAGGCGGGC 
CGCCCAGGTC 
CCTGTCTGCC 
CGACACCAAA 
CGGGGGCGTG 
GCGTGACCGG 
TGGGCAGCTG 
CATCAACCGC 
CACGGGCACC 
TGGCGCCTAC 
GGTCACCAGC 
GGTGGCCATC 
CTCG6CGGAA 
CCGCCCCTGC 
TG6GCTCCAG 
GGTGCCCCCA 
TCCCCTCGGG 
TTGTCTGCCA 
TATTTCTCTG 
ACTGGAATCT 
TGTCCCAAGC 
GGGAAGCTGT 
7VACAGCCCCT 
CGTGACGGCC 
AGGGGTGTGG 
AGAAAATGAC 
AAGCCTGGCT 
TGCATCTCAG 
CGACACCTGG 
AGCCAGGCGT 
GCCTCCCCCG 
GTGTCATTTG 
AAAATAACTC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
64 0 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 



Seq ID No: 119 Protein sequence; 
Protein Accession #! NP 003079.1 



55 



60 



65 



70 



MTANGTAEAV 
CLRSHIXSRYL 
CFAQTVSPAE 
FQDQRYSVQT 
KAGKATKVGK 
DTKKCAFRTH 
GQIiAASVBTA 
GAYNIKDSTG 
SAETVDPASZi 



11 

I 

QIQFGIjINCG 
AADKDGNVTC 
KWSVHIAMHP 
ADHRPLRHDG 
DELFALEQSC 
TGKYWTLTAT 
GDSELFIiMKL 
KYWTVGSDSA 
WBY 



21 
I 

NKYLTAEAFG 
EREVPGPDCR 
QVNIYSVTRK 
RLVARPEPAT 
AQWLQAANE 
GGVQSTASSK 
INRPIIVFRG 
VTSSGDTPVD 



31 
I 

FKVNASASSIi 
FLIVAHDDGR 
RYAHLSARPA 
GYTLEFRSGK 
RNVSTRQGMD 
NASCYFDIEW 
EHGFIGCRKV 
FFFEPCDYNK 



41 

I 

KKKQIWTLEQ 
WSLQSEAHRR 
DEIAVDRDVP 
VAFRDCEGRY 
LSANQDEETD 
RDRRITLRAS 
TGTLDANRSS 
VAIKVGGRYL 



51 
I 

PPDEAGSAAV 
YFGGTEDRLS 
WGVDSLITLA 
LAPSGPSGTL 
QETFQLEIDR 
NGKFVTSKKN 
YDVFQLEFND 
KGDHAGVLKA 



60 
120 
160 
240 
300 
360 
420 
480 



Seq ID NO: 120 DNA sequence 

Nucleic Acid Accession #: NM_006404.1 

Coding sequence: 25-741 (underlined- sequences correspond to start and stop codons) 



75 



1 11 21 31 41 51 

I I I I I I 

CAOGTCCGGA GCCTCAACTT CAG GATGT TG ACAACATTGC TGCCGATACT GCT6CTGTCT 60 
GGCTGGGCCT TTTGTAGCCA AGACGCCTCA GATGGCCTCC AAAGACTTCA TATGCTCCAG 120 
ATCTCCTACT TCCGCGACCC CTATCACGTG TGGTACCAGG GCAACGCGTC GCT6G6GGGA 180 



236 

BNSDCXIID: <WO 02079492A2_L> 



wo 02/079492 



PCT/US02/04915 



CACCTAACX3C ACGTGCTGGA AGGCCCAGAC ACCAACACCA CGATCATTCA GCTGCAGCCC 240 

TTGCAGGAGC CCX3AGAGCTG GGCGCGCACG CAGAGTGGCC TGCAGTCCTA CCT6CTCCAG 300 

TTCCACGGCC TCGTGCGCCT GGTGCACCAG GAGCGGACCT TGGCCTTTCC TCTGACCATC 360 

CGCTGCTTCC TGGGCTGTGA GCTGCCTCCC GAGGGCTCTA GAGCCCATGT CTTCTTCGAA 420 

GTGGCTGTGA ATGGGAGCTC CTTTGTGAGT TTCCGGCCGG AGAGAGCCTT GTGGCAGGCA 480 

GACACCCAGG TCACCTCCGG AGTGGTCACC TTCACCCTGC AGC^GCTCAA TGCCTACAAC 540 

CGCACTCGGT AT6AACTGCG GGAATTCCTG GAGGACACCT GTGTGCAGTA TGTGCAGAAA 600 

CATATTTCCG CGGAAMCAC GAAAGGGAGC CA;y\C7^GCC GCTCCTACAC TTCGCTGGTC 660 

CTGGGCGTCC TGGTGGGCGG TTTCATCATT GCTGGTGTGG CTGTAGGCAT CTTCCTGTGC 720 

ACAGGTGGAC GGCGATG TTA AT TACTCTCC AGCCCCGTCA GAAGGGGCTG GATTGATGGA 780 

GGCTGGCAAG GGAAAGTTTC AGCTCACTGT GAAGCCAGAC TCCCCAACTG AAACACCAGA 840 

AGGTTTGGAG TGACAGCTCC TTTCTTCTCC CACATCTGCC CACTGAAGAT TTGAGGGAGG 900 

GGAGATGGAG AGGAGAGGTG GACAAAGTAC TTGGTTTGCT AAGAACCTAA GAACGTGTAT 960 

GCTTTGCTGA ATTAGTCTGA TAAGTGAATG TTTATCTATC TTTGTG6AAA ACAGATAATG 1020 

GAGTTGGGGC AGGAAGCCTA TGCGCCATCC TCCAAAGACA GACAGAATCA CCTGAGGCGT 1080 

TCAAAAGATA TAACCAAATA AACAAGTCAT CCACAATCAA AATACAACAT TCAATACTTC 1140 

CAGGTGTGTC AGACTTGGGA TGGGACGCTG ATATAATAGG GTAGAAAGAA GTAACACGAA 1200 

GAAGTGGTGG AAATGTAAAA TCCAAGTCAT ATGGCAGTGA TCAATTATTA ATCAATTAAT 1260 
AATATTAATA AATTTCTTAT ATTT 



Seq ID NO: 121 Protein secmence : 
Protein Accession NP_006395.1 

1 11 21 31 41 51 

I I I I I I 

MLTTLLPILIj IiSGWAFCSQD ASDGLQRLHM LQISYFRDPY HVWYQGNASI* GGHLTHVLEG 60 
PDTNTTIIQ1» QPLQEPESWA RTQSGIiQSYL LQFHGIjVRIiV HQERTIAFPI. TIRCPLGCEIi 120 
PPEGSRAHVP FEVAVNGSSF VSFRPBRALW QADTQVTSGV VTFTLQQIiNA YNRTRYEIiRE 180 

FiiEayrcvQYv qkhisaentk gsqtsrsyts lvlgvlvggf iiagvavgif lctggrrc 



seq ID NO: 122 DNA sequence 

Nucleic Acid Accession #: none found 

Coding sequence: 2-505 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I i I I I I 

C GAGA AGCTG GGAGAGACAC CACTTGTCCC TGAACAAGAC AATTCAGTAA CATCTATTCC 60 

TGAGATTCCT CGATGGGGAT CACAGAGCAC GATGTCTACC CTTCAAATGT CCCTTCAAGC 120 

CGAGTCAAAG GCCACTATCA CCCCATCAGG GAGCGTGATT TCCAAGTTTA ATTCTACGAC 180 

TTCCTCTGCC ACTCCTCAGG CTTTCGACTC CTCCTCTGCC GTGGTCTTCA TATTTGTGAG 240 

CACAGCAGTA GTAGTGTTGG TGATCTTGAC CATGACAGTA CTGGGGCTTG TCAAGCTCTG 300 

CTTTCACGAA AGCCCCTCTT CCCAGCCAAG GAAGGAGTCT ATGGGCCCGC CGG6CCTGGA 360 

GAGTGATCCT GAGCCCGCTG CTTTGGGCTC CAGTTCTGCA CATTGCACAA ACAATGGGGT 420 

GAAAGTCGGG GACT6TGATC TGCGGGACAG AGCAGAGGGT GCCTTGCTGG CGGAGTCCCC 480 

TCTTGGCTCT AGTGATGC AT AGG GAAACAG 6GGACATGGG CACTCCTGTG AACAGTTTTT 540 

CACTTTTGAT GAAACGGG6A ACCAAGAGGA ACTTACTTGT GTAACTGACA ATTTCTGCAO 600 

AAATCCCCCT TCCTCTAAAT TCCCTTTACT CCACTGAGGA GCTAAATCAG AACTGCACAC 660 

TCCTTCCCTG ATGATAGAGG AAGTGGAAGT GCCTTTAGGA TGGTGATACT GGG6GACCGG 720 

GTAGTGCTGG GGAGAGATAT TTTCTTATGT TTATTCGGM AATTTGGAGA AGTGATTGAA 780 

CTTTTCAAGA CATTGGAAAC AAATAGAACA CAATATAATT TACATTAAAA AATAATTTCT 840 

ACCAAAATG6 AAAGGAAATG TTCTATGTTG TTCAGGCTAG GAGTATATTG GTTCGAAATC 900 
CCAGG6AAAA AAATAAAAAT AAAAAATTAA AGGATTGTTG ATAAAA 

Seq ID No: 123 Protein secfuence ; 
Protein Accession ft: none found 

1 11 21 31 41 51 

I I I I I I 

EKIiGETPLVP EQDNSVTSIP EIPRWGSQST MSTLQMSLQA ESKATITPSG SVISKFNSTT 60 

SSATPQAFDS SSAWFIFVS TAWVLVILT MTVIiGLVKLC FHESPSSQPR KESMGPPGLE 120 
SDPEPAALGS SSAHCTNNGV KVGDCDIiRDR AEGAMiABSP LGSSDA 



Seq ID NO: 124 DNA sequence 

Nucleic Acid Accession #: NM__006500.1 

Coding sequence: 27-1967 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

ACTTGCGTCT CGCCCTCCGG CCAAG CATGG GGCTTCCCAG GCTGGTCT6C GCCTTCTTGC 60 

TCGCCGCCTG CTGCT6CT6T CCTCGCGTCG CGGGTGTGCC CGGA6AGGCT GAGCAGCCTG 120 

CGCCTGAGCT GGTGGAGGTG GAAGTGGGCA GCACAGCCCT TCTGAAGTGC GGCCTCTCCC 180 

AGTCCCAAGG CAACCTCAGC CATGTC6ACT GGTTTTCTGT CCACAAG6AG AAGCGGACGC 240 

TCATCTTCCG TGTGCGCCAG OGCCAG6GCC AGAGCGAACC TGGGGAGTAC GAGCAGCGGC 300 

TCAGCCTCCA GGACAGAGGG GCTACTCTGG CCCTGACTCA AGTCACCCCC CAAGACGAGC 360 



237 



wo 02/079492 



PCT/US02/04915 



GCATCTTCTT GTGCCAGGGC AAGCGCCCTC GGTCCCAGGA GTACCGCATC CAGCTCCGCG 420 

TCTACAAAGC TCCGGAGGAG CCAAACATCC AGGTCAACCC CCTGGGCATC CCTGTGAACA 4 80 

GTAAGGAGCC TGAGGAGGTC GCTACCTGTG TAGGGAGGAA CGGGTACCCC ATTCCTCAAG 540 

TCATCTGGTA CAAG7UVTGGC CGGCCTCTGA AGGAGGAGAA GAACCGGGTC CACATTCAGT 600 

5 CGTCCCAGAC TGTGGAGTCG AGTGGTTTGT ACACCTTGCA GAGTATTCTG AAGGCT^CAGC 660 

TGGTTAAAGA AGACAAAGAT GCCCAGTTTT ACTGTGAGCT CAACTACCGG CTGCCCAGTG 720 

GGAACCACAT GAAGGAGTCC AGGGAAGTCA CCGTCCCTGT TTTCTACCCG ACAGAAAAAG 780 

T6TGGCTGGA AGTGGAGCCC GTGGGAATGC TGAAGGAAGG GGACCGCGTG GAAATCAGGT 840 

GTTTGGCTGA TGGCAACCCT CCACCACACT TCAGCATCAG CAAGCA6AAC CCCT^GCACCA 900 

10 6G6AGGCAGA GGAAGAGACA ACCAACGACA ACX3GGGTCCT GGT6CTGGAG CCTGCCCG6A 960 

AGGAACACAG TGGGCGCTAT GAATGTCAGG CCTGGAACTT GGACACCATG ATATCGCT6C 1020 

TGAGTGAACC ACAGGAACTA CTGGTGAACT ATGTGTCTGA CGTCCGAGTG AGTCCCGCAG 1080 

CCCCTGA6AG ACAG6AAGGC A6CA6CCTCA CCCT6ACCT6 TGAGGCA6AG A6TAGCCAGG 1140 

ACCTC6AGTT CCAGTG6CTG AGAGAAGA6A CAGACCAG6T GCTGGAAAGG G6GCCTGTGC 1200 

15 TTCAGTTGCA TGACCT6AAA CGGGAGGCAG GA6GCGGCTA TCGCTGCGTG GCGTCTGTGC 1260 

CCAGCATACC CGGCCTG/g^C CGCACACAGC TG6TCAAGCT GGCCATTTTT GGCCCCCCTT 1320 

6GATGGCATT CAAGGAGAGG AA66TGTGGG TGAAAGAGAA TAT6GTGTTG AATCrTGTCTT 1380 

GTGAAGCGTC AGGGCACCCC CGGCCCACCA TCTCCTGGAA CGTCAACGGC ACGGCAAGTG 144 0 

AACAAGACCA AGATCCACAG CGAGTCCTGA GCACCCTGAA TGTCCTCGTG ACCCCGGAGC 1500 

20 TGTTGGAGAC AGGTGTTGAA TGCACGGCCT CCAACGACCT GGGCAAAAAC ACCAGCATCC 1560 

TCTTCCTGGA GCTG6TCAAT TTAACCACCC TCACACCAGA CTCCAACACA ACCACTGGCC 1620 

TCAGCACTTC CACTGCCAGT CCTCATACCA GAGCCAACAG CACCTCCACA GAGAGAAAGC 1680 

TGCCGGAGCC GGAGAGCCGG GGCGTGGTCA TCGTGGCTGT GATTGTGTGC ATCCTGGTCC 1740 

TGGCGGTGCT GGGCGCTGTC CTCTATTTCC TCTATAAGAA GGGCAAGCTG CCGTGCAGGC 1800 

25 GCTCAGGGAA GCAGGAGATC ACGCTGCCCC CGTCTCGTAA GACCX3AACTT GTAGTTGAAG 1860 

TTAAGTCAGA TAAGCTCCCA GAAGAGATGG GCCTCCTGCA GGGCAGCAGC GGTGACAAGA 1920 

GGGCTCCGGG AGACCAGGGA GAGAAATACA TCGATCTGAG GCA TTAG CCC CGAATCACTT 1980 

CAGCTCCCTT CCCTGCCTGG ACCATTCCCA GCTCCCTGCT CACTCTTCTC TCAGCCAAAG 2040 

CCTCCAAAGG GACTAGAGAG AAGCCTCCTG CTCCCCTCAC CTGCACACCC CCTTTCAGAG 2100 

30 GGCCACTGGG TTAGGACCTG AGGACCTCAC TTGGCCCTGC AAGCCGCTTT TCAGGGACCA 2160 

GTCCACCACC ATCTCCTGCA CGTTGAGTGA AGCTCATCCC AAGCAA6GAG CCCCAGTCTC 2220 

CCGAGCGGGT AGGAGAGTTT CTTGCAGAAC GTGTTTTTTC TTTACACACA TTATGGCTGT 2280 

AAATACCTGG CTCCTGCCAG CAGCTGAGCT GGGTAGCCTC TCTGAGCTGG TTTCCTGCCC 2340 

CAAA66CTGG CTTCCACCAT CCAGGTGCAC CACTGAAGT6 A66ACACACC GGA6CCAGGC 2400 

35 GCCTGCTCAT GTTGAAGTGC GCTGTTCACA CCCGCTCCGG AGAGCACCCC AGC6GCATCC 2460 

AGAAGCAGCT GCAGTGTTGC TGCCACCACC CTCCTGCTCG CCTCTTCAAA GTCTCCTGT6 2520 

ACATTTTTTC TTTGGTCAGA AGCCAGGAAC TGGTGTCATT CCTTAT^GA TAC6TGCCG6 2580 

GGCCAGGTGT GGTGGCTCAC GCCTGTAATC CCAGCACTTT GGGAGGCCGA GGCGGGCG6A 2640 

TCACAAAGTC AGGACGAGAC CATCCTGGCT AACACGGTGA AACCCTGTCT CTACTAAAAA 2700 

40 TACAAAAAAA AATTAGCTAG GCGTAGTGGT TGGCACCTAT AGTCCCAGCT ACTCGGAAGG 2760 

CTGAAGCAGG AGAATGGTAT GAATCCAGGA GGTGGAGCTT GCAGTGAGCC GAGACCGTGC 2820 

CACTGCACTC CAGCCTGGGC AACACAGCGA GACTCCGTCT CGAGGAAAAA 7UUUVGAAAAG 2880 

ACGCGTACCT GCGGTGAGGA AGCTGGGCGC TGTTTTCGAG TTCAGGTGAA TTAGCCTCAA 2940 

TCCCCGTGTT CACTTGCTCC CATAGCCCTC TTGATGGATC ACGTAAAACT GAAAGGCAGC 3000 

45 GGGGAGCAGA CAAAGATGAG GTCTACACTG TCCTTCATGG GGATTAAAGC TATGGTTATA 3060 

TTAGCACCAA ACTTCTACAA ACCAAGCTCA GGGCCCCAAC CCTAGAAGGG CCCAAATGAG 3120 

AGAATGGTAC TTAGGGATGG AAAACGGGGC CTGGCTAGAG CTTCGGGTGT GTGTGTCTGT 3180 

CTGTGTGTAT GCATACATAT GTGTGTATAT ATGGTTTTGT CAGGTGTGTA AATTTGCAAA 3240 

TTGTTTCCTT TATATATGTA TGTATATATA TATATGAAAA TATATATATA TATGAAAAAT 3300 

50 AAAGCTTAAT TGTCCCAGAA AATCATACAT TGCTTTTTTA TTCTACATGG GTACCACAGG 3360 

AACCTGGGGG CCTGTGAAAC TACAACCAAA AGGCACACAA AACCGTTTCC AGTTGGCAGC 3420 

A6AGATCAGG GGTTACCTCT GCTTCTGAGC AAATGGCTCA AGCTCTACCA GAGCAGACAG 3480 

CTACCCTACT TTTCAGCAGC AAAAC6TCCC GTATGACGCA 6CAC6AA6GG CCT6GCA66C 3540 
TGTTAGCAGG AGCTATGTCC CTTCCTATCXS TTTCCGTCCA CTT 



55 



Seq ID No: 125 Protein aeouence; 
Protein Accession #: NP 006491.1 



60 1 11 21 31 41 51 

I I I I I I 

MGIiPRLVCAP LIjAACCCCPR VAGVPGEAEQ PAPELVEVEV GSTALLKCGL SQSQGNIiSHV 60 

DWFSVHKEKR TlilFRVRQGQ GQSEPGEYEQ RIjSIiQDRGAT IALTQVTPQD ERIFLCQGKR 120 

PRSQEYRIQL RVYKAPEEPN IQVNPIiGIPV NSKBPEEVAT CVGRNGYPIP QVIWYKNGRP 180 

65 LKEEKNRVHI QSSQTVESSG LYTLQSILKA QLVKEDKDAQ FYCELNYRLP SGNHMKESRE 240 

VTVPVFYPTE KVWIiEVEPVG MIiKEGDRVEI RCLADGNPPP HFSISKQNPS TREAEEETTN 300 

DNGVLVLEPA RKEHSGRYEC QAWNLDTMIS LLSEPQELLV NYVSDVRVSP AAPERQEGSS 360 

I*TIiTCEJ\ESS QDLEFQWIiRE ETDQVLERGP VLQLHDLKRE AGGGYRCVAS VPSIPGI.NRT 420 

QLVKLAIFGP PWMAFKERKV WVKENMVLNL SCEASGHPRP TISWNVNGTA SEQDQDPQRV 480 

70 LSTIiNVLVTP ELLETGVECT ASNDLGKNTS ILFLELVNLT TLTPDSNTTT GLSTSTASPH 540 

TRANSTSTER KLPEPESRGV VIVAVIVCIL VLAVLGAVLY FLYKKGKLPC RRSGKQEITL 600 
PPSRKTEDW EVKSDKLPBE MGLLQGSSGD KRAPGDQGEK YIDLRH 



75 Seq ID NO: 126 DMA sequence 

Nucleic Acid Accession #: NM 001955.1 



238 



BNSOCCID: <WO O2O70492A2_I_> 



wo 02/079492 



PCT/US02/04915 



Coding sequence: 337-975 (xuiderliiied sequences correspond to start and stop codons) 



10 



15 



20 



25 



GGAGCTGTTT 
AA6TCA6ACG 
AGCTCTCCAC 
AGGCGCT6CC 
AGGAACCC6C 
GTTTGAACGG 
TCTCTGCTGT 
AGCGCG6TGG 
CGGTCCAAGC 
CTGGACATCA 
AGGTCCAAGA 
TGCCAATGTG 
CTCAGGGCTG 
TCCAAGCTTG 
AGTTCAGAGG 
TCTTTTCATG 
CGAGCACATT 
TGTGGCCGAC 
TTCCTGACTG 
TCCCCCAACC 
TGGGGATGAC 



11 
I 

ACCCCCACTC 
CGCCTCTGCA 
CACCGCCGCG 
TTTTCTCCCC 
AGCGCTTTGA 
GAG6TTTTTG 
TTGTGGCTTG 
GT6AGAACG6 
GCTGCTCCTG 
TTTGGGTCAA 
GAGCCTTGGA 
CTAGCCAAAA 
AAGACATTAT 
GGAAAAAGTG 
AACACCTAAG 
ATCCCAAGCT 
GGTG^CAGAC 
TCTGCACTCT 
GCAAAGGACC 
ATCTTCACTG 
AATGGACCTC 



21 
I 

TAATAGGGGT 
TCTGC6CCAG 
TGCGCCTGCA 
GTTAAAGGGC 
GGGACCTGAA 
ATCCCTTTTT 
CCAAGGA6CT 
CGGGGAGAAA 
CTCGTCCCTG 
CACTCCCGAG 
GAATTTACTT 
AGACAAGAAG 
GGAGAAAGAC 
TATTTATCAG 
ACAAACCAGG 
GAAAGGCAAG 
TTCGGGGCCT 
CCACCCTGGC 
AGCGTCCTCG 
GCTTCCATCA 
TCAGCAGAAA 



31 
I 

TCAATATAAA 
GCGAACGGGT 
GACGCTCCGC 
ACTTGGGCTG 
GCTGTTTTTC 
TTCAGAATGQ 
CCAGAAACAG 
CX:CACTCCCA 
ATGGATAAAG 
CACGTTGTTC 
CCCACAAAGG 
TGCTGGAATT 
TGGAATAATC 
CAGTTAGTGA 
TCGGAGACCA 
CCCTCCAGAG 
GTCTGAAGCC 
TGGGATCAGA 
TTCAAAACAT 
GTGGTAACTG 
CACACAGTCA 



41 
I 

AAGCCX3GCAG 
CCTGCGCCTC 
TCGCTGCCTT 
AAGGATCX3CT 
TTCGTTTTCC 
ATTATTTGCT 
CAGTCTTAG6 
GTCCACCCT6 
AGTGTGTCTA 
CGTATGGACT 
CAACAGACCG 
TTTGCCAAGC 
ATAAGAAAGG 
GAGGAAGAAA 
TGAGAAACAG 
AGCGTTATGT 
ATAGCCTCCA 
GCAGGAGCAT 
TCCAAGAAAG 
CTTTGGTCTC 
CATTCGAATT 



51 
I 

AGAGCTGTCC 
CTGCAGTCCC 
CTCTCCTGGC 
TTGAGATCTG 
TTTGGGTTCA 
CATGATTTTC 
C6CTGAGCTC 
GCGGCTCCGC 
CTTCTGCCAC 
TGGAAGCCCT 
TGAGAATAGA 
AGGAAAAGT^ 
AAAAGACTGT, 
AATCAGAAGA 
CGTCAAATCA 
GACCCACAAC 
CGGA6AGCCC 
CCTCTGCTGG 
GTTAAGGAGT 
TTCTTTCATC 
C 



60 
120 
180 
240 
300 
360. 
420 
480 
540 
600 
660 
720 
780 
640 
900 
960 
1020 
1080 
1140 
1200 



30 



Seq ID No: 127 Protein sequence: 
Protein Accession #: NP 001946.1 



1 11 21 31 41 51 

I I I I I I 

MDYLLMXFSL LFVACQGAPE TAVLGAELSA VGENGGEKPT PSPPWRLRRS KRCSCSSLMD 
35 KECVYFCHLD IIWVNTPEHV VPYGLGSPRS KRALENIiIiPT KATDRENRCQ CASQKDKKCW 
NFCQAGKEIiR AEDIMEKDWN NHKKGKDCSK LGKKCIYQQL VR6RKIRRSS BEHZjRQTRSE 
TMRNSVKSSF HDPKLKGKPS RERYVTHNRA HW 



60 
120 
180 



40 Seq ID NO: 128 DNA sequence 

Nucleic Acid Accession #: NM__001721.1 

Coding sequence: 34-2061 (underlined sequences correspond to start and stop codons) 



45 



50 



55 



60 



65 



70 



75 



GCAAGCACGG 
CTTCTTCTCA 
CTTTTT6TTT 
AGCAGAAAAG 
GAGCA6AC6C 
TATGTCTATG 
ATAAGG6GTA 
TTCCTGTGTT 
GCTAATCTGC 
GTGCTGAAGA 
ACTCTAGCCC 
AGTACCAGTC 
TTCAACATGC 
AATIAGTAGCA 
CACACCACCT 
AACCTGGATG 
CTCAGACAAA 
TACACAGTGT 
CACGTGCATA 
ATTCCAAAGC 
CACCCTGTGT 
TGGGAACTGA 
GTG6TCCAGC 
GGCTCCATGT 
CCCAAGCTGG 
GAATATATAA 
CCTTCCCAGC 
CACCAATTCA 
GT6AAAGTAT 



11 
I 

AACAAGCTGA 
AAAGATCACA 
TGACCAAAAC 
GATCCATTGA 
CTGTAGAGAG 
CATCAAATGA 
ACCCCCACCT 
GCCAGCAGA6 
ATACTGCAGT 
TACCTCGGGC 
AATATGACAA 
TAGCGCAATA 
AGTATATTCC 
GCAGCAGTGA 
CAAAGATTTC 
ATTATGACTG 
AGGGAAAAGA 
CCTTATTTAG 
CAAATGCTGA 
TTATTCATTA 
CAACAAAGGC 
AAAGAGAAGA 
TGGGCAAGTG 
CAGAAGAT6A 
TTAAATTCTA 
6CAATGGCTG 
TCTTAGAAAT 
TACACCG6GA 
CTGACTTT6G 



21 
I 

GAGG6ATGAT 
GCAAAAGAAG 
AAACCTTTCC 
AATTAAGAAA 
ACAGTACCCA 
AGAGAGCCGA 
GCTGGTCAAG 
CTGTAAA6CA 
CAATGAAGAG 
AGTTCCTGTT 
CGAATCT^G 
TGACAGCAAC 
AAGGGAAGAC 
AGATGTTGCA 
ATGGGAATTC 
GTTTGCTGGT 
AGGAGCATTT 
TAAGGCTGTG 
GAACAAATTA 
TCATCAACAC 
CAACAAGGTC 
GATTACCTTG 
GAAGGGGCAG 
ATTCTTTCAG 
TGGAGTGTGT 
CTTGCTGAAT 
GTGCTACGAT 
CTTGGCTGCT 
AATGACAAGG 



31 
I 

AATATGGATA 
AAAATGTCAC 
TACTATGAAT 
ATCAGATGTG 
TTTCAGATTG 
AGTCAGTGGT 
TACCATAGTG 
GCCCCAGGAT 
AAACACAGAG 
CTCAAAATGG 
AAAAACTATG 
TCAAAGAAAA 
TTCCCTGACT 
AGCAGTAACC 
CCTGAGTCAA 
AACATCTCCA 
ATGGTTAGAA 
AATGATAAAA 
TACCTGGCAG 
AATTCAGCAG 
CCCGACTCT6 
TTGAAGGAGC 
TATGATGTTG 
GAGGCCCA6A 
TCAAAGGAAT 
TACCTGAGGA 
GTCTGTGAAG 
CGTAACTGCT 
TATGTTCTTG 



41 
I 

CAAAATCTAT 
CAAATAATTA 
ATGACAAAAT 
TG6AGAAA6T 
TCTATAAA6A 
TGAAAGCATT 
GGTTCTTCGT 
GTACCCTCTG 
TTCCCACCTT 
ATGCACCATC 
GCTCCCAGCC 
TCTATGGCTC 
GGTGGCAAGT 
AAAAA6AAAG 
GTTCATCTGA 
GATCACAATC 
ATTCGAGCCA 
AAGGAACTGT 
AAAACTACTG 
GCATGATCAC 
TGTCCCTGG6 
TGGGAA6TGG 
CTGTTAAGAT 
CTATGATGAA 
ACCCCATATA 
GTCAGGGAAA 
GCATGGCCTT 
T6GTGGACAG 
ATGACCA6TA 



51 
I 

TCTAGAAGAA 
CAAAGAACG6 
GAAAA6G6GC 
AAATCTCGAG 
TGGGCTTCTC 
ACAAAAAGA6 
GGACGGGAAG 
GGAAGCATAT 
CCCAGACAGA 
TTCAAGTACC 
ACCATCTTCA 
CCAGCCAAAC 
AAGAAAACTG 
AAATGTGAAT 
AGAAGAGGAA 
TGAACAGTTA 
AGTGGGAATG 
CAAACATTAC 
TTTTGATTCC 
AGGGCTCCGC 
AAATGGAATC 
CCA6TTTGGA 
GATCAAGGAG 
ACTCAGCCAT 
CATAGT6ACT 
A6GACTTGAA 
CTTGGAGAGT 
A6ATCTCT6T 
TGTCAGTTCA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 



239 



BNSDCXID: <WO 0207049ZA2.I.> 



wo 02/079492 



PCT/US02/04915 



10 



GTCGGAACAA 
AGCAGCAAGT 
AAGCAGCCCT 
AG6CTTTACC 
CACGAGCTTC 
CGGGAAAAA6 
CTGGCCAGCA 
TTTTAATAGT 
ATTCCCTTGA 
TTTCCAGCCT 
AAAGACT6AG 
ATTGTCACAA 



AGTTTCCAGT 
CAGACGTATG 
ATGACTTGTA 
GGCCCCACCT 
CAGAAAAGCG 
ACAAGCATTG 
TTTTCATTCA 
GTTCTCTGTA 
AATTTAGATC 
ATA6CAGAAG 
CAGAACTGAA 
CAATTAAATA 



CAAGTGGTCA 
GGCATTTGGG 
TGACAACTCC 
GGCATCGGAC 
TCCCACATTT 
AAGAA6AAAT 
TTTTAAG6AA 
TT6TCTATTA 
AAATTAGTAA 
CACATTTTCA 
AAATTACTTA 
TACTACCAAG 



GCTCCAGAGG 
ATCCTGATGT 
CAGGTGGTTC 
ACCATCTACC 
CAGCAACTCC 
TAG6AGT6CT 
AGTAGGAAG6 
TTTAGAAATG 
TTTTGTTTTA 
GACTGCAATA 
TTGGATATTC 
TACAGAAATG 



TGTTTCATTA 
GGGAGGTGTT 
TGAAGGTCTC 
AGATCATGTA 
TGTCTTCCAT 
6ATAAGAATG 
CATAAGTAAT 
AACAAGGCAG 
TGCTGCTCCT 
TA6AGACTGT 
ATTCTTTTCT 
TGGAAAAAAA 



CTTCAAATAC 
CAGCCTGGGG 
CCAGGGCCAC 
CAGCTGCTGG 
TGAACCACTT 
AATATAGATG 
TTTAGCTAGT 
GAAACATiAAG 
GATATAACAC 
GTTCAT6T6T 
TTATATTGTC 
AAACCG 



1800 
1B60 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 



15 



Seq ID No: 129 Protein secmence: 
protein Accession #: NP 001712.1 



20 



25 



30 



MDTKSIIiEEL 
RCVEKVNLEE 
HSGFFVDGKF 
KMDAPSSSTT 
PDWWQVRKLK 
ISRSQSEQLIj 
IiAENYCFDSI 
KEL6SGQFGV 
KBYPIYIVTE 
NCIiVDRDLCV 
LMWEVFSLGK 
QIiLSSIEPIiR 



11 

1 

LLKRSQQKKK 
QTPVERQYPF 
LCCQQSCKAA 
LAQYDNESKK 
SSSSSEDVAS 
RQKGKEGAFM 
PKLIHYHQHN 
VQLGKWKGQY 
yiSNGCLLNY 
KVSDFGMTRY 
QPYDLYDNSQ 
EKDKH 



21 
1 

MSPNNYKERL 
QIVYKDGIiIjY 
PGCTIiWEAYA 
NYGSQPPSSS 
SNQKERNVNH 
VRNSSQVGMY 
SAGMITRLRH 
DVAVKMIKEG 
LRSHGKGLEP 
VLDDQYVSSV 
WIiKVSQGHR 



31 
I 

FVIiTKTNIiSY 
VYASNEESRS 
NliHTAVNEEK 
TSIiAQYDSNS 
TTSKISWEFP 
TVSLFSKAVN 
PVSTKANKVP 
SMSEDEFFQE 
SQLIiEMCYDV 
GTKPPVKWSA 
IiYRPHIiASDT 



41 
I 

YEYDKMKRGS 
QWDKALOKBI 
HRVPTFPDRV 
KKIYGSQPNF 
ESSSSEEEEN 
DKKGTVKHYH 
DSVSLGNGIW 
AQTMMKLSHP 
CEGMAFIiESH 
PEVFHYFKYS 
lYQIMYSCWH 



51 
I 

RKGSIEIKKI 
RGNPHIjLVKY 
LKIPRAVPVL 
NMQYIPREDF 
LDDYDWFAGN 
VHTNAENKIiY 
EIiKREEITLL 
KLVKFYGVCS 
QFIHRDIiAAR 
SKSDVWAFGI 
BLPBKRPTFQ 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 



Seq ID NO: 130 DNA aemience 
35 Nucleic Acid Accession #: NM_012072.2 

Coding sequence: 149-2107 (underlined sequences correspond to start and stop codons) 



40 



45 



50 



55 



60 



65 



70 



75 



1 
I 

AAAGCCCTCA 
CCCCTTGGGG 
TCCCGCAGAG 
GCTGCTGCTC 
CGTGGGGACC 
CCACTGCAAC 
CGTCCAGCGA 
CAAGTTCTGG 
GAAGGGCTTC 
GCTCC6GAAC 
GCTCCTTCCC 
CGGAA6TAAC 
GGCCCTGGGG 
CTTGGAGGCT 
CGAGACTCAG 
CAGCTCGGGC 
CCACCAGGAC 
CCGGCTGCTG 
TCGTGGGGGG 
CCAAGGGTAC 
CTCCCCCTGT 
TGGCTATGAG 
GGGTCGCTCG 
TGAGGAGGGC 
TGTGGGCCCG 
CTGTGGCTGC 
TGTGTCTCTG 
GAGCACCGTG 
GGCTACACCC 
ACTCAAGATG 
CX;CCACAGCT 
AAACAACGAT 
GGCCATCCTA 
GAAGAGGGAG 
TCCAGAGCGA 
CTGCTGAAAG 



11 

I 

GCCTTTGTGT 
CCCAGCTGGG 
GGCCACACAG 
CTGACCCAGC 
GCCTGCTACA 
CAGAACG6GG 
GTACTGGCCC 
ATTGGGCTCC 
AGCTGGGTGG 
TCGTGCATCT 
AACCGCCTGC 
ATTGAGGGCT 
6GCCCAGGTC 
GTGCCCTTTG 
AGTCATEATT 
CCCCTCTGTG 
TGCTTTGAAG 
GATGACCTGG 
GCCACGTGCG 
CAGCTGGACT 
GCCCAGGAGT 
CCGGGCGGTC 
CCTTGCGCCC 
TACGTCCTGG 
6GGGGCCCCC 
CTGCCAGGCT 
GGACCACCAT 
CCCCGCGCTG 
ACCACAAGTA 
CTGGCCCCCA 
GCCTCTGGCC 
G6CACTGACX3 
CTCCTGCTGG 
GAGAAGAAGG 
GCTGA6AGCA 
T6AG6TGGCC 



21 



31. 



CCTTCTCTGC GCCGGAGTGG 
AGCCGAGATA GAAGCTCCTG 
AGACCGGGA2LGGCCACCTCC 
CCX3GGGCGGG GACGGGAGCT 
CGGCCCACTC GGGCAAGCTG 
GCAACCTGGC CACTGTGAAG 
AGCTCCTGAG GCGGGAGGCA 
AGCGAGAGAA GGGCAAGTGC 
GCGGGGGGGA GGACACGCCT 
CCAAGCGCTG TGTGTCTCTG 
CCAAGTGGTC TGAGG6CCCC 
TCGT6TGCAA GTTCAGCTTC 
A66TGACCTA CACCACCCCC 
CCTCTGCGGC CAATGTAGCC 
TCCTGTGCAA GGAGAAGGCC 
TCAGCCCCAA GTATGGCTGC 
GGGGGGATGG CTCCTTCCTC 
TGACCTGTGC CTCTCGAAAC 
TCCTGGGACC CCATGGGAAA 
CGAGTCAGCT GGACTGTGTG 
GTGTCAACAC CCCTGGGGGC 
CTGGAGAGGG GGCCTGTCAG 
AGGGCTGCAC CAACACAGAT 
CCGGGGAGGA CGGGACTCAG 
TCTGCGACAG CTTGTGCTTC 
GGGTGCTGGC CCCAAATGGG 
CTGGGCCCCC CGATGAGGAG 
CAACAGCCAG TCCCACAAGG 
GACCTTCGCT GTCATCTGAC 
GTGGGTCCTC AGGCGTCTGG 
CCCAG6AGCC TGCAGGTGGG 
GGCAAAAGCT GCTTTTATTC 
CCCTG6CTCT GGGGCTACTG 
AGAAGAA6CC CCA6AATGCG 
GGGCCAT66A GAACCA6TAC 
CTAGAGACAC TAGAGTCACC 



41 
I 

CTGCAGCTCA 
TCGCCGCTGG 
ATGGGCCTGC 
GACACGGAGG 
AGCGCTGCCG 
AGCAAGGAGG 
GCCCTGACGG 
CTGGACCCTA 
TACTCTAACT 
CTGCTGGACC 
TGTGG6AGCC 
AAAG6CATGT 
TTCCAGACCA 
T6TGG6GAAG 
CCCGAT6TGT 
AACTTCAACA 
TGCGGCTGCC 
CCTTGCAGCT 
AACTACACGT 
GACGTGGATG 
TTCCGCTGCG 
GATGTGGATG 
GGCTCATTTC 
TGCCAGGACG 
AACACACAAG 
GTCTCTTGCA 
GACAAAGGAG 
GGCCCCGAGG 
GCCCCCATCA 
AGGGAGCCCA 
GACTCCTCCG 
TACATCCTAG 
GTCTATCGCA 
GCAGACA6TT 
A6TCCGACAC 
AGCCACCATC 



51 
I 

CCCCTCAGCT 
GCTTCTCGCC 
TGCTGCTGCT 
CGGTGGTCTG 
AGGCCCAGAA 
AGGCCCAGCA 
CGAGGATGAG 
GTCTGCCGCT 
GGCACAAGGA 
TGTCCCAGCC 
CAGGCTCCeC 
6CCGGCCTCT 
CCAGTTCCTC 
6TGACAAGGA 
TCGACTGGGG 
ATGGGGGCTG 
GACCAGGATT 
CCAGCCCATG 
GCCGCTGCCC 
AATGCCAGGA 
AATGCTGGGT 
AGTGTGCTCT 
ACTGCTCCTG 
TGGATGAGTG 
GGTCCTTCCA 
CCATGGGGCC 
AGAAAGAAGG 
GCACCCCCAA 
CATCTGCCCC 
GCATCCATCA 
TGGCCACACA 
GCACC6TGGT 
AGCGGAGAGC 
ACTCCTGGGT 
CTGG6ACAGA 
CTCAGAGCTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 



240 



3NSDOCID: <WO 02079492A2_L> 



wo 02/079492 PCT/US02/04915 



TGAACTCCCC ATTCCAAAGG GGCACCCACA 
CAAACAATTG TAAGTCTCCT CCTTAAAGGC 
TGTTTGATGT TCCTGAAGTG GAAGCTGTGT 
TCTATAATGA TTGTTACTCC CCCTCCCTTT 
5 G66TGTGAGG AG6CTGGGGC TAAGGGGCTC 
ATCTAAGAGG AAAT^GTGAG TTGCTCATGC 
CCTAG6ATGA AAACTAAATC AATTAATTAT 
TCAAAGGGAA CATGTTCGGA CTG6AAACAT 
A6CACAAGTC TTGCTAAATG TGATACTGTT 

10 TAACCTCTTA GGTGGCAAG6 AGGCAGGAAG 
CTTGGGTTTA TTTGCAAAGG AAGCTTGAAA 
CAGGTGTTTG TGAAGTCACA TAATCTACGG 
CACAGATACT TGAATTAATT CATCCAAATG 
TGTGATCAAC ACTAACAAGG AAACAAATTC 

15 CCTCAGACAC CCTGCCTGTG GCCCCGCCTC 
CGAGCTCAGA CAGAGGAAGC CCTGCAGAAA 
TGAACGGGAG ATGATGCACT GTGTTTTGAA 
TCATAGTCCA CAGTTGATGC AGCATCCTGA 
CACACCAAGT AGGGAGCTAG TCAGGCAGTT 

20 TTCCTTAAAA TTGGGGGTAA GGAGGGAAGG 
TTTTTACAGC AAAAACTGCT CAAAGCCATT 
TTGCAAATAT TTCTCCCTAT GATAATGCAG 
TCTCTCTCAC ACACACACAC ACACACACAC 
CCTGGGGCAC TGGAACACAT TCCTGGGGGT 

25 TGAGTATCTC TGGGAGGCCT CATGTCTCCT 
ACAGACAGAG GAAATGTGTC TCCCTCCAAG 
GGTTTTGCCT TAGCAATGCA TCGGTCTCTG 
CAA6GTGCAG GGTTAATACT CTTGCCAGTT 
TTTTAATA6A AAACTAAAGG GGCAG6GGAA 

30 TCGATGGGGC ATTTGGAACT TCTTTTTAAA 
AACTCT6GTG TTTAACACTT AAG6GAGACA 
GGCCACGAGA CTCTAGGTGA T6TGT6AAGC 
CTGTCT6GCC ATTCAGAGGA TTCTAAAGAC 
CACTTAAATA AATGCAAATG CAACATTTCT 

35 TCATTTGGGG TGAAGGAGAC ATTTCTGTCC 
GTATQATTCC TGGGATCCAA CGAGCCCTCC 
GCCCAGGCCC ATCGTCTGTT CTCTGAATGC 
GAACCCCTCT GTGGAACCCA CAAGGGGAGA 
ACCTTCCCTG GCAGGCTGGG TCCCTCTCCT 

40 CCCACCACGG GGGGAGAGCC AGCAACCCAA 
AACCACTGGG CTCAAACACG TGCTTTATTC 
ATGGAAATTC TTGTTTGGGQ GATCTTGGGG 
GCCAAGAGGC CATTAACAAA TCGTCCTTGT 
CACAGTGGGG AATCCAAGGG TCACAGTATG 

45 TCTCGCTAGA CACAGTGTTT CTGCCCAGGT 
CATGGGGACG GGGGJU^GTTT TCACTTGGAG 
CCAAATAGGT CAAT/^TTCT GGGAGACTCT 
TCTCTCCCTC CCCTCATCCC ACATCTCAAA 
CACCCAGCTC GCCATGCCTA CTCATTCCTG 

50 CTTCTTTGTC ATTTOAGAAA GGATGCA6GA 
CAGAAAAACC AGG6CAGGAC AGTTATCGAC 
TAGAGG6ACT CCACCCCTGC TCAACAGCTT 
CTCTGCCTTC GGTGGCCCAC ACACCTAAGC 
AACACATCTA CGTGTAGCAC TACGAC6TTA 

55 AGGCTCTGAT TAAGGATGTG GGGAAGTGGG 
CTGGAGGCCT GTCTGTTAGC CAGTGGTGGA 
TGCCATCTTC CCTGCGATCA GGCAAAAAAG 
TGTGTTATGT CCATTTTGCA GGATG/^CTG 
TTGCTTTGTC TTTTCCATCC TCATCACAAG 

60 TCTTTCGATG GATGGAGATG ATCATTAGGT 
ATTTCTGTGA AAACTAGGAG AACA6AGATG 
TAACACAGTC TTTTTAAAAC TAACATAGGA 
TCTCCATTGT CTAAATCAGG AAAACAGGAA 
TTAATGCCCC CTACATATTT CCATCACCTT 

65 TATGATCCCA GAAAACATCT GTCTCTACTT 
TGGTTTGTGC ATTTTCTCAA CTAAAAATAG 
CTAATCAAAG ACACTATTTT CATACTAGAT 
TTTAAAAATA AATTGTGTTT TGGTCTGTTC 
AAGCTCTGGA ATCCCTTTAT TGTGCTGTTG 

70 TTCAGCAGAT TTTGCCCACT ATTCCTCTGA 
GCTTGAATTA GATCCCTGCA AAGGCTTGCT 
GTAATCACTT CATGAATGCT AAAT6AGAAT 
TTTGTTTQAC TAATTCTGGA ATTACAA6AT 
ATGTTTCCCA AACT6TGAGG AGGGAA6GCT 

75 CAAAATG6TG CTTT6AGGGT C3VGCCTTTAG 
TCTGTTATGT GCCTATCCTA ATAAACTCTT 



TTT TT TT GAA AGACTGGACT GGAATCTTAG 2220 

CCCTTGGAAC ATGCAGGTAT TTTCTACGGG 2280 

GTTGGCGTGC CACGGTG6GG ATTTCGT6AC 2340 

TCAAATTCCA ATGTGACCAA TTCCX3GATCA 2400 

CCCTGAATAT CTTCTCTGCT CACTTCCACC 2460 

TGATTAGGAT TGATUWCGATT TGTTTCTCTT 2520 

TCAATTAGGT AAGAAGATCT GGTTTTTTGG 2580 

TTCTTTACAT TTGCATTCCT CCATTTCGCC 2640 

GACATCCTCC AGAATGGCCA GAAGTGCAAT 2700 

TGCCTCTTTA GTTCTTACAT TTCTAATAGC 2760 

AATATGAGAA AAGTTGCTTG AAGTGCATTA 2820 

GGCTAGGGC6 AGAGAGGCCA GGGATTTGTT 2880 

TACTGAGGTT ACCACACACT TGACTACGGA 2940 

AAGGACAACC TGTCTTTGAG CCAGGGCAGG 3000 

CACTTCATCC TGCCCGGAAT GCCAGTGCTC 3060 

GTTCCATCAG GCTGTTTCCT AAAGGATGTG 3120 

AGTTGTCATT TTAAAGCATT TTAGCACAGT 3180 

GATTTTAAAT CCTGAAGTGT GGGTGGCGCA 3240 

TGCTTAAGGA ACTTTTGTTC TCTGTCTCTT 3300 

AAGAGGGAAA GAGATGACTA ACTAA7VATCA 3360 

TAAATTATAT CCTCATTTTA AAAGTTACAT 3420 

TCGATAGTGT GCACTCTTTC TCTCTCTCTC 3480 

ACACACACAC AGAGACACGG CACCATTCTG 354 0 

CACCGATGGT CAGAGTCACT AGAAGTTACC 3600 

GTGGGCTTTT TACCACCACT GTGCAGGAGA 3660 

6CCCCAAA6C CTCAGA6AAA GG6TGTTTCT 3720 

AGGTGACACT CTGGAGTGGT TGAAGG6CCA 3780 

TTGAAATATA GATGCTAT6G TTCAGATTGT 3840 

GTGAAAGGAA AGATGGAG6T TTTGTGCGGC 3900 

GTCATCTCAT GGTCTCCAGT TTTCAGTTGG 3960 

AAGGCTGTGT CCATTTGGCA AAACTTCCTT 4020 

T6GGCAGTCT GT6GTGTGGA GAGCAGCCAT 4080 

ATGGCTGGAT GCGCTGCTGA CCAACATCAG 4140 

CCCTCTGGGC CTTGAAAATC CTTGCCCTTA 4200 

TTGGCTTCCC ACAGCCCCAA CGCAGTCTGT 4260 

TATTTTCACA GTGTTCTGAT TGCTCTCACA 4320 

AGCCCTGTTC TCAACAACAG GGAGGTCATG 4380 

AATGGGTGAT AAAGAATCCA GTTCCTCAAA 4440 

GCTGGGTGGT GCTTTCTCTT GCACACCACT 4500 

CCAGACAGCT CAGGTTGTGC ATCTGATGGA 4560 

TCCTGTTTAT TTTTGCTGTT ACTTTGAAGC 4620 

CTACAGTAGT GGGTAAACAA ATGCCCACCG 4680 

CCTGAGGGGC CCCAGCTTGC TCGGGCGTGG 4740 

GGGAGAGGTG CACCCTGCCA CCTGCTAACT 4800 

GACCTGTTCA GCAGCAGAAC AAGCCAGGGC 4860 

ATGGACACCA AGACT^TGAA GATTTGTTGT 4920 

TGGAAAAAAC TOAATATATT CAGGACCAAC 4980 

GCAGACAATG TAAAGAGAGA ACATCTCACA 5040 

AATTTCAGGT GCCATCACT6 CTCTTTCTTT 5100 

GGACAATTCC CACAGATAAT CTGAGGAATG 5160 

AATGCATTAG AACTTGGT6A GCATCCTCTG 5220 

GGCTTCCAGG CAAGACCAAC CACATCTGGT 5260 

GTCATCGTCA TT6CCATAGC ATCATGATGC 5340 

TGTTTGGGTA ATGTGGGGAT GAACTGCATG 5400 

CTGCGGTCAC TGTCGGCCTT GCAAGGCCAC 5460 

GGAGCAAGGC TTCAGGAAGG GCCAGCCACA 5520 

TGGAATTAAA AAGTCAAACC TTTATATGCA 5580 

AGTTTAAAAG AATTTTTTTT TCTCTTCAAG 5640 

CCCTTGTTTG AGTGTCTTAT CCCTGAGCAA 5700 

ACTTTTGTTT CAACCTTTAT TCCTGTAAAT 5760 

AGATTTGACA AAAAT^^T^TT GAATTAAAAA 5820 

AAGCCTTTCC TATTATTTCT CTTCTTAGCT 5880 

AACACAGCTT TCTAGCAGCT GCAAAATGGT 5940 

GAACAATAGC TTTAGCTTGG GAATCTGAGA 6000 

CGGCTGCAAA ACCCATGGTT TAAATCTATA 6060 

AGATGATAAT CCGAATTCTC CATATATTCA 6120 

TCCTGAGACA AATACTCACT GAAGGGCTT6 6180 

TTGTAGATAA TGCCCTTCTA TTTTAGGTA6 6240 

CTCTTATCTG CAAGGTGGCA A6CAGTTCTT 6300 

GCTGAAGTTC TTTGCATAGA TTTGGCTTAA 6360 

CTGTGATGTC A6AT6TAATT 6TAAATGTCA 6420 

GTAAGTATTT TTAAATGTGT QTATTTCAAA 6480 

TTCTATGCAG GATTTACCTT CATCCTGTGC 6540 

CA6AGATCGA GCTTCTCCTC TGAGTTCTAA 6600 

GAAGGTGCAG CTTTGTTQTC CTTTGAGCTT 6660 
AAACACATT 
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Seq ID No: 131 Protein aecmence ; 
Protein Accession #: NP 036204.1 



1 11 21 31 41 - 51 

I I i I I I 

MATSMGLLLL LLI.LLTQPGA 6T6ADTEAW CVGTACYTAH SGKIiSAAEAQ NHCNQNGGNIi 60 

ATVKSKEEAQ HVQRVXiAQIiL RRET^LTARM SKFWIGLQRE KGKCIiDPSLP LKGPSWVGG6 120 

EDTPYSNWHK ELRNSCISKR CVSItLLDLSQ PIiIiPNRLPKW SEGPC6SPGS P6SNIE6FVC 180 

KFSFKGMCRP IiALGGPGQVT YTTPFQTTSS SLEAVPFASA ANVACGEGDK DETQSHYFLC 240 

KEKAPOVFDW 6SSGPLCVSP KYGCNFNNGG CHQDCFEGGD GSFLCGCRPG FRIiZiDDLVTC 300 

ASRNPCSSSP CRGGATCVL6 PHGKNYTCRC PQGYQIiDSSQ LDCVDVDECQ DSPGAQBCVN 360 

TPGGFRCECW VGYEPGGPGE GACQDVDECA LGRSPCAQGC TNTDGSFHCS CEEGYVLAGB 420 

DGTQCQDVDB CVGPGGPLCO SLCFNTQGSF HCGCLP6WVL APNGVSCTMG PVSLGPPSGP 480 

PDEEOKGEKE 6STVPRAATA SPTR6PEGTP KATPTTSRPS I.SSDAPITSA PIiKMLAPSGS 540 

SGVWRBPSIH HATAASGPQE PAGGDSSVAT QNNDGTDGQK IiIaLFYIIiGTV VAILLLIiAIiA 600 
LGIiliVYRKRR AKREEKKEKK PQNAADSYSW VPERAESRAM ENQYSPTPGT DC 



Seq ID NO: 132 DNA seoruence 

Nucleic Acid Accession #: NM_000963.1 

Coding sequence: 135-1949 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I t 

CAATTGTCAT ACGACTTGCA GTGAGCGTCA GGAGCACGTC CAGGAACTCC TCAGCAGCGC 60 

CTCCTTCAGC TCCACAGCCA GACGCCCTCA GACAGCAAAG CCTACCCCCG CGCCGCGCCC 120 

TGCCCGCCGC TCG GATG CTC GCCCGCGCCC TGCTGCTGTG CGCGGTCCTG GCGCTCAGCC 180 

ATACAGCAAA TCCTTGCTGT TCCCACCCAT GTCAAAACCG AG6TGTATGT ATGAGTGTGG 240 

GATTTGACCA 6TATAAGTGC 6ATTGTACCC GGACAGGATT CTATGGAGAA AACT6CTCAA 300 

CACCGGAATT TTTGACAAGA ATAAAATTAT TTCTGAAACC CACTCCAAAC ACAGTGCACT 360 

ACATACTTAC CGACTTCAAG GGATTTTGGA ACGTTGTGAA TAACATTCCC TTCCTTCGAA 420 

ATGCAATTAT GAGTTATGTC TTGACATCCA GATCACATTT GATT6ACAGT CCACCAACTT 480 

ACAATGCTGA CTATGGCTAC AAAAGCTGGG AAGCCTTCTC TAACCTCTCC TATTATACTA 540 

GA6CCCTTCC TCCT6TGCCT GATGATTGCC CGACTCCCTT GGGTGTCAAA GGTAAAAAGC 600 

AGCTTCCTGA TTCAAATGAG ATT6TGGAAA AATTGCTTCT AA6AAGAAAG TTCATCCCTG 660 

ATCCCCAGGG CTCAAACATG ATGTTTGCAT TCTTTGCCCA GCACTTCACG CATCAGTTTT 720 

TCAAGACAGA TCATAAGCGA GGGCCAGCTT TCACCAACGG GCTGGGCCAT GGGGTGGACT 780 

TAMTCATAT TTACGGTGAA ACTCTGGCTA GACAGCGTAA ACTGCGCCTT TTC/U^GGATG 840 

GAAAAATGAA ATATCAGATA ATTGATGGAG AGATGTATCC TCCCACAGTC AAAGATACTC 900 

AGGCAGAGAT GATCTACCCT CCTCAAGTCC CTGAGCATCT ACGGTTTGCT GTGGGGCAGG 960 

AGGTCTTTGG TCTGGTGCCT GGTCTGATGA TGTATGCCAC AATCTGGCTG CGGGAACACA 1020 

ACA6AGTATG CGATGTGCTT AAACAGGAGC ATCCTGAATG GGGTGATGAG CAGTTGTTCC 1080 

AGACAAGCAG GCTAATACTG ATAGGAGAGA CTATTAAGAT TGTGATTGAA GATTATGTGC 1140 

AACACTTGAG TGGCTATCAC TTCAAACTGA AATTTGACCC AGAACTACTT TTCAACAAAC 1200 

AATTCCAGTA CCAAAATCGT ATTGCTGCTG AATTTAACAC CCTCTATCAC TGGCATCCCC 1260 

TTCTGCCTGA CACCTTTCAA ATTCATGACC AGAAATACAA CTATCAACAG TTTATCTACA 1320 

ACAACTCTAT ATTGCTGGAA CATGGAATTA CCCAGTTTGT TGAATCATTC ACCAGGCAAA 1380 

TTGCTGGCAG GGTTGCTGGT GGTAGGAATG TTCCACCCGC AGTACAGAAA GTATCACAGG 1440 

CTTCCATTGA CCAGA6CAGG CAGATGAAAT ACCAGTCTTT TAAT6AGTAC CGCAAACGCT 1500 

TTATGCTGAA GCCCTATGAA TCATTTGAAG AACTTACAGG AGAAA7U3GAA ATGTCTGCAG 1560 

AGTTGGAAGC ACTCTATGGT 6ACATCGATG CTGTGGAGCT GTATCCTGCC CTTCTGGTAG 1620 

AAAA6CCXCG GCCAGATGCC ATCTTTGGTG AAACCATG6T A6AAGTTGGA GCACCATTCT 1680 

CCTTGAAAGG ACTTATGGGT AATGTTATAT GTTCTCCTGC CTACTGGAAG CCAAGCACTT 1740 

TTGGTGGAGA AGTGGGTTTT CAAATCATCA ACACTGCCTC AATTCAGTCT CTCATCTGCA 1600 

ATAACGTGAA GGGCTGTCCC TTTACTTCAT TCAGTGTTCC AGATCCAGAG CTCATTAAAA 1860 

CAGTCACCAT CAATGCAAGT TCTTCCCGCT CCGGACTAGA TGATATCAAT CCCACAGTAC 1920 

TACTAAAAGA ACGTTCGACT GAACT GTAG A AGTCTAATGA TCATATTTAT TTATTTATAT 1980 

GAACCATGTC TATTAATTTA ATTATTTAAT AATATTTATA TTAAACTCCT TATGTTACTT 2040 

AACATCTTCT GTAACAGAAG TCAGTACTCC TGTTGCGGAG AAAGGAGTCA TACTTGTGAA 2100 

GACTTTTATG TCACTACTCT AAAGATTTTG CTGTTGCTGT TAAGTTTGGA AAACAGTTTT 2160 

TATTCTGTTT TATAAACCAG AGAGAAATGA GTTTTGACGT CTTTTTACTT GAATTTCAAC 2220 

TTATATTATA AGAACGAAAG TAAAGATGTT TGAATACTTA AACACTATCA CAAGATGGCA 2280 

AAATGCTGAA AGTTTTTACA CTGTCGATGT TTCCAATGCA TCTTCCATGA TGCATTAG7VA 2340 

GTAACTAATG TTTGAAATTT TAAAGTACTT TTGGTTATTT TTCTGTCATC AAACAAAAAC 2400 

AGGTATCAGT GCATTATTAA ATGAATATTT AAATTAGACA TTACCAGTAA TTTCATGTCT 2460 

ACTTTTTAAA ATCAGCAATG TU^CAATAAT TTGAAATTTC TAAATTCATA GGGTAGAATC 2520 

ACCTGTAAAA GCTTGTTTGA TTTCTTAAAG TTATTAAACT TGTACATATA CCAAAAAGAA 2580 

GCTGTCTTGG ATTTAAATCT GTAAAATCAG ATGAAATTTT ACTACAATTG CTTGTTAAAA 2640 

TATTTTATAA GTGATGTTCC TTTTTCACCA AGAGTATAAA CCTTTTTAGT GTGACTGTTA 2700 

AAACTTCCTT TTAAATCAAA ATGCCAAATT TATTAAGGTG GTG6AGCCAC TGCAGTGTTA 2760 

TCTCAAAATA AGAATATTTT GTTGAGATAT TCCAGAATTT GTTTATATGG CTGGTAACAT 2820 

GTAAAATCTA TATCAGCAAA AGGGTCTACC TTTAAAATAA GCAATAACAA AGAAGAAAAC 2880 

CAAATTATTG TTCAAATTTA GGTTTAAACT TTTGAAGCAA ACTTTTTTTT ATCCTTGTGC 2940 
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10 



15 



20 



25 



ACTGCAGGCC 
ATAACGATAT 
TTGCAAAAGT 
CATTAATTTT 
ACCTG«TGC 
CTCATCACTT 
T6GACTCTGC 
AGGACTGCTA 
AATAGTATAC 
ATCTGTAACC 
TGACTGGAAA 
CAAATGATAT 
ATCTCATTGT 
TTTATGTCTT 
TTTTTTGTTA 
AACAATCCAA 
CATATTGAGA 
AAAGAATATT 
GGGATCTGTG 
TCAAGCACTG 
TATAAATAAT 
AAAGATAACT 
AGAAATAGTC 
GATTTGTTAT 
GTTTCAGTGC 
TGTCTGTTTA 



TGGTACTCAG 
GTTTTCTCAG 
AGCAATGACC 
ATCTCAGTCT 
TGTTCCTTTT 
CGTTTCTCCT 
CTATATTTTC 
TTTAGCTCCT 
ACTTATTTTA 
AAGATGGATG 
AAGTTACGTT 
CTAAGTAGTT 
CACTGACATT 
ATTAGGACAC 
TGTCACAATC 
AGAAATGATT 
TATTTAAGGT 
GTCTCATTAG 
GATGCTTCGT 
TGGGTTTTAA 
TGAAAAAAAT 
CAGGAGAATC 
AATATGCTTG 
TAACATTGAT 
CTCAGACAAA 
TTTTTGTACT 



ATTTTGCTAT 
ATTTTCTGTT 
TCATAAAATA 
TGAAGCCAAT 
CTTTTCTTCT 
ATTTTGTTTT 
TTACCTGAAC 
CTTAAGAAGA 
A6TGAAAAGC 
CAAAGAGGCT 
CCCATTCTAA 
CTCAGCAATA 
TAATGGTACT 
TATGGTTATA 
AGTATATTTT 
GTATTAAGAT 
TGAATGTTTG 
CCTGAATGTG 
TAATTTGTTC 
TATTTTTAAA 
TTTCTTTTGG 
TTCTTTACAA 
TATAAAACAC 
CTGCTGACAA 
TGTGTATTTA 
ATTTA 



GAGGTTAATG 
GTACAGTTTA 
CCTCTTCAAA 
TCAGTA66TG 
TTTAGCCATT 
ACTAGTTTTA 
TTTTGCAAGT 
TTAAAA6AGA 
AGAGAATTTT 
AGT6CCTCAG 
TTAATGCCCT 
ATAATAATGA 
GTATATTACT 
AACTGTGTTT 
CTTTGGGGTT 
TTGTGAATAA 
TCCTTAGGAT 
CCATAAGACT 
AGCCACAATT 
TCAAACGCTG 
GAAGAGGGAG 
TTTTACGTTT 
TGTTCACTGT 
AACCTGGGAA 
ACTTATGTAA 



AAGTACCAAG 
ATTTAGCAGT 
ATGCTTAAAT 
CATTGGAATC 
TTGCTAAGAG 
AGATCAGAGT 
TTTCAG6TAA 
AAAAAAAA6G 
ATTTATAGCT 
AGAGAACT6T 
TTCTTATTTA 
CGATAATACT 
TAATTTATTG 
AAGCCTACAA 
ACCTCTCTGA 
ATTTTTAGAA 
AGGCCTATGT 
GACCTTTTAA 
TATTGAGAT^ 
ATTACAGATA 
AAAATGAAAT 
AGAATGTTTA 
TTTTTTTAAA 
TTTGGGTTGT 
AAGATAAGTC 



CTGTGCTTGA 
CCATATCACA 
TCATTTCACA 
AAGCCTG6CT 
ACACAGTCTT 
TCACTTTCTT 
ACCTCAGCTC 
CCCTTTTAAA 
AATTTTAGCT 
ACG6GGTTTG 
AAAACAAAAC 
TCTTTTCCAC 
AAGATTATTA 
TCATTGATTT 
ATATTATGTA 
ATCTGATTGG 
GCTAGCCCAC 
AATGTTTT6A 
ATATTCTGTG 
ATAGTATTTA 
AAATATCATT 
AGGTTAAGAA 
AAAAAAACTT 
GTATGCGAAT 
TQGAAATAAA 



3000 
3060 
3120 
31S0 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 



30 



Seq ID No: 133 Protein sequence; 
Protein Accession NP 000954.1 



35 



40 



45 



50 



55 



60 



65 



70 



75 



]4I«ARAIiI«LCA 
TRIKLFIiKPT 
GYKSWEAPSN 
NMMPAFFAQH 
QIIDGEMYPP 
VLKQBHPEWG 
NRIAAEFNTIj 
AGGRNVPPAV 
YGDIDAVELY 
GFQXINTASI 
STBIi 



11 
I 

VLALSHTANP 
PNTVHYILTH 
LSYYTRALPP 
FTHQFFKTDH 
TVKDTQAEMI 
DEQLFQTSRL 
YHWHPLLPDT 
QKVSQASIDQ 
PAIiLVEKPRP 
QSLICNNVKG 



21 
I 

CCSHPCQNRG 
PKGFWNWNN 
VPDDCPTPI.G 
KRGPAFTNGL 
YPPQVPEHLR 
ILIGETIKIV 
FQIHDQKYNY 
SRQMKYQSFN 
DAIFGETMVE 
CPFTSFSVPD 

; 



31 
I 

VCMSVGFDQY 
IPFLRNAIMS 
VKGKKQIiPDS 
GHGVDI^NHIY 
FAVGQEVFGIi 
lEDYVQHLSG 
QQFIYNNSIIi 
EYRKRFMLKP 
VGAPFSLKGIi 
PEIilKTVTIN 



41 
I 

KCDCTRTGFY 
YVIiTSRSHIjI 
NEIVEKLIiIiR 
GETLARQRKL 
VPGLMMYATI 
YHFKLKPDPE 
LEHGITQFVE 
YESFEELTGE 
MGNVICSPAY 
ASSSRSGLDD 



51 
I 

GEKCSTPBFX* 
DSPPTYNADY 
RKFIPDPQGS 
RLFKDGKMKY 
WIiREHNRVCD 
LLFNKQFQYQ 
SFTRQIAGRV 
KEMSAELEAL 
WKPSTFGGEV 
INPTVLIiKER 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



Seq ID NO: 134 DNA sequence 

Nucleic Acid Accession #: XM_059648.1 

Coding sequence: 35-664 (underlined sequences correspond to start and stop codons) 



AG6CTGCTQA 
AAAAT6TTCC 
AGTAACGCTA 
CTTTGAAGTG 
TTCACTAGTT 
GAAGGAACTG 
TCAGTTTGGA 
CTACGGAGTA 
TGCATTTAGA 
TCTTGTCAAC 
CATCAGGCCT 
AyGAG AATGC 
ATTTTT^CA 
TAGTGCGTTC 
TGAAGGATTT 
GGATGTTACC 
ACTTCCTCT6 
TATTGTTCAA 
ACTCTTTGGA* 
ATGTTAATG6 
TCCTTATATA 
TTCCTAGTGT 
TATAAATGAC 



11 
I 

GACTTCCCTC 
6G6CCCAGAG 
TTTCTTCTAC 
AAGGATGCAA 
GTAAACGTGG 
CACAAAGAGT 
GAATCGGAGC 
ACTTTCCCCA 
TTTCTTGTTG 
CCTGAGGGTC 
GACATAGCAG 
CATTGCGTTT 
TTTTTTTTTT 
TCAGCTCATT 
TTTTTTAATG 
CAAAGCAAAA 
ACCATACTAA 
CTTGACATTT 
TTCAAGAGCA 
AATAAMCAC 
TGTATGCTTG 
TTTACTATAA 
CGTATTATAA 



21 
I 

TAGAATCCTC 
CAAAGGTATT 
AACTAAAATT 
AAGGAAGAAC 
CCAGTGACTG 
TT6GACCATC 
CCCGCCCAAG 
TCTTCCACAA 
ATTCTTCAAA 
AAGTTGTGAA 
CTCT6GTTAG 
CTAATAGAAC 
GGAGACAGTG 
GCAACCTCTG 
TTATCTTGCT 
ATCAA6AGTA 
AGAATTCAGA 
TCTAGGACTO 
CTGTGTATGA 
AAATGTTGAA 
TTTTGT6TAC 
CTGTCACTAT 
CA 



31 
I 

CAA CATGG AG 
TGCAGTTTTG 
CCTCTUUVCCT 
TGTTTCTCTG 
CCJUVCTCACA 
CCACTTCAGC 
CAAGGAAGTA 
6ATTAAGATT 
GAAGGAACCA 
GTTCTGGAAG 
ACAAGTGATC 
AGAGAAATGT 
TCTCACTCTG 
CCTTTTTAAA 
ATTAAGTGGT 
GCCAAAGAAT 
ATACACAGTG 
TACTTGAT6A 
CTGAAATTTC 
AAATGTAAAA 
AGGATTTT6T 
GTATGTAACT 



41 

I 

CCTCTTGCAG 
CTGTCTATAG 
AAAATCAACA 
GAATU^GTATA 
GACAGAAATT 
GTGTTGGCTT 
GAATCTTTTG 
CTAGGATCTG 
AGGTGGAATT 
CCA6AGGAGC 
ATAAAAAAGA 
CTCCATGAGG 
TCACCCAGGC 
CAT6CTATTA 
AATGAATGTT 
CAACATGAAA 
ACCAATGTGC 
AAAT6CCAAC 
TGGAATAACT 
TATATATACA 



GACATATATA 



51 
I 

CTTACCCQCT 
TTCTATGCAC 
GCTTTTATGC 
AAGGCAAAGT 
ACTTAGGGCT 
TTCCCTGCAA 
CAAGAAAAAA 
AAGGAGAACC 
TTTGGAAGTA 
CCATTGAAGT 
AAGAGGATCT 
GTTTGGTCTC 
TGGAGTGCAG 
AATGTGGCAA 
CCCAG6ATGA 
TATATTAACT 
CTCAATATCT 
ACACTAGACC 
GTAAATGGTT 
TAGATTCAAA 
TAAGTACA06 
AATAGTCATT 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
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Seq ID No: 135 Protein aecmencei 
Protein Accession #: XP 059646.1 



5 1 11 21 31 41 51 

I I I I 11 

MEPLAAYPLK CSGPRAKVFA VLLSIVLCTV TLFLI*QLKFL KPKINSFYAF EVKDAKGRTV 60 

SLEKYKGKVS LWNVASDCQ LTDRNYIiGUC ELHKEFGPSH FSVLAPPCNQ FGESEPRPSK 120 

EVESFARKNY GVTFPIFHKI KILGSEGEPA FRFLVDSSKK EPRWNFWKYIi VNPEGQWKF 180 

10 WKPBEPZEVI RPDIAAItVRQ VIIKKKEDZi 



Seq ID NO: 136 DNA sequence 
Nucleic Acid Accession #: NM 003003.1 
15 Coding sequence: 304-2451 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

on ' ' ' ' ' ' 

ZU CAAGTGCCGT CGCCGCX3CCC CTTCCCCCTC CCGCCTCCCC GGCXTCCCTCC CCGGAACCGG 60 

C6GTG6AGCT AC6GTCGCGG AC6AGTGGAA CCGAGACTTGC CCCGCGGAGC CGCCGGTATG 120 

AGCGCCCCTC GCCACCCCGT 6TCCCAGGCC CGGCCTTTCT GACAAGAGCT AGACTTCGGG 180 

CTCCTTGAGG ATATTCAGTT TTGTATGTTT GAATATCCTC TCACCATGTT CAGCATAAAG 240 

TACCATTCTT AATGATTATC CTCAACAAGA CAGGTGTGAG AGGGTTGCTG TTGCATTGCA 300 

25 AT CATG GTGC AAAAATACCA GTCCCCAGTG AGAGTGTACA AATACCCCTT TGAATTAATT 360 

ATGGCTGCCT ATGAT^GGAG GTTCCCTACA TGTCCTTTGA TTCCGATGTT CGTGGGCAGT 420 

GACACTGTGA GTGAATTCAA GAGCGAAGAT GGGGCTATTC ATGTCATTGA AAGGCGCTGC 480 

AAGCTGGATG TAGATGCACC CAGACTGCTG AAGAAGATTG CAGGAGTTGA TTATGTTTAT 540 

TTTGTCCAGA AAAACTCACT GT^TTCTCGG GAACGTACTT TGCACATTGA GGCTTATAAT 600 

30 GAAACGTTTT CCAATCGGGT CATCATTAAT GAGCATTGCT GCTACACCGT TCACCCTGAA 660 

AATGAAGATT GGACCTGTTT TGAACAGTCT 6CAAGTTTAG ATATTAAATC TTTCTTTGGT 720 

TTTGAAAGTA CAGTGGAAAA AATTGCAATG AAACAATATA CCAGCAACAT TAAAAAAGGA 780 

7VAGGAAATCA TCGAATACTA CCTTCGCCAA TTAGAAGAAG T^GGCATAAC CTTTGTGCCC 840 

CGTTGGAGTC CGCCTTCCAT CACGCCCTCT TCAGAGACAT CTTCATCATC CTCCAAGAAA 900 

35 CAAGCAGCGT CCATGGCCGT CGTCATCCCA GAAGCTGCCC TCAA66AGGG GCTGAGTGGT 960 

GAT6CCCTCA GCA6CCCCAG TGCACCTGAG CCCGTGGTGG 6CACCCCTGA CGACAAACTA 1020 

GATGCCGACC ACATCAAGAG ATACCTGGGC GATTTGACTC CX3CTGCAGGA GAGCTGCCTC 1080 

ATTAGACTTC GCCAGT6GCT CCAGGAGACC CACAAGGGCA AAATTCCAAA AGATGAGCAT 1140 

ATTCTTCGGT TCCTCCGTGC ACGGGATTTT AATATTGACA AAGCCAGAGA GATCATGTGT 1200 

40 CAGTCTTTGA CGTGGAGAAA GCA6CATCAG GTAGACTACA TTCTTGAAAC CTGGACCCCT 1260 

CCTCAGGTCC TTCAGGATTA CTACGCX3GGA GGCTGGCATC ATCACGACAA AGATGGGCX3G 1320 

CCCCTCTACX3 TGCTCAGGCT GGGGCAGATG GACACCAAA6 GCTTGGTGAG A6C6CTCGGG 1380 

GAGGAAGCCC TGCTGAGATA CGTTCTCTCC GT7VAATGAAG AACX3GCTAAG GCGATGCGAA 1440 

GAGAATACAA AAGTCTTTGG TCGGCCTATC AGCTCATGGA CCTGCCTGGT GGACTTGGAA 1500 

45 GGGCTGAACA TGCGCCACTT GTGGAGACCT GGTGTGAAAG CGCTGCTGCG GATCATCGAG 1560 

GTGGTGGAGG CCAACTACCC TGAGACACTG GGCCGCCTTC TCATCCTGCG GGCGCCCAGG 1620 

GTATTTCCTG TGCTCTGGAC GCTGGTTAGT CCGTTCATTG ATGACAACAC CAGAAGGAAG 1680 

TTCCTCATTT ATGCAGGAAA TGACTACCAG GGTCCTGGAG GCCTGCTGGA TTACATCGAC 1740 

AAAGAGATTA TTCCAGATTT CCTGAGTGGG GAGTGCATGT GCGAAGTGCC AGAGGGTGGA 1800 

50 CTGGTCCCCA AATCTCTGTA CCGGACTGCA GAGGAGCTGG AGAACGAAGA CCTGT^GCTC 1860 

TGGACTGAGA CCATCTACCA GTCTGCAAGC GTCTTCAAAG GAGCCCCACA TGAGATTCTC 1920 

ATTCAGATTG TGGATGCCTC GTCAGTCATC ACTTGGGATT TCGACGTGTG CAAAGGGGAC 1980 

ATTGTGTTTA ACATCTATCA CTCCAAGAGG TCGCCACAAC CACCCAAAAA GGACTCCCTG 2040 

GGAGCCCACA GCATCACXTTC TCCGGGTGGG AACAATGTGC AGCTCATAGA CAAAGTCTGG 2100 

55 CAGCTGGGCC GCGACTACAG CATGGTGGAG TCGCCTCTGA TCTGCAAAGA AGGAGAAAGC 2160 

GTGCAGGGTT CCCATGTGAC CAG6T66CCG GGCTTCTACA TCCTGCAGTG GAAATTCCAC 2220 

AGCATGCCTG CGTGCGCCGC CAGCAGCCTT CCCCGGGTGG ACGACX5TGCT T6CGTCCCTG 2280 

CAGGTCTCTT CGCACAAGTG TATU^TGATG TACTACACCG AGGTGATC6G CTCGGAGGAT 2340 

TTCAGAGGTT CCATGACGAG CCTGGAGTCC AGCCACAGCX5 GCTTCTCCCA GCTGAGTGCC 2400 

60 GCCACCACCT CCTCCAGCCA GTCCCACTCC AGCTCXSITGA TCTCCAG GTA GT GCCGCGCT 2460 

GCCTGCACCT AGT6TGCAGA GGGGACGGCC GCCCCTCCTC GGACAGCA6C TGCACCCGCC 2520 

CACCCAGCGG CGACATTGTA CAGACTCCTC TCACCTCTAG ATAGCAAATA GCTCTCAGAT 2580 

GGTAAACGTA GTCGTTTGAT CCCAAAACTA CCTTGGCAGG TAGTTTTAAC TCTGATCCTA 2640 

ACTTAACTCA ATAGCCATAG ATTTTGTATA CGTTGTGCAC AAAATCCAAC CAGAGCGCAA 2700 

65 GGGCTCTCTT GAAAGAAAAG TAGTTTCTGT ACCAATTAAA GGATTGACGT GGTCTCAGAT 2 760 

ATTGATGCAA AAAATTTTTC CAACGAACTC CGCATTGTCC ATTAGTGAAT GAATTCCTGT 2820 

GACATCCTCC AGAGATGGCC CCTCCTCACC TGGGACGGAA GCTGCCAGCT CGCTTCCCCC 2880 

AAGCTGCCTC ATGGCCCGCA CGCCGCCTCA CGGCCCCCAT GCTTCCCGCC AGTCAAGATG 294 0 

GTCTGTGGAC TTAGGGCCAG CCCTTGAGGT CCTTATCCTC TGAGGATTCA GAGGTTGCCT 3000 

70 GCGGAGTACC TTGTCCCAGG GCCAGACACA CCCACACCAC CCACTGTCTG CAGTGGGGCC 3060 

GGGGGCTCAG GAGGGGCTCT CAGGGACTCC TGGTGACTCC AGGA7UU\TGC TGCCATCGTT 3120 

AAACATTACT TTCTCTTTCC TCCTTTTCAA ATCTTTTTGA TACTTTTTAG AGCAGGATTT 3180 

TTCTGTATGT GAACTTGGGT GGGGGGGTTC TTCCCGTTTC CTTCCGTGCG TCGCCCCTCT 3240 

CACCTGCAGT CAGCTCCCAG CCCAGTGTAG GCCATCTCCT CTGTGCCCTC TGGAGGCTCA 33 00 

75 TTGTCTCAGA GCCCAGACAG TTCCAGCCAC TAGGAGGCCX3 TCTTGGAACC AGCAAGTCGC 3360 

ATTTGCCACT TGACACTGTC CATGGGGTTT TATTAGTAGC TAAGCAGCAG CTCTCGCATC 3420 
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CACTTCAGGG TGGC6TGTGG CATGTAGGAG TCCTGCTTCT TTGTACATGG GAATTGTG6A 3480 

CTCATGCGTG TGTGTGTGTG CATGTGCTGT GTGTGTGCAT GTGTGCATGA C6GTGGGGGT 3540 

GCTGGGGGGA C6G6GTGA6T GGAAACTTAG TTTGA6TAAT GAAGGAATCT TCACAGAAGC 3600 

AAATCAGAAT ATGGGATTTG TTTGCCTTTT ACATTTTGTT TAATTCCTGA TTTTAAAGCC 3660 

5 TGCTCTATCT 6GTACAGGCC CTTATTTTTT CAGCTTTTTA TGGGAAAAGC AGGTTATTTG 3720 

AGAATCTGTC CAGAAGTTGC ATAGGGGATG GCCTCCACGA TAAGGACATG CAACAC6TGT 3780 

TTCTGTGTGC AGCAGAGGCC GTGTTTTTCA TGCCAAACCC CACGCG6CTG TCAACTGTGT 3840 

GCGTGGTAGG CATGGAGATC CTGGTTGTGC CGTCTCAGCT CC6CTCTGAA GGCACTGTGT 3900 

GGGTGCTGCG TGACTGGAGA GCTGTGTGGA GGCCATGTGT GCCCCGTGCA GGGATCAGGA 3960 

10 GGGCGGGGGA GGGACCGAGC AGCCCTCTTG CCCGGTCGGG TCAGCCCTAG TGGCTGCCTG 4020 

CACACTGTAG ACX3TCCCAGG GCCT6T6CTG TGATCACCTG CCTTTGGACC ACATTTGTGT 4 080 

TTGCTCTTAG AGATCGAGCT CCTCAGTGGT ACCTGAAGCC TTTGCTTCCG GAAAGCGCGG 414 0 

TAGGGTTCGT AGGTAGGGCT AGTAGGTAGG GTTAGTAGGT AGGGCTAGTA GGTAGGGCTA 4200 

GTAGGTAGGG TTAGTAGGTA GGGTTCGTAG GTAGGGCTGG TAGGTAGGGT TAGTAGGTAG 4260 

15 GGCTAGTAGG TAGGGTTCGT AGGTAGGGCT AGTAGGTAGG GTTAGTAGGT AGGGCTAGTA 4 320 

GGTAGGGCTA GTAGGTAGGG TTAGTAGGTA GGGTTCGTAG GTAGGGCTGG TAGGTAGGGT 4380 

TAGTAGGTAG GGCTAGTAGG TAGGGTTCGT AGGTAGGGCT AGTAGGTAGG GTTAGTAGGT 4440 

AGGGCTAGTA GGTAGGGCTA GTAGGTAGGG TTAGTAGGTA GGGTTCGTAG GTAGGGCTGG 4500 

TAGGTAGGGT TAGTAGGTAG GGCTAGTAGG TAGGGCTAGT AGGTAGGGCT AGTAGGTAGG 4560 

20 GTTAGTAGGT AGGGCTAGTA GGTAGGGCTA GTAGGTAGGG TTAGTAGGTA GGGTTCGTAG 4620 

GTAGGGCTGG TAGGTAGGGT TAGTAGGTAG GGCTAGTAGG TAGGGCTAGT AGGTAGGGCT 4680 

AGTAGGTAGG GCTAGTAGGT AGGGCTAGTA GGTAGGGCTA GTAGGTAGGG CTAGTAGGTA 4740 

GGGTTCGTAG GTAGGGTTCG TAGGTAGGGT TCGTAGGTAG GGTTAGTAGC GCGTCTGTGC 4800 

TGCTTCCACC TGGTGCTTCC TGTTCCCAAA TCACAAGGGC CTGAAGGTGG TCCCTGCTTT 4860 

25 CTCTTTCTCT TTCTCTGTGT CTCAGATGGC GATTTTGCTG ACAGCT6CCA AGTU^AATGCT 4920 

TCACTCAACA GTCCTCATGT GCCCAGAGAT GTTTATAGAA CTGTTTGAAT TGCAGCCATC 4980 

CCCTGCCCCC TCCCAGGCTG AAGATCTGTT CTTTTTAAGT TGATTCG6GA GTGGCATTCT 5040 

TTTATACCCA AAGACTGTAG TGCATCTT6A AGAGCTCAAA GCACATQACC GCACAAATGC 5100 

TTACAGGGTT TCCTCCCGAG TAATCCAATC TCACTCCCCT TGTAAGGGAA TTCTGGGGCA 5160 

30 GCTATGGTTT GAGTATGCAG TTTGCATCGT GTTTCTACCT TTAGTACCTT GCCACTCTTT 5220 

TAAAAC6CTG CTGTCATTTC CCATTTCTTA GTACTAATGA TTCTTTGATT CTCCCTCTAT 5280 

TATGTCTTAA TTCACTTTCC TTCCTAAATT TGTTATTTGC ATATCAAATT CTGTAAATGT 5340 

TTTGTAAACA TATTACCTCA CTTGGTAATA CAATACTGAT AGTCTTTAAA AGATTTTTTT 5400 
ATTGTTATCA ATAATAAATG TGAACTATTT AAAG 

35 

Seq ID No; 137 Protein sequence; 
Protein Accession NP 002994.1 



40 1 11 21 31 41 51 

I I I I I I 

MVQKYQSPVR VYKYPFELIM AAYERRFPTC PLIPMFVGSD TVSEFKSEDG AIHVIERRCK 60 

LDVDAPRIiLK KIAGVDYVYF VQKNSLNSRE RTIiHIEAYNE TFSNRVIINE HCCYTVHPEN 120 

EDWTCFEQSA SLDIKSFFGP ESTVEKIAMK QYTSNIKKGK EIIEYYLRQL EEEGITFVPR 180 

45 WSPPSITPSS ETSSSSSKKQ AASMAWIPE AAIiKEGbSGD AIiSSPSAPEP WGTPDDKLD 240 

ADHIKRYLGD LTPLQESCliI RLRQWIiQBTH Xa3KIPKDBHI IiRFIiRARDFN IDKAREIMCQ 300 

SLTWRKQHQV DYIIiETWTPP QVIiQDYYAGG WHHHDKDGRP LYVIiRIiGQMD TKGLVRALGE 360 

EAIOjRYVIfSV NEERIiRRCEE NTKVFGRPIS SWTCLVDIiEG LNMRHXiWRPG VKTUjLRIIEV 420 

VEANYPETIiG RLLIIiRAPRV FPVItWTLVSP FIDDNTRRKF IiIYAGNDYQG PGGIiIiDYIDK 480 

50 EIIPDFLSGB CMCEVPEGGL VPKSLYRTAB ELENEDIiKLW TBTIYQSASV FKGAPHEIIiZ 540 

QIVDASSVIT WDFDVCKGDZ VFNZYHSKRS PQPPKXDSLG AHSITSPGGN NVQLZDKVWQ 600 

LGRDYSMVBS PLICKBGESV QGSKVTRWPG FYIIjQWKFHS MPACAASSIjP RVDDVIiASI<Q 660 
VSSHKCKVMY YTEVIGSEDF RGSMTSIiBSS HSGFSQiiSAA TTSSSQSHSS SMZSR 

55 Seq ZD NO: 138 DNA sequence 

Nucleic Acid Accession #: NM_004181.1 

Coding sequence: 32-670 (underlined sequences correspond to start and stop codons) 



60 1 11 21 31 41 51 

60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
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GCAGAAATAG CCTAGGGAGA TCAACCCCGA 
GGTCGCCGGC CAGTGGCGCT TCGTGGACGT 
GGTGCCAGCG CCTGCCTGCG CGCTGCTGCT 

65 CTTCAGGAAA AAGCAGATTG AAGAGCTGAA 
CATGAAGCAG ACCATTGGGA ATTCCTGTGG 
TAATCAAGAC AAACTGGGAT TTGAGGATGG 
AGAGAAAATG TCCCCTGAAG ACAGAGCAAA 
AGCCCATGAT GCCGTGGCAC AGGAAGGCCA 

70 TTTTATTCT6 TTTAACAACG TGGATGGCCA 
TCCGGTGAAC CATGGC6CCA GTTCAGAGGA 
CA6AGAATTC ACCGAGCGTG AGCAAGGAGA 
GGCAGCC TAA TGCTCTGTGG 6AGGGACTTT 
AATATATACC CCCCATGCAG TCTAAAATGC 

75 TGTTCTGCAG ACACGCCTTC CCCTCAGCCA 
ACAGCTGTCC ACTGGGCCAT TGTGGTGTGA 



GBTGCTGAAC AAAGTGCTGT CCCGGCTGGG 
GCTGGGGCTG GAAGAGGAGT CTCTGGGCTC 
GCTGTTTCCC CTCACGGCCC AGCATGAGAA 
GGGACAAGAA GTTAGTCCTA AAGTGTACTT 
CACAATCGGA CTTATTCACG CAGTGGCCAA 
ATCAGTTCTG AAACAGTTTC TTTCTGAAAC 
ATGCTTTGAA AAGAATGAG6 CCATACAGGC 
ATGTCGGGTA GAT6ACAAGG TGAATTTCCA 
CCTCTATGAA CTTGATGGAC GAATGCCTTT 
CACCCT6CT6 AAG6AC6CTG CCAAGGTGTG 
AGTCCGCTTC TCTGCOSTGC CTCTCTGCAA 
GCTGATTTCC CCTCTTCCCT TCAACATGAA 
TTCAGTACTT GTGAAACACA GCTGTTCTTC 
CACCCAGGCA CTTAAGCACA AGCAGA6TGC 
GCTTCAGATG GTGAAGCATT CTCCCCAGTG 
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TATGTCTTGT ATCCGATATC TAACGCTTTA AATGGCTACT TTGGTTTCTG TCTGTAAGTT 
AAGACCTTGG ATGTGGTTAT GTTGTCX:t;A AGAATAAATT TTGCTGATAG TAGC 
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Seq ID No: 139 Protein sequence ; 
Protein Accession #: KIP 004172.1 



11 



21 
I 



31 



41 



51 



MmKVLSRLG VAGQWRFVDV LGLBEESLGS VPAPACAIjLIi LFPLTAQHEN FRKKQIBELK 60 

GQEVSPKVYF MKQTIGNSCG TIGIjIHAVAN NQDKIiGFEDG SVXjKQFLSET BKMSPBDRAK 120 

CFEKNEAIQA AHDAVAQEGQ CRVDDKVNFH FILFNNVDGH LYELDGRMPF PVNHGASSED 180 
TLLKDAAKVC REFTEREQGE VRFSAVALCK AA 

Seq ID NO: 140 DNA secmence 

Nucleic Acid Accession NM_000201.1 

Coding sequence: SS-iese (underlined secjuences correspond to start and stop codons) 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



1 
I 

GCGCCCCAGT 
GCTCCCAGCA 
CCAGGACCTG 
GGCTCCGTGC 
ACCCCGTTGC 
AGCAATGTGC 
ACAGCTAAAA 
CCCTCTTGGC 
CCCCGGGCCA 
6CTGTGGGGG 
GCCAATTTCT 
AACACCTCGG 
GTCAGCCCCC 
CTGTTCCCAG 
ACAGTCACCT 
GACGAGGGCA 
CTGCAGACAG 
GTCTCAGAAG 
CTGAATGGGG 
CCA6AGGACA 
ATACACAAGA 
GATTGTCCGG 
TGGGGGJW^CC 
GGGGAATCAG 
ACTCAAGGGG 
GTCATCATCA 
CTCTATAACC 
CCCATGAAAC 
CCTCGGCCTT 
CATGCAGCTA 
TACAACA6CA 
CTGTAGTCAC 
TTAAAGTCTA 
ATACAACTG6 
CAGAAGAAGT 
GACG6ATGCC 
TTCATTTGTT 
GGTCTCTGGC 
TACAGGTTGT 
CATTGGCCAA 
ATGGACTGGT 
CCCCCAAAAC 
CAC/^TGACA 
GCCTTGTCCT 
TCCTGCAGTG 
CCTCCCAGCT 
CGCTCTGTCA 
TTTGGGCTCA 
ACACCACACC 
TGCCCAGACT 



11 
I 

CGACGCTGAG 
GCCCCCGGCC 
GCAATGCCCA 
TGGTGACATG 
CTAAAAAGGA 
AAGAAGATAG 
CCTTCCTCAC 
AGCCAGTGGG 
ACCTCACCGT 
AGCCCGCTGA 
CGTGCCGCAC 
CCCCCTACCA 
GGGTCCTAGA 
TCTCGGAGGC 
ATGGCAACX3A 
CCCAGCGGCT 
TGACCATCTA 
GGACCGAGGT 
TTCCAGCCCA 
ACGGGCGCAG 
ACCAGACCCG 
GAAACTGGAC 
CATTGCCCGA 
TGACTGTCAC 
AGGTCACCCG 
CTGTGGTAGC 
GCCAGCGGAA 
CGAACACACA 
CCCATATTGG 
CACCTACCG6 
TTTGGGGCCA 
ATGACTAAGC 
GCCTGATGAG 
6AAATACTGA 
GGCCCTCCAT 
AGCTTGGGCA 
ATTTTACCAG 
CTCACGGAGC 
ACACTGCAGG 
CCTGCCTTTC 
AATGGTTCAC 
TGACACCTTT 
CTCAGCGGTC 
CTTGTCCTGT 
ATCAGGGTCC 
TTGGAAGGGT 
CCCAGGCTGG 
AGTGATCCTC 
TGGCAAATTT 
TCCTTTGTGT 



21 
I 

CTCCTCTGCT 
CGCGCTGCCC 
GACATCTGTG 
CAGCACCTCC 
GTTGCTCCTG 
CCAACCAATG 
CGTGTACTGG 
CAAGAACCTT 
GGTGCTGCTC 
GGTCACGACC 
TGAACTGGAC 
GCTCCA6ACC 
G6TGGACACG 
CCAGGTCCAC 
CTCCTTCTGG 
GACGTGTGCA 
CAGCTTTCCG 
GACAGTGAAG 
GCCACTGGGC 
CTTCTCCTGC 
GGAGCTTCGT 
GTGGCCAGAA 
GCTCAAGTGT 
TCGAGATCTT 
CGAGGTGACC 
AGCCGCAGTC 
GATCAAGAAA 
AGCCAGGCCT 
TGGCAGTGGT 
CCCT6GGACG 
TGGTACCTGC 
CAAGAGGAAG 
AGGGGAAGTG 
AACTTGCTGC 
AGACAT6TGT 
CTGCTGTCTA 
CTATTTATTG 
TCCCAGTCCA 
AGAGTGCCTG 
CCCAGAAGGA 
AGGTTCAGAG 
GTTAGCCACC 
ATGTCTGGAC 
TTGCATTTCA 
TGCAAGCAGT 
CATCCGCGTG 
AGTGCAGTGG 
CCACCTCAGC 
GATTTTTTTT 
TAGTTAATAA 



31 
I 

ACTCAGAGTT 
GCACTCCTGG 
TCCCCCTCAA 
TGTGACCAGC 
CCTGGGAACA 
TGCTATTCAA 
ACTCCAGAAC 
ACCCTACGCT 
CGTGGGGAGA 
ACGGTGCTGG 
CTGCGGCCCC 
TTTGTCCTGC 
CA6GGGACCG 
CTGGCACTGG 
6CCAAGGCCT 
GTAATACTGG 
GCGCCCAACG 
TGTGAGGCCC 
CCGAGGGCCC 
TCTGCAACCC 
GTCCTGTATG 
AATTCCCAGC 
CTAT^GGATG 
GAGGGCACCT 
GTGAATGTGC 
ATAATGGGCA 
TACAGACTAC 
CC CTGAA CCT 
GCCACACTGA 
CCG6A6GACA 
ACAGCTAAAA 
GAGCAAGACT 
GTGG6G6AGA 
CTATTGGGTA 
AGCATCAAAA 
CTGACCCCAA 
AGTGTCTTTT 
TGTCACATTC 
GCAAAAAGAT 
GTGATTTTTC 
ATTACCCAGT 
TCCCCACCCA 
ATGAGTGCCC 
CTGGGAGCTT 
GGGGTU^GGGG 
TGTGTGTGTG 
TGCAATCATG 
CTCCTGAGTA 
TTTTTTTTCA 
A6CTTTCTCA 



41 

t 

GCAACCTCAG 
TCCTGCTCGG 
AAGTCATCCT 
CCAAGTTGTT 
ACCGGAAGGT 
ACTGCCCTGA 
GGGTGGTVACT 
GCCAGGTGGA 
AGGAGCTGAA 
TGAGGAGA6A 
AAGGGCTGGA 
CAGCGACTCC 



GGGACCAGAG 
CAGTCA6TGT 
6GAACCA6AG 
TGATTCTGAC 
ACCCTAGAGC 
AGCTCCTGCT 
TGGAGGTGGC 
GCCCCCGACT 
AGACTCCAAT 
GCACTTTCCC 
ACCTCTGTCG 
TCTCCCCCCG 
CTGCAGGCCT 
MCAGGCCCA 
ATCCCGGGAC 
ACAGAGTGGA 
GGGCATTGTC 
CACTAG6CCA 
CAAGACATGA 
CATAGCCCCA 
TGCTGA6GCC 
CACAAAG6CC 
CCCTTGATGA 
ATGTAGGCTA 
AAGGTCACCA 
CAAATGGGGC 
TATCGGCACA 
GAGGCCTTAT 
CATACATTTC 
AGGGAATATG 
GCACTATTGC 
GCCAAGGTAT 
TGTATGTGTA 
GTTCACTGCA 
GCTGGGACCA 
GAGACGGGGT 
ACTGCC 



51 

I 

CCTCGCTATG 
GGCTCTGTTC 
GCCCCGGGGA 
GGGCATAGAG 
GTATGAACTG 
TGGGCAGTCA 
GGCACCCCTC 
GGGTGGGG CA 
ACGGGAGCCA 
TCACCATGGA 
GCTGTTTGAG 
CCCACAACTT 
CCTGGACGGG 
GTTGAACCCC 
GACCGCAGAG 
CCAGGAGACA 
GAAGCCAGAG 
CAAGGTGACG 
GAAGGCCACC 
CGGCCAGCTT 
GGACGAGAGG 
GTGCCAGGCT 
ACTGCCCATC 
GGCCAGGAGC 
GTATGAGATT 
CAGCACGTAC 
AAAAGGGACC 
AGGGCCTCTT 
AGACATATGC 
CTCA6TCAGA 
CGCATCTGAT 
TTGATGGATG 
CCATGA6GAC 
CACAGACTTA 
CACACTTCCT 
TAT6TATTTA 
AATGAACATA 
GGTACAGTTG 
TGGGACTTCT 
AT^GCACTAT 
TCCTCCCTTC 
TGCCAGTGTT 
CCCAAGCTAT 
AGCTCCAGTT 
TGGAGGACTC 
GACAAGCTCT 
GTCTTGACCT 
TAGGCTCACA 
CTCGCAACAT 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 



Seq ID No: 141 Protein sequence: 
Protein Accession #: NP 000192.1 
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MLQPVRAGAR AWLRPTGSQG LSSIAEEAAR ATENPEQVAS EGLPEPVLRK VELPVPTHRR 
5 PVQAWVESLR GFEQERVGIiA DLHPDVFATA PRLDILHQVA MWQKNFKRIS YAKTKTRAEV 
RGGGGKPLAA ERHWAGPAWQ HPLSAIARRR CCPWPPGPTS YYYMLPMKVR ALGLKVALTV 
KIiAQDDIiHIM DSIaEIiPTGDP QYLTELAHYR RWGDSVIiLVD LTHEEMPQSI VEATSRLKTF 
NLIPAVGIiNV HSMIiKHQTLV LTLPTVAPIiE DKLLWQDSRY RPLYPFSIiPY SDFPRPLPHA 
TQ6PAATPYH C 

10 

Seg ID NO: 142 DNA sequence 
Nucleic Acid Accession #: NM_000270.1 

Coding sequence: 110-979 (underlined sequences correspond to start and stop codons) 

15 

1 11 21 31 41 51 

I I I I I I 

AACTGTGCGA ACCAGACCCG GCAGCCTTGC TCAGTTCAGC ATAGCGGAGC GGATCCGATC 
GGATCGGAGC ACACCGGAGC AGGCTCATCG AGAAGGCGTC TGCGAGACCA_TGGAGAACGG 

20 ATACACCTAT GAAGATTATA AGAACACTGC AGAATGGCTT CTGTCTCATA CTAAGCACCG 
ACCTCAAGTT GCAATAATCT GTGGTTCTG6 ATTAGGAGGT CTGACTGATA AATTAACTCA 
GGCCCAGATC TTTGACTACA GTGAAATCCC CAACTTTCCT CGAAGTACAG TGCCAGGTCA 
TGCTG6CCGA CTGGTGTTTG GGTTCCTGAA TGGCAGGGCC TGTGTGATGA TGCAGGGCAG 
GTTCCACATG TATGAAGGGT ACCCACTCTG GAAGGTGACA TTCCCAGTGA G6GTTTTCCA 

25 CCTTCTGGGT GTGGACACCC TGGTAGTCAC CAATGCAGCA GGAG6GCTGA ACCCCAAGTT 
TGAGGTTGGA GATATCATGC TGATCCGTGA CCATATCAAC CTACCTGGTT TCAGTGGTCA 
GAACCCTCTC AGAGGGCCCA ATGATGAAAG 6TTTGGAGAT CGTTTCCCTG CCATGTCTGA 
TGCCTACGAC CGGACTATGA GGCAGAGGGC TCTCAGTACC TGGAA ACAAA TGGGGGA6CA 
ACGTGAGCTA CAGGAAGGCA CCTATGTGAT GGTGGCAGGC CCCAGCTTTG AGACTGTGGC 

30 AGAATGTCGT GTGCTGCAGA AGCTGGGAGC AGACGCTGTT GGCATGAGTA CAGTACCAGA 
AGTTATCGTT GCACXSGCa^CT GT6GACTTCG AGTCTTTGGC TTCTCACTCA TCACTAACAA 
GGTCATCATQ GATTATGAAA GCCTGGAGAA GGCCAACCAT GAAGAAGTCT TAGCAGCTGG 
CAAACAAGCT GCACAGAAAT TGGAACAGTT TGTCTCCATT CTTATGGCCA GCATTCCACT 
CCCTGACAAA QCCAG TTGAC CTGCCTTGGA GTC6TCTGGC ATCTCCCACA CAAGACCCAA 

35 GTAGCTGCTA CCTTCTTTGG CCCCTT6CTG GAGTCATGTG CCTCTGTCCT TAGGTTGTAG 
CAGAAA6GAA AAGATTCCTG TCCTTCACCT TTCCCACTTT CTTCTACCAG ACCCTTCTGG 
TGCCAGATCC TCTTCTCAAA GCTGGGATTA CAGGTGTGAG CT^TAGTGAGA CCTTGGCGCT 
ACAAAATAAA GCTGTTCTCA TTCCTGTTCT TTCTTACACA AGAGCTGGAG CCCGTGCCCT 
ACCACACATC TGTGGAGATG CCCAGGATTT GACTCGGGCC TTAGAACTTT GCATAGCAGC 

40 TGCTACTAGC TCTTTGAGAT AATACATTCC GAGGGGCTCA GTTCTGCCTT ATCTAAATCA 
CCAGAGACCA AACAAGGACT AATCCAATAC CTCTTGGA 

Seq ID No: 143 Protein sequence; 
Protein Accession #: NP 000261.1 

45 

1 11 21 31 41 51 

I I I I I I 

MBNGYTYEDY KNTAEWLLSH TKHRPQVAII CGSGLGGLTD KIiTQAQIFDY SEIPNFPRST 
50 VPGHAGRLVF GFLNGRACVM MQ6RFHMYEG YPIiWKVTFPV RVPHIiLGVUrr LWTNAAGGL 
NPKPEVGDIM LIRDHINXiPG FSGQNPIiRGP NDERFGDRFP AMSDAYDRTM RQRALSTWKQ 
MGEQREIiQEG TYVMVAGPSF ETVAECRVLQ KLGADAVCMS TVPEVIVARH CGIiRVFOFSL 
ITNKVIMDYE SIiEKANHEEV LAAGKQAAQK IiEQFVSIIiMA SIPI>PDKAS 

55 Seq ID NO: 144 DNA sequence 

Nucleic Acid Accession #: NM_015S77.1 

Coding sequence: 112-3054 (underlined sequences correspond to start and stop codons) 



60 1 11 21 31 41 51 

I I I I I I 

GAAGCGGCGG GCGGGGTGGA GCAGCCAGCT GGGTCCGGGG AGCGCCGCCG CCGCCTCGAT 60 

GGGGTGTT6A AAAGTCTCCT CTAGAGCTTT GGAAGGCTGA ATGCACTAAA CATGAAGAGC 120 

TTGAAAGCGA AGTTCAGGAA GAGTGACACC AATGAGTGGA ACAAGAATGA TGACCGGCTA 180 

65 CTGCAGGCCG TGGAGAATGG AGATGCGGAG AAGGTGGCCT CACTGCTCGG CAAGAAGGGG 240 

GCCAGTGCCA CCAAACACGA CAGTGAGGGC AAGACCGCTT TCCATCTTGC TGCTGCAAAA 300 

GGACACGTGG AATGCCTCAG GGTCATGATT ACACATGGTG TGGATGTGAC A(3CCCAAGAT 360 

ACTACCGGAC ACAGCGCCTT ACATCTCGGA GCCAAGAACTl 6CCACCATGA ATGCA TCAGG 420 

AGGCTGCTTC AGTCTAAATG CCCa^GCCGAA AGTGTCGACA GCrrcrrGG(3AA AACAGCTTTA 480 

70 CATTATGCAG CGGCTCAGGG CTGCCTTCAA GCTTCTGCACSA TTCTCTGCGA ACACAAGAGC 540 

CCCATAAACC TCAAAGATTT GGAT6C3GAAT ATACCGCTTGC TTCTTGCTGT ACAAAAT<3GT 600 

CACAGTGAGA TCTGTCACTT TCTCCT6GAT CATGGAGCAG ATGTCAATTC CAGGAACAAA 660 

AGT6GAAGAA CTGCTCTCa^T GCTGGCCTGT (3AGATTGGCA GCTCTAACGC TGTG6AAGCC 720 

TTAATTAAAA AGGGTGCAGA CCTAAACCTT GTAGATTCTC TTGGATACAA TGCCTTACAT 780 

75 TATTCCAAAC TCTCAGAAAA TGCAGGAATT CAAAGCCrTTC TATTATCAAA AATCTCTCAG 840 

GATGCTGATT TAAAGACCCC AACAAAACC». AAGCAGCATG ACCAAGTCTC TAAAATAAGC 900 
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60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



60 
120 
180 
240 
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15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



TCAGAAAGAA 
CAGTTGAGTG 
TCG6TATTTT 
GACA6ACTAA 
GACCAACAAG 
AAG6A6TTAC 
TTTGACTCAT 
ACCTCTCCCC 
ACTACTGACA 
AGATTAGAGA 
GCAGAACTGG 
AAACTTAAAG 
AAGCAGATGA 
GAGCTGAGGG 
GAAGAAAGTG 
GAGAAAGGTA 
TGCTCTGTTA 
GCCCAAGAAG 
AGTGATGAAG 
CAGGTGAGCG 
AAGAGGAAAT 
CTGATGCAAT 
TCTCAGTATT 
AAAGAGAACT 
GCAAAAGAGA 
GAAGTAGCAA 
GTACCTCGGT 
GCATCGAAAT 
ATTAGAAGTG 
TCCAAAGAGC 
GCAGAAAT6A 
ATAAATGA6A 
CTCTCCTACT 
CAAGTCAT^C 
TCAGTTTACA 
AAAGTACTGA 
TTCCTTGGCA 
AGCCGCTGCC 
TCCAAAGGTT 
ACTTCAAACC 
GCCATGCCCA 
GGCTGAGTGG 
TCCATTCCTG 
TCAATTCCAA 
CTGCTTAAAA 
TTCCCATCTG 
ATTAACTAAA 
AGTT^AATTGT 
AAGCTCAGGA 
CCAAAA6AAA 
AATATTTTTA 
CAGTAGATTA 
TAACTTTTTC 
ATGAGCTGCC 
TTTCTGAATC 
CAGCGCCCAT 
GCTAAAAACC 
TTCTGTTCAC 
GCATTAATCT 
AGTAGGGCCT 
TGGCTGTGCT 
AATGGCTTTA 
AAAAATGCTC 
CAGCCCCTTC 
TAGGATGCCT 
ACTCTGGTTA 
TACAGAAGTT 
AAAAA 



GTGGAACTCC 
ATGTCTCTTC 
TTGCTGAACC 
GTGACAGTAC 
ATCTTCTCTC 
AA6ATAAATT 
ACCATTCCAC 
CAGACTCCAA 
ATGATGTCAG 
GCTCTGAAGC 
TATGCTTAAA 
AAACTCAGAG 
AACTCGGTCT 
TCACGGAAGA 
AAAGAAATAA 
CAGTGATTAA 
TTGAGAATAT 
AAATCATGAA 
CTGAGGACAT 
AGCTGTCACA 
CTCTAGAGGA 
TGACAAACGT 
CAAAAGTGTT 
CTGTCTCTAT 
TGGAAGAAAA 
AGCTGGAGAA 
CTTCCTATGA 
TAAAGGAATC 
AGGTCTCACA 
AAGAAGTAAA 
AAA6ATACGC 
TGTCGAAGGA 
CAACAAGCTC 
AGCTCCAGAA 
GAATGCATCT 
AGCAAATCCT 
GGACACTGCC 
ATTGTTCTCA 
TCTGAG6ACT 
AGCAGAGGTG* 
GAGGTCTGGT 
CTTTATCACC 
TACTCGGGCA 
AACATGATTT 
CCCTTCATCA 
CCTTATATAT 
ATATGAAACT 
TGACATACTG 
AATCCAAACA 
TGCCTCATAG 
TTGGCTCATA 
TATTAACAAA 
CTTAAAGTTT 
ACCAACACCC 
TAGCAAATCC 
AAATGATCTC 
CTGATATGGT 
TTTACTTTCA 
TGTTGAGGGA 
GCTCTGACAG 
GGATGATATG 
AAGCTTGGTG 
CAAA6AAATG 
TGCCCCCGCA 
GACGAGGTGT 
AATATTGAAA 
ATTGGAGAAA 



AT^AAACACGC 
CCCAAGATCA 
ACCCTTCAAG 
TACAGGTGCT 
TCTATTGCAA 
ACAGGCCAAA 
CCAAACTGAC 
ATCATCTCCA 
AATTCAGCAA 
AGAGAGAAAA 
CAACACTGAG 
CAAATACGAG 
TGTCTCACCT 
GGAAATAAAT 
AGAGAAAGTG 
GCCACCTGTG 
GAATAAGGAG 
ATTAAAAGAC 
GAAAGAAGCC 
GCTGTACAAA 
TGTCACAGCT 
GTCCAGGGCT 
GAATGAGTTG 
CACAGAACAT 
AATAAGCAAT 
ACAACTCTTA 
AAAACTCCAG 
TGTGAAAGAG 
GGTGAAAAGA 
TGAACTTCTG 
TGAGAGCTCT 
AGTCACCAAA 
ATCCAAAAGG 
CCAGCTGGCG 
TCTGTATGCT 
TACCATGTGT 
CCTTGTCATC 
TTCGTGGTAT 
TCTCCCAGGA 
AAAGTCCCTG 
CCTGATGCTG 
ACCGAGTGAT 
GTGCCATTCA 
AATTTCTAAC 
TGATATCCTG 
CTCATCACCC 
TAAAAACAAA 
CAAATTTTCT 
TTTGTGTTTC 
TTCTTAACCT 
AAGATGTTTT 
ATACTTTTTA 
TCAGCTATAG 
CTAGAACTTT 
TCCTTTTACC 
TAGAAGGACT 
AATATTATGG 
GGTTAAAAAT 
GAGAGACAGA 
GGCCCATGTC 
TTGATCTGTA 
TTACTTTTCT 
TGAAAGGACC 
CAGAAATGCT 
AGCCTTTTAT 
AGTTATATGC 
ATAAACTTGT 



AAAGCTCCAC 
ATAACTTCGA 
GCTGAGATCA 
GATAGCTTAT 
GCAAAAGTTG 
TCACCCAAGG 
TTGGGCCCAT 
TCTGTCTTAA 
CTGCAAGAGA 
CAGCTACAGG 
ATTTCAGAGA 
GAGGCTATGA 
GAAAGCATG6 
GTGCTAAAGC 
AGAGAGTTAG 
GAAGAGTACG 
AAAGCATTTT 
ACACTAAAAA 
ATGAATAGGA 
GAAGCCCAGG 
GAATATATCC 
AAAGCAGAAG 
ACCCAGCTCA 
TTGCAAGTGA 
CTTAAGGAAC 
GAAGAGAAAG 
TCATCCTTAG 
AAAGAGT^G 
GAAAAGGAAA 
CAAAAATTCC 
TCAAAACTGG 
TTGAAGGAG6 
CAGAGTCAGC 
GAATGCAAGA 
GTGCAGG6CC 
AAAAACCAGT 
TGTCTTTGTG 
GCACTGTGGC 
GAAGACTGCC 
TCATCCCTTC 
GCAGGGGGGC 
GTGCTGAGGC 
GCACAGGAGA 
TAAATTAGTA 
TGGATTTAAA 
TGCTTATCAA 
AGCAAGTT6T 
ATGCAAACTT 
AACAAGGGAC 
CAACTTTT6T 
CATATCTGAA 
GGTA6CCATG 
CAAAAGGTAG 
CAGCCATGGT 
CGTTGAATGT 
GTTAGTACCA 
TGCATAGCAG 
GTTTCTAACA 
ATCCTGGACT 
CCACAAGGCT 
TTGGATAAGG 
TAAGTTGTTT 
TTTTGTCACA 
GCAGAGTATA 
CTTGTTTCCG 
TGTAGTTTTT 
TTCATTTTGC 



CACCTCCTAT 
CTCC7VCTATC 
GTTCTATACG 
TGGATATAAG 
CTTCCCTTAC 
AGGCGGAAGC 
CCCTGGGAAA 
TACATTCTTT 
TTTTGCAAGA 
TCGAACTCCA 
ACAGCTCTGA 
AAGAAGTCCT 
ATAATTATTC 
AGGATCTGCA 
AGGAAAAACT 
AGGAAATGAA 
TGTTTGAGAA 
GTCAGATGAC 
TGATAGATGA 
CTGAGCTGGA 
ATAAAGCAGA 
ATGCACTGTC 
AACAACTGGT 
TAACCACGCT 
ACCTTGCAAG 
CTGCTATGAC 
AGAGTGAAGT 
TCCATTCAGA 
ATATTCa^GAC 
AGCAAGCTCA 
AGGAAGATAA 
CCTTGAACAG 
AGCTGGAGGC 
AACAACACCA 
AGATGGATGA 
CTCAAAAGAA 
TTAGATCCAG 
CTAGCGTAGC 
CXSCCTCAGT^ 
AGATTCCAGA 
CCCCTCCTCC 
CTCCTGCAGT 
GCTCTTTTTG 
TGGCACTAGT 
AACTCTAATT 
TATTCAGTTT 
CCTTAAAAGT 
GCCTCCTGCT 
AGTAAACTGT 
AGAAGTATTT 
CTCCTAAATA 
CTTGAGACTT 
TTATGTATGC 
GTCTTCAGM 
TTTGAATGCC 
ATCTGTTTTT 
AGGTCTCGGA 
CGCTTGCAAC 
CTCCAAAGTA 
GCTTGGCCTC 
ACCAATGACA 
AATTATAGTT 
GCACTTCAGA 
TATUU^CTTGA 
GATGCATATT 
AGTATTTTGT 
AAAAAAAAAA 



CAGTCCTACC 


960 


GGGAAAGGAA 


1020 


AGAAAACAAA 


1080 


TTCTGAAGCT 


1140 


CTTACACAAT 


1200 


AGACCTAAGC 


1260 


ACXTTGGTGAA 


1320 


AG6TAAATCC 


1380 


TCTACAGAAG 


1440 


ATCCCGAAGG 


1500 


CCTCAGCCAG 


1560 


TAGTGTGCAG 


1620 


ACATTTCCAC 


1680 


GAATGCATTA 


1740 


GGTAGAGAGG 


1800 


AAGTTCATAT 


1860 


ATACCAAGAA 


1920 


ACAGGAAGCC 


1980 


ACTCAATAAA 


2040 


GGATTACAGG 


2100 


GCATGAGA7A 


2160 


TGAAATGAAG 


2220 


GGATGCACAA 


2280 


GCGGACTGCA 


2340 


CAAGGAAGTG 


2400 


TGATGCAATG 


2460 


GAGTGTGTTG 


2520 


GGTTGTCCAG 


2580 


TCTCTTGAAA 


2640 


GGAAGAACTT 


2700 


AGATAAAAAG 


2760 


CCTCTCCCAQ 


2820 


GCTGCA6CAG 


2880 


GGAGGTCATA 


2940 


AGAT6TCCAG 


3000 


GTAAAGTGGA 


3060 


AGTTGTCGGC 


3120 


TTCTTCCCTT 


3180 


CTGCTTAGAG 


3240 


GCTGGGATCA 


3300 


ATCCCTGACT 


3360 


GAATGCTCCT 


3420 


CCTTTGGCTT 


3480/^ 


TATGAAGTAT 


3540 


CCATGTTTTC 


3600 


GATGAGCACT 


3660 


TCTTTTTTTA 


3720 


GTTATCTGTG 


3780 


GTGTTTACAG 


3840 


TTTTCTCT6T 


3900 


AGTGAAATTA 


3960 


TTTAAAAATA 


4020 


CAGACCTAAT 


4080 


TTGTAGCGCA 


4140 


CTGACTCTAC 


4200 


CAACTTTGAA 


4260 


AAAAAAATAT 


4320 


TTCCCTTATG 


4380 


TTTAACTGAA 


4440 


AGTGGGTGCT 


4500 


GCAAAGCAAA 


4560 


AAGCAATTTC 


4620 


AAATACACAA 


4680 


GACATTTTTG 


4740 


TATTACGAGT 


4800 


CTTTGTAATT 


4860 


AAAAAAAAAA 


4920 



Seq ID No: 145 Protein sequence: 
Protein Accession #: NP 056392.1 



1 11 21 31 41 51 

75 1 1 I I I I 

MKSUCAKFRK SDTNEPniKND DRLLQAVENG DABKVASLLG XKGASATKHD SEGKTAFHLA 60 



248 



BNSDOCID: <WO 0a079492A2_l_> 



wo 02/079492 PCT/US02/04915 



AAKGHVECLR VMITHGVDVT AQDTTGHSAL HIiAAKNSHHE CIRRIiLQSKC PAESVDSSGK 120 

TALHYAAAQG CLQAVQILCE HKSPINLKDL DGNIPLLIAV QNGHSEICHF IiLDHGADVMS 180 

RNKSGRTALM LACEIGSSNA VEAIjIKKGAD LNLVDSI/SYN ALHYSKLSEN AGIQSIiIiLSK 240 

ISQDADIiKTP TKPKQHDQVS KISSERSGTP KTRKAPPPPI SPTQLSDVSS PRSITSTPLS 300 

5 GKESVFFAEP PFKAEISSIR ENKDRLSDST TGADSIiLDIS SEADQQDIiLS IiIiQAICVASI«T 360 

LHNKEIiQDKIi QAKSPKEAEA DLSFDSYHST QTDLGPSLGK PGBTSPPDSK SSPSVLIHSL 420 

GKSTTDNDVR IQQLQEILQD IiQKRLESSEA ERKQLQVELQ SRRAEI*VCIiN NTEISBHSSD 480 

IiSQKLKETQS KYEEAMKEVL SVQKQMKIjGL VSPESMDNYS HFHBLRVTEB BINVIjKQDLQ 540 

NALEESERNK EKVRBIiEEKL VEREKGTVIK PPVEEYEBMK SSYCSVZENM NKEKAPLPEK 600 

10 YQEAQEBIMK LICDTLKSQMT QEASDEAEDM KEAMNRMIDE LNKQVSELSQ IiYKEAQAELB 660 

DYRKRKSLED VTAEYIHKAE HEKLMQLTNV SRAKABDALS BMKSQYSKVL NELTQLKQIiV 720 

DAQKBNSVSI TEHLOVITTIi RTAAKBMEEK ISNLKEHIiAS KEVEVAKLEK QLliEEKATVMT 780 

DAMVPRSSYE KIiQSSLESEV SVIASKLKES VKEKEKVHSE WQIRSEVSQ VKRBKENIQT 840 

LIiKSKEQEVN ELLQKFQQAQ EEIAEMKRYA ESSSKLEBDK DKKINEMSKE VTKLKEAIiNS 900 

15 IjSQLSYSTSS SKRQSQQLEA LQQQVKQLQM QLAECKKQHQ EVISVYRMHL LYAVQGQMDB 960 
DVQKVLKQIL TMCKNQSQKK 

Seq ID NO: 146 DNA sequence 
Nucleic Acid Accession #: NM_000459.1 

20 Coding sequence: 149-3523 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I ' I 

25 CTTCTGTGCT GTTCCTTCTT GCCTCTAACT TGTAAACAAG ACGTACTAGG ACGATGCTAA 60 

TGGAAAGTCA CAAACCGCTG GGTTTTTGAA AGGATCCTTG GGACCTCATG CACATTTGTG 120 

GAAACTGGAT GGAGAGATTT GGGGAAG CAT GG ACTCTTTA GCCAGCTTAG TTCTCTGTGG 180 

AGTCAGCTTG CTCCTTTCTG GAACTGTG6A AGGTGCCATG GACTTGATCT TGATCAATTC 240 

CCTACCTCTT GTATCTGATG CTGAAACATC TCTCACCTGC ATTGCCTCTG GGTGGCGCCC 300 

30 CCATGAGCCC ATCACCATAG GAAGG6ACTT T6AAGCCTTA ATGAACCAGC ACCAGGATCC 360 

GCTGGAAGTT ACTCAAGATG TGACCAGAGA ATG6GCTAAA AAAGTTGTTT 66AAGAGA6A 420 

AAA6GCTAGT AAGATCAATG GTGCTTATTT CTGTGAA6GG CGAGTTCGAG GAGAGGCmT 480 

CAGGATACGA ACCATGAAGA TGCGTCAACA AGCTTCCTTC CTACCAGCTA CTTTAACTAT 540 

GACTGTGGAC AAGGGAGATA ACGTGAACAT ATCTTTCAAA AAGGTATTGA TTAAAGAAGA 600 

35 AGATGCAGTG ATTTACAAAA ATGGTTCCTT CATCCATTCA GTGCCCCGGC ATGAAGTACC 660 

TGATATTCTA GAAGTACACC TGCCTCATGC TCAGCCCCAG GATGCTGGAG TGTACTCGGC 720 

CAGGTATATA GGAGGAAACC TCTTCACCTC GGCCTTCACC AGGCTGATAG TCCGGAGATG 780 

TGAAGCCCAG AAGTGGGGAC CTGAATGCAA CCATCTCTGT ACTGCTTGTA TGAACAATGG 840 

TGTCTGCCAT GAAGATACTG GAGAAT6CAT TTGCCCTCCT GGGTTTATGG GAAGGACGTG 900 

40 TGAGAAGGCT TGTGAACTGC ACACGTTTGG CAGAACTTGT AAAGAAAGGT GCAGTGGACA 960 

AGAGGGATGC AAGTCTTATG TGTTCTGTCT CCCTGACCCC TATGGGTGTT CCTGTGCCAC 1020 

AGGCTGGAAG GGTCTGCAGT GCAATGAAGC ATGCCACCCT GGTTTTTACG GGCCAGATTG 1080 

TAAGCTTAGG TGCAGCTGCA ACAATGGGGA GATGTGTGAT CGCTTCC7UVG GATGTCTCTG 1140 

CTCTCCAGGA TGGCAGGGGC TCCAGTGTGA GAGAGAAGGC ATACCGAGGA TGACCCCAAA 1200 

45 GATAGTGGAT TTGCCAGATC ATATAGAAGT AAACAGTGGT AAATTTAATC CCATTTGCAA 1260 

AGCTTCTGGC TGGCCGCTAC CTACTAATGA AGAAATGACC CTGGTGAAGC C6GATGGGAC 1320 

AGTGCTCCAT CCAAAAGACT TTAACCATAC GGATCATTTC TCAGTAGCCA TATTCACCAT 1380 

CCACCGGATC CTCCCCCCTG ACTCAGGAGT TTGGGTCTGC AGTGTGAACA CAGTGGCTGG 1440 

GATG6TGGAA AAGCCCTTCA ACATTTCTGT TAAAGTTCTT CCAAAGCCCC TGAATGCCCC 1500 

50 AAAC6TGATT GACACTGGAC ATAACTTTGC TGTCATCAAC ATCAGCTCTG A6CCTTACTT 1560 

TGGGGATGGA CCAATCAAAT CCAAGAAGCT TCTATACAAA CCCGTTAATC ACTATGAGGC 1620 

TTGGCAACAT ATTCAAGT6A CAAATGAGAT TGTTACACTC AACTATTTGG AACCTCGGAC 1680 

AGAATATGAA CTCTGTGTGC AACTGGTCC6 TCGTGGAGAG GGTGG6GAAG GGCATCCTGG 1740 

ACCTGTGAGA CGCTTCACAA CAGCTTCTAT CXSGACTCCCT CCTCCAAGAG GTCTAAATCT 1800 

55 CCTGCCTAAA AGTCAGACCA CTCTAAATTT GACCTGGCAA CCAATATTTC CAAGCTCGGA 1860 

AGATGACTTT TATGTTGAAG TGGAGAGAAG GTCTGTGCAA A7VAAGTGATC AGCAGAATAT 1920 

TAAAGTTCCA GGCAACTTGA CTTCGGTGCT ACTTAACAAC TTACATCCCA GGGAGCAGTA 1980 

CGTGGTCCGA GCTAGAGTCA ACACCAAGGC CCAGGGGGAA TGGAGTGAAG ATCTCACTGC 2040 

TTGGACCCTT AGTGACATTC TTCCTCCTCA ACCAGAAAAC ATCAAGATTT CCAACATTAC 2100 

60 ACACTCCTCG GCTGTGATTT CTTGGACAAT ATTGGATGGC TATTCTATTT CTTCTATTAC 2160 

TATCCGTTAC AAGGTTCAAG GCAAGAATGA AGACCAGCAC GTTGATGTGA AGAT/U^GAA 2220 

TGCCACCATC ATTCAGTATC AGCTCAAGGG CCTAGAGCCT GAAACAGCAT ACCAGGTGGA 2280 

CATTTTTGCA GAGAACAACA TAGGGTCAAG CAACCCAGCC TTTTCTCATG AACTGGTGAC 2340 

CCTCCCAGAA TCTCAAGCAC CAGCX3GACCT CGGAGGGGGG AAGATGCTGC TTATAGCCAT 2400 

65 CCTTGGCTCT' GCTGGAATGA CCTGCCTGAC TGTGCTGTTG GCCTTTCTGA TCATATTGCA 2460 

ATTGAAGAGG GCAAATGTGC AAAGGAGAAT GGCCCAAGCC TTCCAAAACG TGAGGGAAGA 2520 

ACCAGCTGTG CAGTTCAACT CAGGGACTCT GGCCCTAAAC AGGAAGGTCA AAAACAACX:C 2580 

AGATCCTACA ATTTATCCAG TGCTTGACTG GAATGACATC AAATTTCAAG ATGTGATTGG 2640 

GGAGGGCAAT TTTGGCCAAG TTCTTAAGGC GCX3CATCAAG AAGGATGGGT TACGOATGGA 2700 

70 TGCTGCCATC AAAAGAATGA AAGAATATGC CTCCAAAGAT GATCACAGGG A CTTT GCAGG 2760 

AGAACTGGAA GTTCTTTGTA AACTTGGACA CCATCCAAAC ATCATCAATC TCTTAGGAGC 2820 

ATGTGAACAT CGAGGCTACT TGTACCTGGC CATTGAGTAC 6C6CCCCATG GAAACCTTCT 2880 

GGACTTCCTT CGCAAGAGCC GTGTGCTQGA GACGGACCCA GCATTTQCCA TTGCCAATAG 2940 

CACCX3CGTCC ACACTGTCCT CCCAGCAGCT CCTTCACTTC GCTGCCGACG TGGCCCGGG6 3000 

75 CATGGACTAC TTGAGCCAAA AACAGTTTAT CCACAGGQAT CTGGCTGCCA 6AAACATTTT 3060 

AGTTGGTGAA AACTATGTGO CAAAAATAGC AGATTTTGGA TTGTCCCGAG 6TCAAGAGGT 3120 



249 



BNSDOCIO: <WO ^02070492A2_L> 



wo 02/079492 



PCT/US02/04915 



GTACGTGAAA AAGACAATGG GAAGGCTCCC 
TTACAGTGTG TACACAACCA ACAGTGATGT 
TGTTAGCTTA GGAGGCACAC CCTACTGCGG 
GCCCCAGGGC TACAGACTGG AGAAGCCCCT 
5 GAGACAATGC TGGCGGGAGA AGCCTTATGA 
CTTAAACAGA ATGTTAGAGG AGCGAAAGAC 
TACTTATGCA GGAATTGACT GTTCTGCTGA 
CCCTCTGTTT CCCTTTCACT GGCATGGGAG 
CTGCCAAAGG ATGTGATATA TAAGTGTACA 

10 TAATATTTAA GACACTGAAA AATCTAAGT6 
TCCCTCACCT GTAGCATGCC AGTCCCX3TTT 
TTTCCACAGC CTGCAA6TTC AGTCCAGGAT 
TTGCTTACAA GCCTAA6AAT CTTTAGAGAA 
ATTTTCTTTT CTTTTCTCTG GTAATATTGA 

15 TGGGTGACAT TTGGGAGACA T6TGACATTT 
GCACATTGTA AAAAGTTTTA GTTTTGATGA 
CACTTTGCAC TGATATATCA TGAGTGAATA 

Seq ID No: 147 Protein aerpiianr?** « 
20 Protein Accession #: NP_000450.1 



AGTGCGCTGG ATGGCCATCG AGTCACTGAA 3180 

ATGGTCCTAT GGTGTGTTAC TATGGGAGAT 324 0 

GATGACTTGT GCAGAACTCT ACGAGAAGCT 3300 

GAACTGTGAT GATGAGGTGT ATGATCTAAT 3360 

GAGGCCATCA TTTGCCCAGA TATTGGTGTC 3420 

CTACGTGAAT ACCACGCTTT ATGAGAAGTT 3480 

AGAAGCGGCC TAGGACAGAA CATCTGTATA 354 0 

ACCCTTGACA ACTGCTGAGA AAACATGCCT 3600 

TATGTGCTGG AATTCTAACA AGTCATAGGT 3660 

ATATAAATCA GATTCTTCTC TCTCATTTTA 3720 

CATTTAGTCA TGTGACCACT CTGTCTTGTG 3780 

QCTAACATCT AAAAATA6AC TTAAATCTCA 3840 

GTATACATAA GTTTAGGATA AAATAATGG6 3900 

CTTGTATATT TTAAGAAATA ACA6AAAGCC 3960 

ATATATTGAA TTAATATCCC TACATGTATT 4020 

6TT6T6AGTT TACCTTGTAT ACTGTAGGCA 4080 
/y^TGTCTTGC CTACTCAAAA AAAAAAAA 



25 



30 



35 



40 



45 



1 
I 

MDSIiASIiVIiC 
FEALMNQHQD 
QASFI/PATLT 
AQPQDAGVYS 
ICPPGFMGRT 
ACHPGFYGPD 
VNS6KFNPIC 
VWVCSVNTVA 
LLYKPVNHYE 
IGI*PPPRGUt 
LIiNNLHPREQ 
IliDGYSISSI 
SNPAFSHEIiV 
MAQAFQHVRE 
ARIKKDGIiRM 
AIEYAPHGNL 
IHRDIiAARNI 
VWSYGVLIiWE 
ERPSFAQILV 



11 
I 

GVSLLLSGTV 
PLEVTQDVTR 
MTVDKGDNVN 
ARYIGGNLFT 
CEKACELHTF 
CKLRCSCNNG 
KASGWPLPTN 
GNVEKPFNIS 
AWQHIQVTNE 
LLPKSQTTLN 
YWRARVNTK 
TIRYKVQGKN 
TIiPESQAPAD 
EPAVQFNSGT 
DAAIKRMKEY 
LDPIiRKSRVL 
LVGENYVAKI 
IVSIjGGTPYC 
SLNRMLEERK 



21 

I 

EGAMDLILIN 
EWAKKWWKR 
ISFKKVLIKE 
SAFTRLIVRR 
GRTCKERCSG 
EMCDRFQGCL 
EEMTI.VKPDG 
VKVLPKPIiHA 
IVTLNYIjEPR 
LTWQPIPPSS 
AQGBWSEDIiT 
EDQHVDVKIK 
LGGGKMLLIA 
LALNRKVKNN 
ASKDDHRDPA 
ETDPAFAIAN 
ADPGLSRGQE 
GMTCAELYEK 
TYVNTTLYEK 



31 

I 

SLPIjVSDAET 
EKASKINGAY 
EDAVIYKNGS 
CEAQKWGPEC 
QEGCKSYVFC 
CSPGWQGLQC 
TVIiHPKDFNH 
PNVIDTGHNF 
TEYELCVQIiV 
BDDFYVBVBR 
AWTLSDIIiPP 
NATIIQYQLK 
iLGSAGMTCIi 
PDPTIYPVIiD 
GELEVIiCKIiG 
STASTLSSQQ 
VYVKKTMGRL 
LPQGYRLEKP 
PTYAGIDCSA 



41 

1 

SLTCIASGWR 
FCEGRVRGEA 
FIHSVPRHEV 
NHLCTACMNN 
LPDPYGCSCA 
EREGIPRMTP 
TDHFSVAIFT 
AVINISSEPY 
RRGEGGEGHP 
RSVQKSDQQN 
QPENIKISNZ 
GXiEPETAYQV 
TVLLAFLIIL 
WNDIECFQDVZ 
HHPNIINLLG 
LliHFAADVAR 
PVRWMAIESL 
IiNCDDEVYDIi 
EEAA 



51 
I 

PHEPITIGRD 
IRIRTMKMRQ 
PDIBEVHIiPH 
GVCHEDTGEC 
TGWKGLQCNE 
KIVDLPDHIE 
IHRILPPDSG 
FGDGPIKSKK 
GPVRRPTTAS 
IKVPGMIiTSV 
THSSAVISWT 
DIFAES3NIGS 
QIiKRAKVQRR 
GEGNFGQVLK 
ACEHRGYLYIi 
GMDYItSQKQF 
NYSVYTTNSD 
MRQCWREKPY 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



Seq ID NO: 148 PNA secruence 

Nucleic Acid Accession #: NM_0005S2.2 

Coding sequence: 311-8752 (underlined sequences correspond to start and stop codons) 



50 



55 



60 



65 



70 



75 



A6CTCACA6C 
TATCTCCCCC 
GA6CT6TAGC 
GCACCATTGT 
GCAG6GGAAG 
GCCCTCATTT 
GCCAGGGACC 
TTTCGGAAGT 
CAGTTACCTC 
GAATGGCAAG 
TGTCAATGGT 
GCTGTATCTA 
GGCCAGGATC 
GACCTGCGGG 
AGGGACCTTG 
ACAGTGGTGT 
GCAGAAGGGC 
CCACCCTCTG 
TGCT6GGGGG 
GGAG6GAATG 
T6GTATG6AG 
CAAT6AAATG 
GGAT6AAGGC 
CXrCTCCCGGC 
GATCTGCAGC 



11 
I 

TATTGT6GTG. 
AGCAGTGG6G 
AGACCTGATT 
CCA6CAGCTG 
GCACCATTGT 
AT6ATTCCTG 
CTTTGTGCAG 
GACTTCGTCA 
CTGGCAGGGG 
AGAGTGAGCC 
ACCGT6ACAC 
GAAACTGAGG 
GATGGCAGCG 
CTGTGTGGCA 
ACCTCGGACC 
GAACGGGCAT 
CTGTGGGAGC 
GTGGACCCCG 
CTGGAGTGCG 
GT6CTGTAC6 
TATAG6CAGT 
T6TCA6GAGC 
CTCTGCGTGG 
ACCTCCCTCT 
AATGAAGAAT 



21 
I 

GGAAAGGGAG 
ACTCCACAGC 
GAGCCTTTGC 
'A6TTTCCCAG 
CCAGCA6CTG 
CCAGATTTGC 
AAGGAACTCG 
ACACCTTTGA 
GCTGCCAGAA 
TCTCCGTGTA 
AGGGGGACCA 
CTGGGTACTA 
GCAACTTTCA 
ACTTTAACAT 
CTTATGACTT 
CTCCTCCCAG 
AGTGCCAGCT 
AGCCTTTTGT 
CCTGCCCTGC 
GCTGGACCGA 
GTGTGTCCCC 
GATGCGTGGA 
AGAGCACC6A 
CTCGAGACTG 
GTCCAGGG6A 



31 
I 

GGT6GTTGGT 
CCCT6GGCTA 
A6CAGCTGA6 
GGACCTT6GA 
A6TTTCCCAG 
CGGGGTGCTG 
CGGCAGGTCA 
TGGGAGCATG 
ACGCTCCTTC 
TCTTGGGGAA 
AAGAGTCTCC 
CAAGCTGTCC 
AGTCCTGCTG 
CTTTGCTGAA 
TGCCAACTCA 
CAGCTCATGC 
TCTGAAGAGC 
GGCCCTGTGT 
CCTCCTGGAG 
CCACAGCGCG 
TTGCGCCAGG 
TGGCTGCAGC 
GTGTCCCTCC 
CAACACCTGC 
GTGCCTTGTC 



41 

I 

GGAT6TCACA 
CATAACA6CA 
AGCATGGCCT 
6ATA6CCGCA 
GGACCTTGGA 
CTTGCTCTGG 
tCCACGGCCC 
TACAGCTTTG 
TCGATTATTG 
TTTTTTGACA 
ATGCCCTATG 
GGTGAGGCCT 
TCAGACAGAT 
GATGACTTTA 
TGGGCTCTGA 
AACATCTCCT 
ACCTCGGTGT 
GAGAAGACTT 
TACGCCCGGA 
TGCAGCCCAG 
ACCTGCCAGA 
TGCCCTGAGG 
GTGCATTCCG 
ATTTGCC6AA 
ACTGGTCAAT 



51 
1 

GCTTGGGCTT 
AGACAGTCCG 
AGG6TGGGCG 
GCCCTCATTT 
GATAGCCGCA 
CCCTCATTTT 
GATGCAGCCT 
CGGGATACT6 
GGGACTTCCA 
TCCATTTGTT 
CCTCCTUU^GG 
ATGGCTTTGT 
ACTTCAACAA 
TGACCCAAGA 
GCAGTGGAGA 
CTGGGGAAAT 
TTGCCCGCTG 
TGTGTGAGTG 
CCTGTGCCCA 
TGTGCCCTGC 
GCCTGCACAT 
GACAGCTCCT 
GAAAGC6CTA 
ACAGCCAGTG 
CCCACTTCAA 



60 
120 
180 
240 
300 
360 
420 
480 
54 0 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
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GAGCTTTGAC T^CAGATACT TCACCTTCAG 
TTGCCAGGAC CACTCCTTCT CCATTGTCAT 
CGCTGTGTGC ACCCGCTCCG TCACCGTCCG 
ACTGAAGCAT GGGGCAGGAG TTGCCATGGA 
5 AGGTGACCTC CGCATCCAGC ATACAGTGAC 
CCTGCAGATG GACTGGGAT6 GCCGCGG6A6 
C66GAAGACC TGC6GCCTGT GTGG6AATTA 
CCCCTCTGGG CTGGCAGAGC CCCXK3GTGGA 
GGACTGCCAG GACCTGCAGA AGCAGCACA6 

10 CAGGTTCTCC GAGGAGGC6T GCGCGGTCCT 
TGCCGTCAGC CCGCTGCCCT ACCTGCG6AA 
CGGCCGCGAG TGCCTGTGCG GCGCCCTGGC 
CGTGCX3CGTC GC6TGGCGCG AGCCAG6CC6 
6TACCTGCA6 TGCGGGACCC CCTGCAACCT 

15 6GAATGCAAT GAGGCCTGCC TGGAGGGCTG 
GAGGGGGGAC TGCGTGCCCA AGGCCCAGTG 
GCCAGAAGAC ATCTTCTCAG ACCATCACAC 
CTGTACCATG AGTGGAGTCC CCGGAAGCTT 
GTCTCATCGC AGCAAAAGGA GCCTATCCTG 

20 CGCTGACAAC CTGCGGGCTG AAGGGCTCGA 
GGAGTGCATG AGCATGGGCT GTGTCTCTGG 
GGAGTGCATG AGCATGGGCT GTGTCTCTGG 
TGAGAACAGA TGTGTGGCCC TGGAAAGGTG 
CCCTGGAGAA ACAGTGAAGA TTGGCTGCAA 

25 CTGCACAGAC CATGTGTGTG ATGCCACGTG 
CTTCGACGGG CTCAAATACC TGTTCCCCGG 
CTGCGGCAGT AACCCTGGGA CCTTTCGGAT 
CTCAGTGAAA TGCAAGAAAC GGGTCACCAT 
TGAC6GGGAG GTGAATGTGA AGAGGCCCAT 

30 GTCTGGCCGG TACATCATTC TGCTGCTGGG 
CCTGAGCATC TCCGTGGTCC TQAAGCAGAC 
GAATTTTQAT GGCATCCAGA ACAATGACCT 
CCCTGTGGAC TTTGGGAACT CCTGGAAAGT 
GCCTCTGGAC TCATCCCCTG CCACCTGCCA 

35 TTCCTCCT6T AGAATCCTTA CCAGTGACGT 
CGAGCCATAT CTGGATGTCT GCATTTACGA 
CGCCTGCTTC TGCGACACCA TTGCTGCCTA 
GGTGACCTGG AGGACGGCCA CATTGTGCCC 
GAACGGGTAT GAGTGTGAGT GGCGCTATAA 

40 TCAGCACCCT GAGCCACTGG CCTGCCCTGT 
CCCTCCAGGG AAAATCCTGG ATGAGCTTTT 
AGTGTGTGAG GTGGCTGGCC ' GGCGTTTTGC 
TGACCCTGAG CACTGCCAGA TTTGCCACTG 
CCAGGAGCCG GGAGGCCTGG TGGTGCCTCC 

45 GTATGTGGAG GACATCTCGG AACCGCCGTT 
CCTGGTCTTC CTGCTGGATG GCTCCTCCAG 
GGCCTTTGTG GTGGACATGA TGGAGCGGCT 
CGTGGTGGAQ TACCACGACG GCTCCCACGC 
GTCAGAGCTG CGGCGCATTG CCAGCCAGGT 

50 CAGCGAGGTC TTGAAATACA CACTGTTCCA 
CTCCC6CATC GCCCTGCTCC TGATGGCCAG 
TGTCC6CTAC GTCCAGGGCC TGAA6AAGAA 
GCCCCATGCC AACCTCAAGC AGATCCXSCCT 
CTTCGTGCTG AGCAGTGT6G ATGAGCTGGA 

55 CTGTGACCTT GCCCCTGAAG CCCCTCCTCC 
TGTGG6CCCG G6GCTCTTGG GGGTTTCGAC 
GGATGTGGCG TTCGTCCTGG AAG6ATCGGA 
CAAGGAGTTC ATGGAGGAGG TGATTCAGCG 
CACGGTGCTG CAGTACTCCT ACATGGTGAC 

60 CAAAGGGGAC ATCCTGCAGC GGGTGCGAGA 
CACTGGGCTG GCCCTGCGGT ACCTCTCTGA 
GGAGCAGGCG CCCAACCTGG TCTACATGGT 
GAGGCTGCCT GGAGACATCC AGGTGGTGCC 
GGAGCTGGAG AGGATTGGCT GGCCCAATGC 

65 CCCCCGAGAG GCTCCTGACC TGGTGCTGCA 
CCCCACCCTC TCCCCTGCAC CTGACTGCAG 
TGGCTCCTCC AGTTTCCCAG CTTCTTATTT 
CATTTCAAAA GCCAATATAG GGCCTCGTCT 
CATCACCACC ATTGACGTGC CATGGAACGT 
70 TGTGGACGTC ATGCAGCGGG AGGGAGGCCC 
TGTGCGATAC TTGACTTCA6 AAATGCAT6G 
CATCCTGGTC ACGGACGTCT CTGTGGATTC 
CAACAGAGTG ACA6TGTTCC CTATTGGAAT 
GATCTT66CA G6CCCAGCA6 GCGACTCCAA 

75 CCCTACCATG GTCACCTTGG GCAATTCCTT 
GATTTGCATG GATGAGGATG 6GAATGAGAA 



TGGGATCTGC CAGTACCTGC TGGCCCGGGA 1560 

TGAGACT6TC CAGTGTGCTG AT6ACCGC6A 1620 

GCTGCCTG6C CTGCACAACA GCCTT6T6AA 1680 

TGGCCAGGAC ATCCAGCTCC CCCTCCTGAA 1740 

G6CCTCCGTG CGCCTCAGCT ACG666AG6A 1800 

GCTGCTGGTG AAGCTGTCCC CCGTCTACGC 1860 

CAATG6CAAC CAGGGCGACG ACTTCCTTAC 1920 

GGACTTCGGG AACGCCTG6A AGCT6CAC6G 1980 

CGATCCCTGC GCCCTCAACC CGCGCATGAC 2040 

GACGTCCCCC ACATTCGAGG CCTGCCATCG 2100 

CTGCCGCTAC GACGTGTGCT CCTGCTCGGA 2160 

CAGCTATGCC GCGGCCTGCG CGGGGAGAGG 2220 

CTGTGAGCTG AACTGCCCGA AAGGCCAGGT 2280 

GACCTGCCGC TCTCTCTCTT ACCCGGAT6A 234 0 

CTTCTGCCCC CCAGGGCTCT ACATGGATGA 2400 

CCCCTGTTAC TATGACGGTG AGATCTTCCA 2460 

CATGTGCTAC TGTGAGGATG GCTTCATGCA 2520 

GCTGCCTGAC GCTGTCCTCA GCAGTCCCCT 2580 

TCGGCCCCCC ATGGTCAAGC TGGTGTGTCC 2640 

GTGTACCAJ\A ACGTGCCAGA ACTATGACCT 2700 

CTGCCTCTGC CCCCCGGGCA TGGTCCGGCA 2760 

CTGCCTCTGC CCCCCGGGCA TGGTCCGGCA 2760 

TCCCTGCTTC CATCAGGGCA AGGAGTATGC 2820 

CACTTGTGTC TGTCGGGACC GGAAGTGGAA 2880 

CTCCACGATC GGCATGGCCC ACTACCTCAC 294 0 

GGAGTGCCAG TACGTTCTGG TGCAGGATTA 3000 

CCTAGTGGGG AATAAG6GAT 6CA6CCACCC 3060 

CCTGGTGGAG GGAGGA6AGA TTGAGCTGTT 3120 

GAAGGATGAG ACTCACTTT6 AG6TGGTGGA 3180 

CAAAGCCCTC TCCGTGGTCT GGGACC6CCA 3240 

ATACCAGGAG AAAGT6TGTG GCCTGTGTGG 3300 

CACCAGCA6C AACCTCCAA6 TGGAG6AAGA 3360 

GA6CTC6CAG TGT6CT6ACA CCAGAAAAGT 3420 

TAACAACATC ATGAAGCAGA CGATGGTGGA 34 80 

CTTCCAGGAC TGCAACAAGC TGGTGGACCC 3540 

CACCTGCTCC TGTGAGTCCA TTGGGGACTG 3600 

TGCCCACGTG TGTGCCCAGC ATGGCAAGGT 3660 

CCAGAGCTGC GAGGAGAGGA ATCTCCGGGA 3720 

CAGCTGTGCA CCTGCCTGTC AAGTCACGTG 3780 

GCAGTGTGTG GAGGGCTGCC ATGCCCACTG 3840 

GCAGACCTGC GTTGACCCTG T^GACTGTCC 3900 

CTCAGGAAAG AAAGTCACCT TG7VATCCCAG 3960 

TGATGTTGTC AACCTCACCT GTGAAGCCTG 4020 

CACAGATGCC CCGGTGAGCC CCACCACTCT 4080 

GCACGATTTC TACTGCAGCA GGCTACTGGA 4140 

GCTGTCCGAG GCTGAGTTTG AAGTGCTQAA 4200 

GCGCATCTCC CAGAAGTGGG TCCGCGTGGC 4260 

CTACATCGGG CTCAAG6ACC GGAAGCGACC 4320 

GAAGTATGC6 G6CAGCCAGG T6GCCTCCAC 4380 

AATCTTCAGC AAGATCGACC GCCCTGAAGC 4440 

CCAGGAGCCG CAACG6ATGT CCCGGAACTT 4500 

GAAGGTCATT 6TGATCCC66 TGGGCATTGG 4560 

CATC6AGAAG CAGGCCCCTG AGAACAA66C 4620 

GCAGCAAAG6 GACGAGATCG TTAGCTACCT 4680 

TACTCTGCCC CCCCACATGG CACAAGTCAC 4740 

CCTGGGGCCC AAGAGGAACT CCATGGTTCT 4800 

CAJ\AATTGGT GAAGCCGACT TCAACAGGAG 4860 

GATGGATGTG GGCCAGGACA GCATCCACGT 4920 

CGTGGAGTAC CCCTTCAGCG AGGCACAGTC 4980 

GATCCGCTAC CAGGGCGGCA ACAGGACCAA 5040 

CCACAGCTTC TTGGTCAGCC A6GGTGACCG 5100 

CACCGGAAAT CCTGCCTCTG ATGAGATCAA 5160 

CATTGGAGTG GGCCCTAATG CCAACGTGCA 5220 

CCCTATCCTC ATCCAGGACT TTGAGACGCT 5280 

GAGGTGCTGC TCCGGAGAGG GGCTGCAGAT 5340 

CCAGCCCCTG GACGTGATCC TTCTCCTGGA 5400 

TGATGAAATG AAGAGTTTCG CCAAGGCTTT 5460 

CACTCAGGTG TCAGTGCTGC AGTATGGAAG 5520 

GGTCCCGGAG AT^CCCATT TGCTGAGCCT 5580 

CAGCCAAATC GG6GAT6CCT TG6GCTTT6C 5640 

T6CCAGGCCG GGAGCCTCAA AGGC6GT66T 5700 

AGTGGATGCA GCAGCTGATG CCGCCAGGTC 5760 

TGGAGATCGC TACGATGCAG CCCAGCTACG 5820 

CGTGGTGAA6 CTCCAGCGAA TCGAAOACCT 5880 

CCTCCACAAA CTQTGCTCTG 6ATTTGTTAG 5940 

GAGGCCC6GG GACGTCTGGA CCTTGCCA6A 6000 
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CCAGTGCCAC ACCGTGACTT GCCAGCCAGA TGGCCAGACC TTGCTGAAGA GTCATC6GGT 6060 

CAACTGT6AC CGGGGGCTGA GGCCTTCGTG CCCTAACAGC CAGTCCCCTG TTAAAGTG6A 6120 

AGAGACCTGT GGCTGCCGCT GGACCTGCCC CTGCGTGTGC ACAGGCAGCT CCACTCGGCA 6180 

CATCGTGACC TTTGATGGGC AG/^TTTCAA GCTGACTGGC AGCTGTTCTT ATGTCCTATT 624 0 

5 TCAAAACAAG GAGCAGGACC TGGAGGTGAT TCTCCATAAT GGTGCCTGCA GCCCTGGAGC 6300 

AAGGCAGGGC TGCATGAAAT CCATCGAGGT GAAGCACAGT GCCCTCTCCG TCGAGCTGCA 6360 

CAGTGACATG GAGGTGACGG TGAATGGGAG ACTGGTCTCT GTTCCTTACG TGGGTGGGAA 6420 

CATG6AAGTC AACGTTTATG GT6CCATCAT GCATGAGGTC AGATTCAATC ACCTTGGTCA 6480 

CATCTTCACA TTCACTCCAC AAAACAATGA GTTCCAACTG CAGCTCAGCC CCAAGACTTT 6540 

10 TGCTTCAAAG ACGTATGGTC TGTGTGG6AT CTGTGATGAG AACGGA6CCA ATGACTTCAT 6600 

GCTGAGGGAT G6CACAGTCA CCACAGACTG GAAAACACTT GTTCA6GAAT G6ACT6T6CA 6660 

GCG6CCAGGG CAGACGTGCC A6CCCATCCT GGAG6AGCAG TGTCTTGTCC CCGACAGCTC 6720 

CCACTGCCAG GTCCTCCTCT TACCACTGTT TGCTGAAT6C CACMGGTCC TGGCTCCAGC 6780 

CACATTCTAT GCCATCTGCC AGCAGGACAG TTGCCT^CCAG GA6CAAGTGT GT6AGGTGAT 6840 

15 CGCCTCTTAT GCCCACCTCT 6TCX3GACCAA CX3GGGTCTGC GTTGACTGGA GGACACCTGA 6900 

TTTCTGTGCT ATGTCATGCC CACCATCTCT GGTCTACAAC CACTGTOAGC ATGGCTGTCC 6960 

CCGGCACTGT GATGGCAACG TGAGCTCCTG T6GGGAGCAT CXICTCCGAAG GCTGTTTCTG 7020 

CCCTCCAGAT AAAGTCATGT TGGAAGGCAG CTGTGTCCCT GAAGAGGCCT GCACTCAGTG 7080 

CATTGGTGAG GATGGAGTCC AGCACCAGTT CCTGGAAGCC TGGGTCCCGG ACCACCAGCC 7140 

20 CTGTCAGATC TGCACATGCC TCAGCGGGCG GAAGGTCAAC TGCACAACGC AGCCCTGCCC 7200 

CACGGCCAAA GCTCCCACGT GTGGCCTGTG TGAAGTAGCC CGCCTCCGCC AGAATGCAGA 7260 

CCAGTGCTGC CCCGAGTATG AGTGTGTGTG TGACCCAGTG AGCTGTGACC TGCCCCCAGT 7320 

GCCTCACTGT GAACX3TGGCC TCCAGCCCAC ACTGACCAAC CCTGGCGAGT GCAGACCCAA 73 80 

CTTCACCTGC GCCTGCAGGA AGGAGGAGTG CAAAAGAGTG TCCCCACCCT CCTGCCCCCC 744 0 

25 GCACCGTTTG CCCACCCTTC GGAAGACCCA GTGCTGTGAT GAGTATGAGT GTGCCTGCAA 7500 

CTGTGTCAAC TCCACAGTGA GCTGTCCCCT TGGGTACTTG GCCTCAACCG CCACCAATGA 7560 

CTGTGGCTGT ACCACAACCA CCTGCCTTCC GGACAAGGTG TGTGTCCACC GAAGCACCAT 7620 

CTACCCTGTG GGCCAGTTCT GGGAGGAGGG CTGCGATGTG TGCACCTGCA CCGACATGGA 7680 

GGATGCCGTG ATGGGCCTCC GCGTGGCCCA GTGCTCCCAG AAGCCCTGTG AGGACAGCTG 7740 

30 TCGGTCGGGC TTCACTTACG TTCTGCATGA AGGCGAGTGC TGTGGAAGGT GCCTGCCATC 7800 

T6CCTGTGAG GTG6TGACTG GCTCACCGCG GGGGGACTCC CAGTCTTCCT GGAAGAGTGT 7860 

CGGCTCCCAG TQ6GCCTCCC C6GAGAACCC CTGCCTCATC AAT6AGTGTG TCCGAGTGAA 7920 

GGAG6AGGTC TTTATACAAC AAAGGAACGT CTCCTGCCCC CAGCTGGAGG TCXXTCTCTG 7980 

CCCCTCQGGC TTTCAGCTGA GCTGTAAGAC CTCAGCGTGC TGCCCAA6CT GTCGCTGTGA 8040 

35 GCGCATGGAG GCCTGCATGC TCAATGGCAC TGTCATTGGG CCXIGGGAAGA CTGTGATGAT 8100 

CGATGTGTGC ACGACCTGCC GCTGCATGGT GCAGGTGGGG GTCATCTCTG GATTCAAGCT 8160 

GGAGTGCAGG AAGACCACCT GCAACCCCTG CCCCCTGGGT TACAAGGAAG AAAATAACAC 8220 

AGGTGAATGT TGTGGGAGAT GTTTGCCTAC GGCTTGCACC ATTCAGCTAA GAGGAGGACA 8280 

GATCATGACA CTGAAGCGTG ATGAGACGCT CCAGGATGGC TGTGATACTC ACTTCTGCAA 8340 

40 GGTCAATGAG AGAGGAGAGT ACTTCTGGGA GAAGAGGGTC ACAGGCTGCC CACCCTTTGA 8400 

TGAACACAAG TGTCTGGCTG AGGGAGGTAA AATTATGAAA ATTCCAGGCA CCTGCTGTGA 8460 

CACATGTGAG GAGCCTGAGT GCAACGACAT CACTGCCAGG CTGCAGTATG TCAAGGTGGG 8520 

AAGCTGTAAG TCTGAAGTAG AGGTGGATAT CCACTACTGC CAGGGCAAAT GTGCCAGCAA 8580 

AGCCATGTAC TCCATTGACA TCAACGATGT GCAGGACCAG TGCTCCTGCT GCTCTCCGAC 864 0 

45 ACGGACGGAG CCCATGCAGG TGGCCCTGCA CTGCACCAAT GGCTCTGTTG TGTACCATGA 8700 

GGTTCTCAAT GCCATGGAGT GCAAATGCTC CCCCAGGAAG TGCAGCAAG T GAG GCTGCTG 8760 

CAGCTGCATG GGTGCCTGCT GCTGCCTGCC TTGGCCTGAT GGCCAGGCCA GAGTGCTGCC 8820 

AGTCCTCTGC ATGTTCTGCT CTTGTGCCCT TCTGAGCCCA CAATAAAGGC TGA6CTCTTA 8880 
TCTTGCTGCA TGTTCTGCTC TTGTGCCCTT CTGAGCCCAC AAT 

Seq ID No! 149 Protein sequence; 
Protein Accession #: -^NP 000543.1 



55 1 11 21 31 41 51 

I I I I i I 

MIPARFAGVL liAIiAIiZLPGT LCAEGTRGRS STARCSLF6S DFVNTFDGSM YSFAGYCSYL 60 

LAGGCQKRSF SIIGDFQNGK RVSIiSVYLGE FFDIKjFVNG TVTQGDQRVS HPYASKGIiYI. 120 

ETEAGYYKLS GEAYGFVARI DGSGNPQVLL SDRYPNKTCG LCGNFNIFAB DDFMTQEGTL 180 

60 TSDPYDFANS VJALSSGEQWC ERASPPSSSC NISSGEMQKG LWEQCQLLKS TSVPARCHPL 24 0 

VDPEPFVAIiC EKTLCECAGG LECACPALLE YARTCAQEGM VLYGWTDHSA CSPVCPAGME 300 

YRQCVSPCAR TCQSLHINEM CQERCVDGCS CPEGQLLDEG LCVESTECPC VHSGKRYPPG 360 

TSLSRDCNTC ICRNSQWICS NEECPGECLV TGQSHFKSFD NRYFTPSGIC QYLLARDCQD 420 

HSFSIVIETV QCADDRDAVC TRSVTVRLPG liHNSLVKLKH GAGVAMDGQD IQIiPLIiKGDL 4 80 

65 RIQHTVTASV RLSYGEDLQM DWDGRGRLLV KIjSPVYAGKT CGLCGNYNGN QGDDFIiTPSG 540 

LAEPRVEDFG NAWKLHGDCQ DLQKQHSDPC ALNPRMTRFS EEACAVLTSP TFEACHRAVS 600 

PLPYIiRNCRY DVCSCSDGRE CliCGALASYA AACAGRGVRV AWREPGRCEI, NCPKGQVYLQ 660 

CGTPCNLTCR SLSYPDEECN EACLEGCFCP PGLYMDERGD CVPKAQCPCY YDGEIFQPED 720 

IFSDHHTMCY CEDGFMHCTM SGVPGSLLPD AVLSSPLSHR SKRSLSCRPP MVKLVCPADN 780 

70 LRAEGLECTK TCQNYDLECM SMGCVSGCIjC PPGMVRHENR CVALERCPCF HQGKEYAPGE 840 

TVKIGCNTCV CRDRKMNCTD HVCDATCSTI GMAHYLTFDG LKYLFPGECQ YVIiVQDYCXSS 900 

NPGTFRILVG NKGCSHPSVK CKKRVTILVB GGBIELFD6E VNVKRPMKDE THFEWESGR 960 

YIILLLGKAL SWWDRHLSI SWUCQTYQE KVCGLCGNFD GIQNNDXiTSS NLQVEEDPVD 1020 

F6NSHKVSSQ CADTRKVPLD SSPATCHMNI MKQTMVDSSC RILTSDVFQD CNKIiVDPEPY 1080 

75 IiDVCIYDTCS CESI6DCACF CDTIAAYAHV CAQHGKWTW RTATLCPQSC EBRNLRENGY 1140 

ECEHRYNSCA PACQVTCQHP EPIiACPVQCV EGCHAHCPPG KILDEItliQTC VDPEDCPVCE 1200 
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10 



15 



20 



25 



VAGRRFASGK 
DISEPPLHDF 
YHDGSHAYXG 
AIiLIiMASQEP 
SSVDBLEQQR 
FVLEGSOKIG 
ILQRVREIRY 
GDIQWPIGV 
SPAPDCSQPIi 
IDVPWNWPE 
TDVSVDSVDA 
VTIiGNSFliHK 
R6LRPSCPNS 
EQDIiEVIIiHN 
NVYGAIMHEV 
GTVTTDWKTL 
AICQQDSCHQ 
DGNVSSCGDH 
CTCIiSGRKVN 
ERGIiQPTLTN 
STVSCPLGYL 
MGLRVAQCSQ 
WASPENPCLI 
ACMLNGTVIG 
CGRCLPTACT 
CU^GGKIMK 
SIDINDVQDQ 



KVTLNPSDPE 
YCSRLLDLVF 
IiKDRKRPSEZi 
QRMSRNPVRY 
DEIVSYLCDL 
BADFNRSKEF 
QGGNRTNT6L 
GPNANVQELE 
DVII^LIiDGSS 
KAHU5LVDV 
AADAARSNRV 
IiCSGFVRICM 
QSPVKVBBTC 
GACSFGARQG 
RFNHLGHIFT 
VQEWTVQRPG 
EQVCEVIASY 
PSEGCFCPPD 
CTTQPCPTAK 
PGECRPNFTC 
ASTATNDCGC 
KPCEDSCRSG 
NECVRVKEEV 
PGKTVMIDVC 
IQIiRGGQIMT 
IPGTCCDTCE 
CSCCSPTRTE 



HCQICHCDW 
LLDGSSRIiSE 
RRIASQVKYA 
VQGLKKKKVZ 
APEAPPPTIiP 
MBEVZQRMDV 
AIiRYLSDHSF 
RIGWPNAPIL 
SFPASYFDEM 
MQREGGPSQI 
TVPPIGIGDR 
DED6NEKRPG 
GCRWTCPCVC 
OWSIEVKHS 
PTPQNNEPQI> 
QTCQPIIiEEQ 
AHIiCRTNGVC 
KVMLEGSCVP 
APTCGLCEVA 
ACRKEECKRV 
TTTTCLPDKV 
FTYVIiHEGEC 
PIQQRNVSCP 
TTCRCMVQVG 
LKRDETLQDG 
EPECNDITAR 
PMQVALHCTN 



NLTCEACQEP 
AEFEVIiKAFV 
GSQVASTSEV 
VIPVGIGPHA 
PHMAQVTVGP 
GQDSIHVTVL 
XjVSQGDREQA 
IQDFBTLPRE 
KSFAKAFISK 
GDALGFAVRY 
YDAAQLRIIiA 
DVWTIiPDQCH 
TGSSTRHIVT 
ALSVEIjHSDM 
QLSPKTFASK 
CLVPDSSHCQ 
VDWRTPDFCA 
EEACTQCIGE 
RLRQNADQCC 
SPPSCPPHRL 
CVHRSTIYPV 
CGRCIjPSACE 
QLEVPVCPSG 
VISGFKIiECR 
CDTHFCKVNE 
LQYVKVGSCK 
GSWYHEVUI 



GGIjWPPTDA 
VDMMBRLRIS 
liKYTLFQXFS 
NXjKQIRIjIEK 
GIiLGVSTLGP 
QYSYMVTVEY 
PNLVYMVTGN 
APDI.VLQRCC 
ANIGPRLTQV 
IiTSEMHGARP 
GPAGDSNWK 
TVTCQPDGQT 
FDGQNFKLTG 
EVTVNGRLVS 
TYGLCGICDE 
VIiIiliPIiFAEC 
MSCPPSLVYN 
DGVQHQPIiEA 
PEYECVCDPV 
PTLRKTQCCD 
GQFWEEGCDV 
WTGSPRGDS 
FQIiSCKTSAC 
KTTCNPCPIjG 
RGEYFWEKRV 
SEVBVDIHYC 
AMECKCSPRK 



PVSPTTt.YVE 
QKWVRVAWE 
KIDRPEASRX 
QAPENKAFVIi 
KRNSMVLDVA 
PF5EAQSK6D 
PASDBIKRIiP 
SGEGXiQIPTXj 
SVLQYGSITT 
GASKAWILV 
LQRZEDLPTM 
LUCSHRVNCD 
SCSYVLFQNK 
VPYVGGNMEV 
NGRNDFMhRD 
HKVIiAPATFY 
HCEHGCPRHC 
WVPDHQPCQI 
SCDIiPPVPHC 
EYECACNCVN 
CTCTDMEDAV 
QSSWKSVGSQ 
CPSCRCERME 
YKEENNTGEC 
T6CPPFDEHK 
QGKCASKAMY 
CSK 



1260 
1320 
1380 
1440 
1500 
15€0 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 



30 Seg ID NO: 150 DNA sequence 

Nucleic Acid Accession #: NM_001508.l 

Coding sequence: 1-1362 (underlined sequences correspond to start and stop codons) 



35 



40 



45 



50 



55 



60 



ATGG CTTCAC 
CCCGAGTTTG 
TTCGTGATGG 
AAAGGATACT 
TTGGTGTTCC 
ACGTCCAGCT 
GCTACGCTGC 
TTCAGGTACA 
GTCACCTCCG 
GTGAACGTGC 
CAGCCCGAGA 
CAGTCCAGCA 
AT6T6CTGGA 
ACGC6GCCTC 
ACCATCATCT 
ATTC6GAGGA 
GCGTACATGA 



TGCCGCCTGT 
ACCACCGACA 
TCTGCAAGGA 
TCTAAGTCCC 
AATTCTGCTG 



11 
I 

CCAGCCTCCC 
AGGTGGCCAC 
GCCTTCTGGG 
TGCAGAAGGA 
TCATCGGCAT 
ACACCCTGTC 
TGCACGTGCT 
AGGCTGTGTC 
CCCTGGTGGC 
CCAGCCACCG 
CCTCCAATAT 
TCTTCGGCGC 
ACATGATGCA 
CGCAGCTGAG 
TCCTGAG6CT 
TCAT6GCTGC 
TCCTCCTCCC 
ACAC6GTGTC 
CGCTGCAGCA 
GCGCCCGCTT 
67VACTGAGAA 
AGTCATTGAG 
CAGAGAAT6G 



21 
I 

GGGCAGTGAC 
CTGGATCAAA 
GAACAGCGTC 
GGTGACAGAC 
GCCCATGGAG 
CTGCAAGCTG 
GACGCTCAGC 
GGGACCTTGC 
ACTGCCCTTG 
GGGTCTCACT 
GTCCATCTGT 
CTTCGTGGTC 
GGTGCTCATG 
6AAGTCCGAG 
GATTGTTGTG 
6GCCAAACCC 
CTTCTCGGA6 
CTCGCAGCAG 
aSCCAACCAC 
TGTGCAGCGC 
GATTTTCTTA 
TCTCGAGTCA 
TTTTCAGGAG 



31 



41 
I 



TGCTCCCAAA TCATTGATCA 
ATCACCCTTA TTCTGGTGTA 
ACCATTCGGG TCACCCAGGT 
CACATGGTGA GTTTGGCTTG 
TTCTACAGCA TCATCTGGAA 
CACACTTTCC TCTTCGAGGC 
TTTGAGCGCT ACATCGCCAT 
CAGGTGAAGC TGCTGATTGG 
CTGTTTGCCA TGGGTACTGA 
TGCAACCGCT CCAGCACCCG 
ACCS^CCTCT CCAGCCGCTG 
TACCTCGTGG TCCTGCTCTC 
AAAAGCCAGA AGGGCTCGCT 
AGCGAAGAGA 6CAGGACC6C 
ACATTGGCC6 TATGCTGGAT 
AAGCACGACT GGACGAGGTC 
ACGTTTTTCT ACCTCAGCTC 
TTTCGGCGGG TGTTCGTGCA 
GAGAAGCGCC TGCGCGTACA 
CCGTTGCTCT TCGCGTCCCG 
AGCACTTTTC AGAGCGAGGC 
CTAGAGCCCA ACTCAGGC6C 
CATGAAGTTLGa 



51 
I 

CAGTCATGTC 

CCTGATCATC 

GCTGCAGAAG 

CTCGGACATC 

TCCCCTGACC 

CTGCAGCTAC 

CTGTCACCCC 

CTTCGTCTGG 

GTACCCCCT6 . 

CCACCACGAG 

6ACCGT6TTC 

C6TAGCCTTC 

GGCCGGGGGC 

CAGGAGGCAO 

GCCCAACCAG 

CTACTTCCGG 

GGTCATCAAC 

GGTGCTGTGC 

TGCGCACTCC 

GCGCCAGTCC 

CGAGCCCCAG 

GAAACCAGCC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



Seq ID No: 151 Protein Beouence! 
Protein Accession #: NP 001499.1 



65 



70 



75 



MASPSLPGSD 
KGYIiQKEVTD 
ATI-LHVLTLS 
VNVFSHRGIiT 
MCWNMMQVIiM 
ZRRIMAAAKP 
CRIjSIiQHANH 
SKSQSIiSIiES 



11 
I 

CSQIZDHSHV 
HMVSIiACSDI 
FERYIAICHP 
CNRSSTRHHE 
KSQKGSLAGG 
KHDWTRSYFR 
BKRLRVHAHS 
LBPNSGAKPA 



21 
I 

PEFEVATWIK 
IiVFLZGMPMC 
FRYKAVSGPC 
QPBTSNNSZC 
TRPPQLRKSE 
AYMZIiIiPFSE 
TTDSARFVQR 
NSAAENGFQE 



31 

I 

ITLZIiVYLII 
FYSIZWNPIiT 
QVKIiliXGFVW 
TNLSSRWTVF 
SEESRTARRQ 
TFFYIjSSVXN 
PLI.F71SRRQS 
HEV 



41 
I 

FVMGLLGNSV 
TSSYTLSCKL 
VTSAXiVAIiPL 
QSSXFGAFW 
TIIFLRLIW 
PIiLYTVSSQQ 
SARRTBKZFL 



51 
I 

TIRVTQVLQK 
HTFIiFBACSY 
LFAM6TEYPL 
YIiWLLSVAP 
TIiAVCNMPNQ 
FRRVFVQVLC 
STFQSBAEPQ 



60 
120 
180 
240 
300 
360 
420 



Seq ZD NO: 152 DNA sequence 



253 



BNSDOCID: <WO 0207G492A2_I_> 



wo 02/079492 



PCT/US02/04915 



Nucleic Acid Accession #: none found 

Coding sequence: 3-65 (underlined sequences correspond to start and stop codons) 

5 1 11 21 31 . 41 51 

I I I I I I 

TTATTATTTT GTGTAAACTA TATTCTGCTT ATAGAGAGTC TCTGAGACTA AAATTGACAA 60 

CTTGAAAAGT ATTCCAAGGA ATATTATGAA AATAGGGCAA CATGGACTGT TTAAGATCTC 120 

CATGTAATTG AAATTCATGC AAGGAAACAA CTCATAGAAA AGATAAATAT GGATGCCCTT 180 

10 CACATGTTAT CAACCTCGTA ACTTTTGGTG CTTGCTGAAT CAGTCCATGA AAAGCTACAG 240 

CCCGCTCTTT GGGAATGCTA CATACCCATT TCTGGTATTT AAAAAATATC TAGGAGGAGC 300 

TAAATGACAA AACACAGCAG TGTTTTGAG6 GAGAAAGGAC CATCATTTAT AATGCTCTGT 360 

ACATACTACC AGAGCTGCTT GGAAAATTAA AGGCCACTTG TGGCTTTTTC CTACCAACTG 420 

ATACGTTTAA ATTTGCCCTA GGATTSAGCT AACAGCAAAA AAAAAAAAAA AAAAAAAARA 480 

15 GAGAGAAAGA AAGGAGKAAA CAGTGGTAAT AAAAAAATCC ATCTGTCTTC TTGCTATGTT 540 

AATATTAATA AATCATAATA TGACAAGACC CTCACTGAAT AAGAGTATTT TCAGTCATCA 600 

GAAGCCAGCT GTTGGTAGGC ATTAATGAGT TTAAAATTGT TCTCTVATTGA AAAAACATCA 660 

CACTATTTTG CCAATACCAA AGTAATTATA ATACTGTGTC CTCCTGTAAT TTTTTGAGAA 720 

GTGGTTATAA AGGGCATATT TACATAAATT CTACTTTATT CCTCAACTTC TTTGATGAAT 780 

20 GTAACCCAAT TTTACTTCTT TAAAAAGTCT CAATTCAAGC TGGATTAGCC AGCTCAGCAT 84 0 

AATCAACTAG ACAGTGGTTT GTTAAATTTA GCAGCATACT TCGTTCCCAT TCTAATTAAA 900 

GTCATGAGTT CTTGAATCCC AGAGAAATAA TGCTTAGGAA CTTCTCTCAA TCTGCTTGGC 960 

TTGGCCTAGA GAAGTGGCCA TTTTATCAAC AGGRAAAAAA AAAATTTTCT CTACTACAAC 1020 

CCCGTTGCCT TCTGAAAAAC AGCAAGTTAT TTCTTTATAT AATTATCATT TTATTATTTT 1080 

25 ATGGAAAATT AATTTATTAA TTAATAGCCT ATTATGTGTT CTCACTTGCT TCTCTAAGTA 114 0 

ATATTTTGAG ATAAAATGTT GAATAAAACC ATGGATTATA GAGAAAAGTC AAAATATATG 1200 
TGTAATATTT AATTATTTTA TAAGTTTTAT AATAAAGTAT TCCATTTCTT TATCTT 

30 Seq ID No: 153 Protein sequence: 
Protein Accession #: none found 

1 11 21 31 41 51 

35 I I I I I I 

IILCKLYSAY RESLRLKIiTT 

Seq ID NO: 154 DWA sequence 
Nucleic Acid Accession #: none found 
40 Coding sequence: 1-36 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

AC ^ I I I I I 

4^ CTGG ATGATA TGGAAGAAAT GGATGGGTTA AGGTAAAAGG CTGATCACAG ATGGGTTCCT 60 

CTCAAGGTTA AAATAGTTTA AGTGCCAGAA GAAZ^GGTGG GCACCAGCGA ATTAAGAACC 120 

ATCTTTGAAT GGTCCCCTTG GTTAAATACT TAACTTTTGT CATCAGTGTC TGCATTTATG 180 

AAATGAAGA6 GAATTCACTA ATATGCTACG TGATCTTTTG TTTGTCATGA AAAGAGTTAC 240 

TGTTGTGTA6 TTCTCTGTTC CAGGGCTGCC TTTGCTCCAC AAAGCACTGA GAAGCAGTGG 300 

50 CCCTGTACAA CCATACTGCC TCTCAACACT GTGTAATAGG CTAACACCGC CCAGOGAACC 360 
TTCCTGGGAG ATATAAAATA CATAGGTTTA GGCTGGCAAA AAAAAATiAAA AAA 



55 



60 



Seq ID No: 155 Protein sequence; 
Protein Accession #: none found 



1 11 21 31 41 51 

I I I I I I 

IiDDMEBMDGI> R 



Seq ID NO: 156 DNA sequence 
Nucleic Acid Accession #: NM 032961.1 

65 Coding sequence: 827-3949 (tinderlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

arx I I I 1 1 t 

70 CAGGCTCAGA GGCTGAAGCA GGAGGAAGGA AGGACTGGAA GGTU^AAAGAG ACAGGTTAGA 60 

GGGAAAGAGG CTTGGGAAGA AAACAGCA6A AAAGAAACTG CTCATTACAC TTACAGAGAG 120 

6CAA6TAACG 6TGGAGATGA GGACAGAG6G AACCAAGACT CTGAAAGACA AAAAATACAA 180 

ATAGAGC6AA AGAGGAAAAA AATGTCAAGA AGAACATCCA TCCGGAGAAA T6AAGAGAAT 240 

GAAAGTTTTA AACTGCAGAG CCGTTCTGTG CTTTTCCXSGC ACAAAATTAT ATCGCTGATT 300 

75 TTAAGCCCTT TT6CATTTGC CAGCCGTTGA CATTAAGA<3G CATGTTTAAC C3GTGCCAACA 360 

GCATCTCCTT TTCCTTCTCC TCTTCCTCTT CTTCTTCTTC CTCX:TCCTCC TCCTCTTTTT 420 



254 



JNSDOCID: <WO 02079402A2_L> 



wo 02/079492 



PCT/US02/04915 



CCTCCTCCTC GTTCTCCTCC CATCAGCAAG 
GAAATTTCCT CTTTGGGATT TGCCAGCGCC 
TGTATTATTG TTATTTTATT AATTAGTCAG 
CCTGTCACCC TTCCTGTGCT AAGATTTAAA 
5 AAAATGAAGC AAAAGGAGTA AGATTTTTAA 
CGCACTTTTA TTTGTATTTT TTCAGATTTT 
TGGGTGGCTG ACTGGCTGCG GGAAGCTACT 
ATTGTTTGCC TTGCTCTGGA TGGTGGAAGG 
GGAGGAGCAG GAACATGGCA CTTTCGTGGG 
10 TACAAAACTT TCGGCTCGCG GGTTTCAGAC 
CCTCAACCTG GAGACAGGGG TGCTGTACGT 
CAAACAGAGC CCCTCCTGTG TCCTGCACCT 
GTTCCAG6TG GAGATCGAGG TGCTGGACAT 
AGACCTGACQ GTGG/U^TCT CTGAGAGCGC 
15 CGCATTCGAC CCAGAC6TGG GCACCAACTC 
CTACTTCTCC CTGGAC6TGC AGACCCAGG6 
GGAGAAGCCA CTGGACCGAG AGCAGCAAGC 
CQGAGGAGGT GG66GAGGAG TAGGAGAAGG 
CCCCCAGCAG CAGCX3CACCG GCACGGCCCT 
20 CAATGTGCCC GCTTTCGACC AACCCGTCTA 
AGGCACTCTC GTGATCCAGC TCAACGCCAC 
CGTGTACTCC TTCAGCAGCC ACATTTCGCC 
GCGCACTGGC AGACTGGAGG TAAGCGGCGA 
AGTGTACGTG CAAGCCAAGG ACCTGGGCCC 
25 AGTGCGAGTA CTGGATGCTA ATGACAACGC 
AGCGGTGAGT GAGGGCGCGG CGCCCGGCAC 
CGACTCAGAG GAGAATGGGC AGGTGCAGTG 
CAAGTCTTCC TTTAAGAATT ACTACACCAT 
GGGGGACTCC TACACCCTGA CTGTAGTGGC 
30 CAGTAAGTCG ATCCAGGTAC AAGTGTCGGA 
GCCGGTCTAC GACGTGTATG TGACTGAAAA 
GAGCGCCACC GACCGGGATG AGGGCGCCAA 
CCAGATCCAG GGCATGAGCG TCTTCACCTA 
GTACGCCCTG CGCTCCTTCG ACTATGAGCA 
35 CCGGGACGCT GGCAGCCCCC AGGCGCTGGC 
66ATCAAAAT GACAACGCCC CTGCCATCGT 
AGCGC6TGA6 GTGCT6CCCC 6CTC6GCG6A 
CGTGGACGCG GACGACGGCG AGAACGCCCG 
AATGAACCTC TTTCGCATGG ACTGGCGCAC 
40 GGCCAAGCGC 6ACCCCCAGC GGCCTTATGA 
GCCGCCCCTT TCCTCCACCG CCACCCTGGT 
CCAG66CGG6 GGC6GGA6CG GAGGCGGAGG 
TG6C6GCX3GG GAAACCTCGC TAGACCTCAC 
GTCCTTCATC TTCCTGCTGG CCATGATCGT 
45 GCTCAACATC TATACTTGTC TGGCCAGCGA 
CGGAGGTTCG ACCTGCTGTG GCCGCCAAGC 
AGACATCATG CTGGTGCAGA GCTCCAATGT 
GGAGTCCGGG GGCTTTGGCT CCCACCTVCCA 
GACCCCTGAG TCCGCCAAGA CCGACCTGAT 
50 TACGGACACT GAGCACAACC CCTGCGGGGC 
TGATATCATC TCCAACGGAA GCATTTTGTC 
CAGCTATCTA GTTGACAGAC CTCGCCGAGT 
AGTAAGCTCT AAGGACAGTG GTCATGGAGA 
CACCAACCGT 6CCCAGTCAG CTGGTATGGA 
55 AGCTCTG6GC CACTCAGATC GGTGCTGGAT 
GGCT6CTGAT TATCGCAGCA ATCTGCATGT 
GGTGTTTGAA ACTCCAGAAG CCCAGCCTG6 
AGAGAAG6CC CTTCACA6CA CTCTGGAGAG 
GCGA6CGCCT TACAAACCAC CATATTTGAC 
60 GGACTTACCT GAAGCA6CAT GATTTGCACA 
ACTTCATTAT CTTGGCCATC CAGTTAGTCA 
GTCATCATGG CCAATTATAG GACCTAATTG 
TGTGCAGAAC TGTAGAAACT TTAGAGGCAA 
TGTTTACAGC ACTATCTATC TTTCTCT CT C 
65 ATTCACCACG AGAAGCCAGT CATAAAGATA 
CACTGTTTTA AACTTGACTG TTTTATATTA 
TTCCAACTTT ACAAGAGAAA TTGTGATTAT 
TTGTATTCTG AAGACCCACA AAATATCAAA 
AAGTGTTTAC TGTACTATTT CAAAGCTTCT 
70 TATAATTTTC CTAAAATGTG GTACAACTCA 
CATCATACAA TAAAATAAAA GGTAATTCAG 
TCATTAATAG TTTTCTCCCA ATTTCCATAT 
AGAAAATGAT GCTCTAAGCT ACAAAATTTT 
AAAGATGTAG CTATTGATGT TATCAGACAG 
75 ACAATCTGCA TAAGTCTGAT TCTATTTCTA 
TTATAAAGAA TCGATJ\AATT CACCTGTATT 



AAGACAAACC GAGGACAGTC TTGAAATATC 480 

AAGACTGTCG GT^TAAAGGA CGCTGACTAT 540 

TGGAAAGATT ACAGATGAGG AAAGGGGACG 600 

AAAAAATGAG GCTGGATTGC GGGAAGCTCT 660 

AGACAGAAAG CCACAGGAGC CCCCACGTAG 720 

TTTTT6TTTC GTGGTGGTGG GGGAGGTGAT 780 

TCCTTTCCTT TTGGAG ATGA TTGTGCTATT 84 0 

AGTCTTTTCC CAGCTTCACT ACACGGTACA 900 

GAATATCGCT GAAGATCTGG GTCTGGACAT 960 

GGTGCCCAAC TCAAGGACCC CTTACTTAGA 1020 

GAACGAGAAA ATAGACCGCG AACAAATCTG 1080 

GGAGGTCTTT CTGGAGAACC CCCTGGAGCT 1140 

TAATGACAAC CCCCCCTCTT TCCCGGAGCC 1200 

CACGCCAGGC ACTCGCTTCC CCTTGGAGAG 1260 

CTTGCGCGAC TACGAGATCA CCCCCT^CAG 1320 

GGAT6GCAAC CGATTCGCTG AGCTGGTGCT 1380 

6GTGCACCGC TAC6TGCTGA CCGCGGTG6A 1440 

AGGGGGAGGT GGCGG6GGAG CAGGCCTGCC 1500 

ACTCACCATC CGAGTGCTGQ ACTCCAATGA 1560 

CACTGTGTCC CTACCAGAGA ACTCTCCCCC 1620 

CGACCCGGAC GAGGGCCAGA ACGGTGAGGT 1660 

CCGGGCGCGG GAGCTTTTCG GACTCTCX3CC 1740 

GTTGGACTAT GAAGA6AGCC CAGTGTACCA 1800 

CAACGCCX3TG CCTGCGCACT GCTOVGGTGCT 1860 

GCCAGAGATC AGCTTCAGCA CCGTGAAGGA 1920 

TGTGGTGGCC CTTTTCAGCG TGACTGACCG 1980 

CGAGCTACTG GGAGACGTGC CTTTCCGCCT 2040 

CGTTACCGAA GCCCCCCTGG ACCGAGAGGC 2100 

TCGGGACCGG GGCGAGCCTG CGCTCTCCAC 2160 

TGTGJ\ACGAC AACGCGCCGC GTTTCAGCCA 2220 

CAACGTGCCT GGCGCCTACA TCTACGCGGT 2280 

CGCCCAGCTT GCCTACTCTA TCCTCGAGTG 234 0 

CGTTTCTATC AACTCTGAGA ACGGCTACTT 24 00 

GCTGAAGGAC TTCAGTTTTC AGGTGGAAGC 2460 

TGGTAACGCC ACTGTCAACA TCCTCATAGT 2520 

GGCGCCTCTA CCAGGGCGCA ACGGGACTCC 2580 

GCCGGGTTAC CTGCTCACCC GCGTGGCCGC 2640 

GCTCACTTAC AGCATCGTGC 6TGGCAAC6A 2700 

CGGGGA6CTG CGCACAGCAC GCCGAGTCCC 2760 

GCTGGTGATC GAGGTGCGCG ACCATGGGCA 2820 

GGTTCAGCTG GTGGATGGC6 CCGTGGAGCC 2880 

GTCAGGAGAG CACCAGCGCC CCA6TCGCTC 2940 

CCTCATCCTC ATCATCGCGT TGGGCTCGGT 3000 

GCTGGCCGTG CGTT6CCAAA AAGAGAAGAA 3060 

TTGCTGCCTC TGCTGCTGCT GCTGCGGTGG 3120 

CCGGGCGCGG AAGTVAGAAAC TCAGCAAGTC 3180 

ACCCAGTAAC CCGGCCCAGG TGCCGATAGA 3240 

CAACCAGAAT TACTGCTATC AGGTATGCCT 3300 

GTTTCTTAAG CCCTGCAGCC CTTCGCGGAG 3360 

CATCGTCACC GGTTACACCG ACCAGCAGCC 34 20 

CAACGAGACT AAACACCAGC GAGCAGAGCT 3480 

TAACAGTTCT GCATTCCAGG AAGCCGACAT 3540 

CAGTGAACAG GGAGATAGTG ATCATGATGC 3600 

TCTCTTCTCC AATTGCACTG AGGAATGTAA 3660 

GCCTTCTTTT GTCCCTTCTG ATGGACGCCA 3720 

TCCTGGCATG GACTCTGTTC CAGACACTGA 3780 

GGCAGAGCGG TCCTTTTCCA CCTTTGGCAA 3 840 

GAAGGAGCTG GATGGACTGC TGACTAATAC 3900 

ACGGAAAAGG ATATG CTAQT CAATTCTACA 3960 

AAGTCGACCA ACTUUUkGCAT CAACTTTTCA 4020 

TGT6TAACT6 AGTATTAGAT TTCGGATGGA 4080 

CTCTCAGCAG GCCTGAGAAA TGAGTTGAAA 4140 

CAGATTTTGC CTCCCCGATC AGT6TGTGCC 4200 

CAAATGTCAC TGAGCCCTTT AGATGTTTAT 4260 

AAG6AAATTT GTGCATTATA AATGCAATAT 4320 

TTTTTGTGT6 ATCAAGTGTT CCGCAAGCTA 4380 

GTTCTTTTCA CCTGTGGGTT ATAAAAAATG 4440 

GACATTCTGT AGTTTATACA CCGTGTTGCA 4500 

AAATAAATAT AAAATATATA TATTATATTA 4560 

GTTGGTTTTT AAATGGATGC ATACAGTCCA 4620 

GGTCCCAAAG ACAAACTTAC TAAGAAAA7A 4680 

CTTACTCAAC CGTGTTTTTC CTTGTTTAAA 474 0 

GTCAAAAACT CATATTGAAT TTTCAATGCC 4800 

AGCACTGACT ATGTACTATC AAACTATCTA 4 860 

TGACTTTGAA TTTAGAATCA CTTTUUIGCTT 4920 

T6T1X3TTAGA AAAAAACT66 6TGTCTGTAC 4980 



255 
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ATTTTGTGGT 
ACTCACACAG 
TATAATTCTA 
AAAATATGAA 
TTCAGTATTC 
AATTCTTCAT 
TTAATTATTA 



GTAAAATATG 
AATTTTATTT 
TATGAATATA 
AGCTCTAAAT 
CATGTAAAAA 
GAGTCAGCCT 
AATTTAGTAA 



TAATTGAAGA 
TACATAGTTT 
TAGAGATATA 
TTAAAATAAA 
TTTTATAGCT 
TC7y\AAGTTA 
GACGCAAAAA 



TTACTATTTT 
TGTGACTTAA 
G7UUVCATCTG 
TTTAGAGATA 
TAAATGTAGT 
AGCTTGCCTT 
AAAAAAAAAA 



AAGAAGTCAT 
TTACACATGA 
AACTGGTAAA 
GAATCAtGGT 
CAGTGTTTGA 
TTACTTTTAT 
AAAA 



CAGTCATATC 5040 

ATATAT^AATC 5100 

GAATAACTAT 5160 

ACATTATTGT 5220 

TTAATGATUU^ 5280 

GTCAACAATA 534 0 



10 



Seq ID No: 157 Protein sequence ; 
Protein Accession #: MP 116586.1 



15 



20 



25 



30 



MIVLLLPALL 
TPYLDLNLET 
SPPEPDLTVE 
AELVLEKPLD 
LDSNDNVPAF 
PGLSPRTGRli 
STVKEAVSEG 
LDREAGDSYT 
YIYAVSATDR 
PQVEARDAGS 
TRVAAVDADD 
RDHGQPPLSS 
ALGSVSFIFL 
KLSKSDIMLV 
SPSRSTDTEH 
QEADIVSSKD 
SDGRQ7VADYR 
1.LTNTRAPYK 



11 
I 

WMVEGVPSQL 
GVLYVNEKID 
ISESATPGTR 
REQQAVHRYV 
DQPVYTVSIjP 
EVSGELDYEE 
AAPGTWAIiF 
LTWARDRGE 
DEGANAQIiAY 
PQAIiAGNATV 
GBNARIiTYSI 
TATLWQLVD 
LAMIVIiAVRC 
QSSNVPSNPA 
NPCGAIVTGY 
SGHGDSEQGD 
SNLHVPGMDS 
PPYLTRKRIC 



21 
I 

HYTVQEBQBH 
RBQICKQSPS 
FPLBSAFDPD 
LTAVD6GGGG 
ENSPPGTLVl 
SPVYQVYVQA 
SVTDRDSEEN 
PALSTSKSIQ 
SILECQIQGM 
NILIVDQNDN 
VRGNEMNLiFR 
GAVEPQGGGG 
QKEKKLNIYT 
QVPIEESGGP 
TDQQPDIISN 
SDHDATNRAQ 
VPDTEVFETP 



31 
I 

GTFVGNIABD 
CVIiHIiEVFLE 
VGTNSLRDYE 
6VGE6GGG6G 
QLNATDPDEG 
KDIiGPNAVPA 
GQVQCELLGD 
VQVSDVNDNA 
SVFTYVSINS 
APAIVAPLPG 
MDWRTGELRT 
SGGGGSGEHQ 
CLASDCCLCC 
GSHHHNQNYC 
GSILSNETKH 
SAGMDIiFSNC 
BAQP6AERSF 



41 

I 

LGLDZTKLSA 
NPLBLFQVBI 
ITPNSYFSIiD 
6A6LPPQQQR 
QMGEWYSFS 
HCKVLVRVLD 
VPPRIiKSSFK 
PRFSQPVYDV 
ENGYLYAIiRS 
RNGTPAREVL 
ARRVPAKRDP 
RPSRSGGGET 
CCCGGGGSTC 
YQVCLTPESA 
QRAELSYIiVD 
TEECKAXiGHS 
STFGKEKAIiH 



51 
I 

RGFQTVPNSR 
BVLDINDNPP 
VQTQGD6NRP 
TGTALLTIRV 
SHISPRAREL 
ANDNAPEISF 
NYYTIVTEAP 
YVTENNVPGA 
FDYEQLKDFS 
PRSTlEPGYLIj 
QRPYEIiVIEV 
SLDLTLILII 
CGRQARARKK 
KTDIiMFLKPC 
RPRRVNSSAF 
DRCWMPSFVP 
STLERKBIjDG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 



Seq ID NO: 158 DNA sequence 
35 Nucleic Acid Accession #: NM_022159.1 

Coding sequence: 70-1690 (underlined sequences correspond to start and stop codons) 



40 



11 21 31 41 51 



GTGAAATTTA AACTCCAGTC CTGTGGCGAA AATGCTAATT GCACTAACAC AGAAG6AAGT 60 

TATTATTGT A TG TGTGTACC TGGCTTCAGA TCCAGCAGTA ACCAAGACAG GTTTATCACT ' 120 

AATGATGGAA CCGTCTGTAT AGAAAATGTG AATGCAAACT GCCATTTAGA TAATGTCTGT 180 

ATAGCTGCAA ATATTAATAA AACTTTAACA AAAATCAGAT CCATAAAAGA ACCTGTGGCT 24 0 

45 TTGCTACT^G AAGTCTATAG AAATTCTGTG ACAGATCTTT CACCAACAGA TATAATTACA 300 

TATATAGAAA TATTAGCTGA ATCATCTTCA TTACTAGGTT ACAAGAACAA CACTATCTCA 360 

GCCAAGGACA CCCTTTCTAA CTCAACTCTT ACTGAATTTG TAAAAACCGT GAATAATTTT 420 

GTTCAAAGGG ATACATTTGT AGTTTGGGAC AAGTTATCTG TGAATCATAG GAGAACACAT 4 80 

CTTACAAAAC TCATGCACAC TGTTGAACAA GCTACTTTAA GGATATCCCA GAGCTTCCAA 540 

50 AAGACCACAG AGTTTGATAC AT^TTCAACG GATATAGCTC TCAAAGTTTT CTTTTTTGAT 600 

TCATATAACA T6AAACATAT TCATCCTCAT ATGAATATGG ATGGAGACTA CATAAATATA 660 

TTTCCAAAGA GAAAAGCTGC ATATGATTCA AATGGCAATG TTGCAGTTGC ATTTTTATAT 720 

TATAAGAGTA TTGGTCCTTT GCTTTCATCA TCTGACAACT TCTTATTGAA ACCTCAAAAT 780 

TATGATAATT CTGAAGAGGA GGAAAGAGTC ATATCTTCAG TAATTTCA6T CTCAATGAGC 840 

55 TCAAACCCAC CCACATTATA T6AACTTGAA AAAATAACAT TTACATTAAG TCATCGAAAG 900 

GTCACA6ATA GGTATAGGAG TCTATGTGCA TTTTGGAATT ACTCACCTGA TACCATGAAT 960 

GGCA6CTG6T CTTCAGAGGG CTGT6AGCTG ACATACTCAA ATGA6ACCCA CACCTCAT6C 1020 

CGCTGTAATC ACCTGACACA TTTTGCAATT TTGATGTCCT CTGGTCCTTC CATTGGTATT 1080 

AAAGATTATA ATATTCTTAC AAGGATCACT CAACTAGGAA TAATTATTTC ACTGATTTGT 1140 

60 CTTGCCATAT GCATTTTTAC CTTCTGGTTC TTCA6TGAAA TTCAAAGCAC CAGGACAACA 1200 

ATTCACAAAA ATCTTTGCTG TAGCCTATTT CTTGCTGAAC TTGTTTTTCT TGTTGGGATC 1260 

AATACAAATA CTAATAAGCT CTTCTGTTCA ATCATTGCCG GACTGCTACA CTACTTCTTT 1320 

TTAGCTGCTT TTGCATGGAT GTGCATTGAA GGCATACATC TCTATCTCAT TGTTGTGGGT 1380 

GTCATCTACA ACAAGGGATT TTTGCACAAG AATTTTTATA TCTTTGGCTA TCTAAGCCCA 144 0 

65 GCCGTGGTAG TTGGATTTTC GGCAGCACTA GGATACAGAT ATTATGGCAC AACCAAAGTA 1500 

TGTTGGCTTA GCACCGAAAA CAACTTTATT TGGAGTTTTA TAGGACCAGC ATGCCTAATC 1560 

ATTCTTGTTA ATCTCTTGGC TTTTGGAGTC ATCATATACA AAGTTTTTCG TCACACTGCA 1620 

GGGTTG7UUVC CAGAAGTTAG TTGCTTTGAG AACATAAGGT CTTGTGCAAG AGGAGCCCTC 1680 

GCTCTTCTGT TCCTTCTCGG CACCACCTGG ATCTTTGGGG TTCTCCATGT TGTGCACGCA 1740 

70 TCAGTGGTTA CAGCTTACCT CTTCACAGTC AGCAATGCTT TCCAGGGGAT GTTCATTTTT 1800 

TTATTCCTGT GTGTTTTATC TAGAAAGATT CAAGAAGAAT ATTACAGATT GTTCAAAAAT 1860 

GTCCCCTGTT GTTTTGGATG TTTAAGGTAA ACATAGAGAA TGGTGGATAA TTACAACTGC 1920 

ACAAAAATAA AAATTCCAAG CTGTGGATGA CCAATGTATA AAAATGACTC ATCAAATTAT 1980 

CCAATTATTA ACTACTAGAC AAAAAGTATT TTAAATCAGT TTTTCTGTTT ATGCTATAGG 2040 

75 AACTGTAGAT AATAAGGTAA AATTATGTAT CATATAGATA TACTATGTTT TTCTATGTGA 2100 

AATAGTTCTG TCAAAAATAG TATTGCAGAT ATTTGGAAAG TAATTGGTTT CTCAGGAGTG 2160 



256 
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ATATCACTGC ACCCAAGGAA AGATTTTCTT TCTAACACGA GAAGTATATG AATGTCCTGA 2220 

AGGAAACCAC TGGCTTGATA TTTCTGTGAC TCGTGTTGCC TTTGA7VACTA GTCCCCTACC 2280 

ACCTCGGTAA TGAGCTCCAT TACAGAAAGT GGAACATAAG AGAATGAAGG GGCAGAATAT 2340 

CAAACAGTGA AAAGGGAATG ATAAGATGTA TTTTGAATGA ACTGTTTTTT C TGTA GACTA 2400 

5 GCTGAGAAAT TGTTGACATA AAATAAAGAA TTGAAGAAAC ACATTTTACC ATTTTGTGAA 2460 

TTGTTCTGAA CTTAAATGTC CACTAAAACA ACTTAGACTT CTGTTTGCTA AATCTGTTTC 2520 
TTTTTCTAAT ATTCTAAAA 

Seq ID No: 159 Protein sequence t 
10 Protein Accession NP_071442.1 



1 11 21 31 41 51 

I I I I I I 

15 MCVPGFRSSS NQDRFITNDG TVCIENVNAN CHLDNVCIAA NINKTLTKIR SIKEPVAIiIiQ 60 

EVYRNSVTDIj SPTDIITYIE IliAESSSIiLG YKNNTISAKD TLSNSTLTEP VKTVNNFVQR 120 

DTFWWDKLS VNHRRTHLTK LMHTVEQATL RISQSFQKTT EPDTNSTDIA LKVFFFDSYN 180 

MKHIHPHMNM DGDYINIFPK RKAAYDSNGN VAVAFLYYKS IGPIiI*SSSDN FLLKPQNYDN 240 

SEEEERVISS VISVSMSSNP PTLYEIjEKIT FTLSHRKVTD RYRSLCAFWN YSPDTMNGSW 300 

20 SSEGCEIiTYS NETHTSCRCN HLTHFAILMS SGPSIGIKDY NILTRITQLG IIISLICLAI 360 

CIFTFWFFSE IQSTRTTIHK NLCCSLFIiAE LVFLVGINTN TNKIiFCSIIA GLIiHYFFLAA 420 

FAWMCIEGIH LYLIWGVIY NKGFLHKNFY IFGYIiSPAW VGFSAAIjGYR YYGTTKVCWI, 480 

STENNFIWSF IGPACLIILV NLIiAFGVIIY KVFRHTAGIiK PEVSCFENIR SCARGAIALIi 54 0 

PLIjGTTWIFG VLHWHASW TAYIiFTVSNA FQGMFIFI»PL CVIiSRKIQEE YYRLFKNVPC 600 

25 cfgclr 



Seq ID NO: 160 DNA sequence 
Nucleic Acid Accession none found 
30 Coding sequence; 1-216 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

35 TGTC TGCTTA TGCGGTGGCT CGCTGCTCAG JUkCAGGATGG CAGAGATGAG CACCACCATC 60 

AAAAACTCAA GGACCAGTGC TGTGGGTCCA GTCATCTGTT TCATGGAATT CACCAGTCTG 120 

GTATCTTCAA AATCCAGAAG GATGATGGCA GATGGCAGGA AGGAGGAAGA GGGTAATCTG 180 

GAAGAGTTTC CTGACCTACT CTGCTGCTGT GAT TAA ACAA CCACCAGGAA ATTTTGATGA 240 

CACTGTTCTC CTGAGCTCCT CCCTTTCCTC GGGGAAGAAA AGCATTGAAA CTACAAAAAT 300 

40 AAAGTGTTAT TTGGCTGGAG TGAGGTCTCA TGTCTGCTTA TGCGGTGGCT CGCTGCTCAG 3 60 

AACAGGGAAC CATTGGAGAT ACTCATTACT CTTTGAAGGC TTACAGTGGA ATGAATTCAA 420 

ATACGACTTA TTTGAGGAAT TGAAGTTGAC TTTATGGAGC TGATAAGAAT CTTCTTGGAG 4 80 

AAAAAAAGAC TGGTACTTCT GAATTAACCA AAATCACAGT ATTCTGAAGA TGATTCTACA 540 

AAGCCTGCTO TTTCTACAAA GGCTGCTGAT GATTTCTACA AAGCCTGCTG TAGTGTTGCT 600 

45 GTGGCCTCTG CTTAAAAAAG TAGAAAACAC ATTGATGCAG CATGTTCACC CCAACCTCCC 660 

TGCCTAAA6G CTCAGGGACC ATCTTGGAAG AGGAAGGCGC GTGAGATTGT AAGA6CCGAA 720 

TTAGGGGGAT GGAGTGTGGA GAATAAGGAC ACTTCATCTT GGATG CTCA C CTGCCAAATT 780 

GACTTCTGAT GAAAGCCAGC TCCAGAAATG TGCCTACAGT TACTACTTTC ACCTAAACCC 840 

TGCCCTTAGT CAAATCCTTC TCTTCTTCTA AGCAATCAAC TTCAATTCCT TGTATAACCC 900 

50 ACAGTATAAA AGGGCTTTTA TACCATTCTA TCCTATTGCA TGTAAGCCTT GGGTCTGGGA 960 

GGTAACAGTG TGGGATTCCA CCATCTCATC TCCCTGCCAC CCAAACATGC CTGCTCTTCT 1020 
TTAAGCAATA TTAAATGTTT GTACTTCA 

Seq ID No: 161 Protein sequencer 
55 Protein Accession #: none fo\md 



1 11 21 31 41 51 

I I I 1 I I 

60 CIiIiMRWIJWlQ NRMAEMSTTI KNSRTSAVGP VICFMEFTSL VSSKSRRMMA DGRKEEEGNIi 60 

EEFPDLLCCC D 

Seq ID NO: 162 DNA sequence 
Nucleic Acid Accession #: none found 
65 Coding sequence: 1-159 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

70 GAGA CCCTCC AGAGGCAGGG CCCAGGATTG AAGAGGGAAG CCCTGCTCCA CACGTG TTC A 60 

TCAGGAAGGA CCCACAGACT GCTGCTCCTG GAGGCCTCTC GGTTTAT6GA TGTGTGTTTG 120 

TTCCATAAAC CCTCAGAG6G TCACCTGGAG ACCCGCTAAA ATGCA6GTTC TTGGGCCACA 180 

TCCTAQACCT TCTGACCGAC CCAGGGAGTG OGGCCCAGGA AGCTGCATTT GACAGATATC 240 

CCCGTGTGAT CATCATGCAC ACAGGAGTGA GAGAACCAGT GTTCTCCCCG GGCAGAAGGQ 300 

75 AAGCTCGT6T GCAGGACACC TCACACCTCC TTTCCCATTC CCCTGCCAGG CTCTCCCT6C 360 

TGACATTCTT TTTGCGGGAG AGCTGTGAAT TCTGAAGATT AGGTTGCTTC TCACCCCAAG 420 



257 



BNSDOCID: <WO 02070492A2J_> 



wo 02/079492 



PCT/US02/04915 



CTCCAGAAGT CCAGGCTGAG CCAAACCAAG CTTCAAGTTG TGCCTGGACT TGGAGAACCA 4 80 

GGAGGTGAGG GGACTGACTA CTTGAAGATC ACATGGAGGA GGAGTCTGAT CCAGGCCCAG 540 

GCACCAAGGA AAGGCCATGC AAGGACACAG GGAGAAGGGC AGCTGTCTGT AAGCCAGAAA 600 

GAGCCTTCAC TAGAAACCAA ATCAGCCAGA ACCTTCATCT TGGACTTTCC AGCCTTCAGA 660 
5 GATGTGAAAA AATAAATTTC TGTTGATTAA CCTAAAAAA 



10 



Seq ID No: 163 Protein aeouence: 
Protein Accession # : none found 



11 



21 



31 



41 



51 



15 



20 



25 



30 



35 



40 



45 



50 



ETLQRQGPGL KRETU^IiHTCS SGRTHRLLLL EASRFMDVCIi FHKPSEGHLE TR 

Seq ID NO: 164 DNA sequence 

Nucleic Acid Accession #: NM_020241.1 

Coding seqnience: 4-1557 (underlined sequences correspond to start and stop codons) 



GC CATG CAGA 
CTGGGGGG.CG 
GACTACCTGA 
GAAGGTGCTG 
GGGGACAGGG 
TACCAGAGGA 
GGCAAACAGG 
ACGCTCTTTG 
ACCCTGCAGC 
CACGCCAATG 
CTAGCCATTG 
AAACATGACT 
CATGTCTACT 
GTGTCCCGCG 
AAGCAGTGGA 
TTCTACTTCA 
GTCCTGGCCG 
GACCTGACAC 
TCCATCTGGA 
GCCCCCGGGA 
AAGACCCACC 
CGGACCCTGA 
GGCAACCAGA 
CGGCCCAATG 
AGGGTGTGTG 
CX3ACGCTGGG 
CCCCCCACTC 
AGGGCCTGCC 
CGGCGAAGGT 
GCCACCCGTC 
G 



11 
I 

CCCC6CGAGC 
CCCACGGCCT 
ACCACTATCC 
ACGACCTCAA 
ACAACCTCTA 
AGCTGACCTG 
AGGGCGAGTG 
TGTGCGGTTC 
CCGTCGGAGA 
TTGCCCTCTT 
ATGCTGTCAT 
CCAAGTGGTT 
TCTTCTTCCG 
TGGCCCGAGT 
CGTCCTTCCT 
ACGTGCT6CA 
TTTTTTCCAC 
AGGTGGCAGC 
CGCCGGTGCC 
TGCAGTACAA 
CTCTGATGGA 
TGAGGCACCA 
CCGTTGTCTT 
CCAGCACCTC 
TCCACGAGCG 
GCTTCCAGAA 
TGCAGAGGGA 
CGGAAGTCAC 
GGGTGGGGCC 
CCCTTGTGAC 



21 
I 

GTCCCCTCCC 
CTTTCCTGAG 
CGTGTTTGTG 
CATCCAGCGA 
CCGCGTAGAG 
GAGATCTAAC 
TCGAAACTTC 
CAACGCCTTC 
CAACATCAGC 
CTCTGACGGG 
CTACCGCAGC 
CAAA6AGCCT 
GGA6ATTGCG 
GTGCAAGAAC 
GAA6GCGCGG 
GGCTGTCACG 
GCCCAGCAAC 
TGTGTTTGAA 
GGAGGATCAG 
TGCCTCCAGC 
CGAAGCGGTG 
GCTGACTCGA 
CCTGGGTTCT 
AGGGACGTCT 
ACGATCGTGG 
GGCCCGGGGG 
AGCGGGGACA 
ATCGGCAGCA 
CCTCTGTAAA 
CTCCCCCCTC 



31 

I 

CGCCCGGCCC 
GAGCCGCCGC 
GGCAGCX3GGC 
GTCCTGCGGG 
TTGGAGCCCC 
CCCAGCGACA 
GTAAAGGTGC 
AACCCGGTGT 
GGTATGGCCC 
ATGCTCTTCA 
CTCGGGGACA 
TACTTTGTCC 
ATGGAGTTTA 
6ACGTGG6A6 
CTCAACTGCT 
GGCGTG6TCA 
AGCATCCCTG 
GGCCGCTTCC 
GTGCCTCGAC 
GCCTTGCCGG 
CCCTCGCTGG 
GTGGCTGTGG 
GAGGCGGGGA 
GGGCGTGTGT 
TGGCCCCAGC 
CCTCCGAGGT 
ATGCCGGGGT 
GCTGTCTAAA 
TACGGCCCCA 
TGACCTCCAG 



41 
I 

TCCTGCTTCT 
CGCTTAGCGT 
CCGGACGCCT 
TCAACAGGAC 
CCACGTCCAC 
TAAACGTGTG 
TGCTCCTTCG 
GCGCCAACTA 
GCTGCCCGTA 
CAGCTACTGT 
GGCCCACCCT 
ATGCGGTGGA 
ACTACCTGGA 
GCTCCCCCCG 
CTGTACCCGG 
GCCTCGG66G 
GCTCGGCTGT 
GAGAGCAGAA 
CCCGGCCCGG 
ATGACATCCT 
GCCATGCGCC 
ACGTGGGAGC 
CGGTCCTCAA 
GTCAAGTGGG 
GGCCTGGGCG 
GCCGGTTAGG 
TTCAGGCAGG 
GGGCTTGGGG 
GGGTGGTGAG 
CTGACCATGC 



51 
1 

GCTGCTGCTA 
GGCCCCCAGG 
GACCCCCGCA 
GCTGTTCATT 
GGAGCTGCGG 
TCGGATGAAG 
GGACGAGTCC 
CAGCATAGAC 
CGACCCCAAG 
TACCGACTTC 
GCGCACCGTG 
GTGGGGCAGC 
GAAGGTGCTG 
CGTGCTGGAG 
AGACTCCCAT 
CCG6CCCGTG 
CTGCGCCTTT 
GTCCCCCX3AG 
GTGCTGCGCA 
CAACTTTGTC 
CTGGATCCTG 
CGGCCCCTGG 
GTTCCTCGTC 
CCACGCGTGC 
TTGGCTGAGC 
AGTT TGA ACC 
AGACACGAGG 
GCCTGGGGGG 
AGAGTCCCAT 
ATGCCACGTG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 



55 



Seq ID No: 165 Protein sequence; 
Protein Accession #: NP 064626.1 



11 



21 



31 



41 



51 



60 



65 



70 



75 



MQTPRASPPR 
GADDLNIQRV 
KQEGECRNFV 
ANVAIiFSDGM 
VYFFFREIAM 
YFNVLQAVTG 
IWTPVPEDQV 
TLMRHQLTRV 
VCVHERRSWW 



PALLLIiLbliL 
LRVNRTLFIG 
KVLLIiRDEST 
LFTATVTDFL 
EFNYLEKVW 
WSIiGGRPW 
PRPRPGCCAA 
AVDVGAGPWG 
PQRPGRWLSR 



GGAHGLFPEB 
DRDNIiYRVEL 
LFVCGSNAFN 
AIDAVIYRSL 
SRVARVCKND 
LAVFSTPSNS 
PGMQYNASSA 
NQTWFIiGSE 
RW6FQKARGP 



PPPLSVAPRD 

EPPTSTEIiRY 
PVCANYSIDT 
GDRPTLRTVK 
VGGSPRVLEK 
IPGSAVCAFD 
LPDDILNPVK 
AGTVLKFLVR 
PRCRIiGV 



YUIHYPVFVG 
QRKIiTWRSNP 
LQPVGDNISG 
HDSKWFKEPY 
QWTSFLKARL 
IjTQVAAVFEG 
THPIiMDEAVP 
PNASTSGTSG 



SGPGRLTPAE 
SDINVCRMKG 
MARCPYDPKH 
FVHAVEWGSH 
NCSVPGDSHF 
RFREQKSPES 
SLGHAPWIIiR 
RVCQVGHACR 



60 
120 
180 
24 0 
300 
360 
420 
480 



Seq ID NO: 166 PNA seouence 

Nucleic Acid Accession #: NM_032108.1 

Coding sequence: 39-2705 (underlined sequences correspond to start and stop codons) 



11 



21 



31 



41 



51 



TCCGAGGCGT CACCTCCTCC TGTCGCCTGG CCCTCGC CAT G CAGACCCCG CGAGOGTCCC 
CTCCCCGCCC GGCCCTGCTG CTTCTGCTGC TGCTACTGGG GGGCGCCCAC GGCCTCTTTC 



60 
120 



258 



BNSDCCID: <WO 02079492A2_I_> 



wo 02/079492 PCT/US02/04915 



CT6A6GACCC GCCX3CC6CTT A6C6TG6CCC 
TTGTGGGCAG CG6GCCCGGA CGCCTGACCC 
AGCGAGTCCT GCG66TCAAC AGGACGCT6T 
TA6AGCTGGA GCCCCCCAC6 TCCACGGA6C 
5 CTAACCCCAG CGACATAAAC GTGTGTCGGA 
ACTTCGTAAA G6TGCTGCTC CTTCGGGACG 
CCTTCAACCC GGTGTGCGCC AACTACAGCA 
TCAGCGGTAT GGCCCGCTGC CCGTACGACC 
ACGGGATGCT CTTCACAGCT ACTGTTACCG 

10 GCAGCCTCGG GGACAGGCCC ACCCTGCGCA 
AGCCTTACTT TGTCCATGCG GTGGAGTGGG 
TTGCGATGGA GTTTAACTAC CTGGAGAAGG 
AGAACGACGT GGGAGGCTCC CCCCGCGTGC 
CGCGGCTCAA CTGCTCTGTA CCCGGAGACT 

15 TCACGGGCGT GGTCAGCCTC GGGGGCCGGC 
GCAACAGCAT CCCTGGCTCG GCTGTCTGCG 
TTGT^GGCCG CTTCCGAGAG CAGAAGTCCC 
ATCAGGTGCC TCGACCCCGG CCCGGGTGCT 
CCAGCGCCTT GCCGGATGAC ATCCTCAACT 

20 CGGTGCCCTC GCTGGGCCAT 6CGCCCTGGA 
CTCGAGTGGC TGTGGACGTG GGAGCCGGCC 
GTTCTGAGGC GGGGACGGTC CTCAAGTTCC 
CGTCTGGGCT CAGTGTCTTC CTGGAGGA6T 
GGCCCGGCGG TGGCGAGACA GG6CAGCG6C 

25 GGGGCCTGCT GGCTGCCTTC CCCCGCTGCG 
AGTACTCGGG GTGTATGAA6 AACTGTATCG 
CCGACGGCTC CTGCATCTTC CTCAGCCCGG 
CC6GGGCCAG CACCTCAG6C TTAGGGGACT 
AG6ACCGCGC 6GGGCTGGTG TCGGT6AACC 

30 TGGTGGGAGC CGTGGTGTCC G6CTTCAGCG 
GGGAGCTGGC CCGGCGCAAG GACAAGGAGG 
TGCTGA6CGT CAGCCGCCTG GGCGAGCGCA 
GCGGTGGCGG TGGCGCCGGG GTTCCCCCGG 
GCTGGGCCAA GGCCACGCTG CTGCAGGGCG 

35 CCAC6CCCGA GCAGACGCCG CTGCCGCAGA 
ACGCCCTGGG CCCCCGCGCC TGGGACCACG 
CCTCCCTCCT GCTGCTGGCG CCCGCCCGGG 
CGACCCCCQA CGGCCGCCTC TATGCTGCCC 
CGCTCACCCC CCACGCCAQC CCGGACCGCC 

40 TGGACCCAGC CTCAGCCGCC GATGGCCTCC 
GCCTGAGGAG GCCACTGGGC CCCCACGCCC 
CGTTCAACAG CGGCGAGGCC CGGCCTGGGG 
GCACAGACTT GGCCCACCTC CTCCCCTATG 
C CTAG GCCGG GG6CCCCCC6 ATGCCTTGGC 

45 ACGGTGCCAG AACGCCGGGG CCCGGGGCAA 
ACCCACXrCGC GGAGTGGGGG GCCCCCTCCX3 
CCCTACCCGG GGCCGCAGGA CGCTGAGAC6 
CTAT6GATTT GAGGTTGACC TTATGCGCGT 
CTTTTGCGGT TTTCTAACCA ATTGCACAAC 

50 GGCTT66ACG CCGGTGGGGA ATGGG6GGCC 
CTGGAAAGGT CCCTCCCCAA CCCAGGCCCC 
GCCGT6TTCG TGTGCAAGGG GCCGGGGAGG 
GCTQTGGGCG TGTGTGTCAA GTGGGCCACG 
CGTGGTGGCC CCAGCGGCCT GGGCGTTGGC 

55 GGGGGTCTCC GAGGTGCCGG TTAGGAGTTT 
GGACAATGCC GGGGTTTCAG GCAGGAGACA 
CAGCAGCTGT CTAAAGGGCT TGGGG6CCTG 



CCAGGGACTA CCTGAACCAC TATCCCGTGT 180 

CCGCAGAAGG TGCTGAC6AC CTCAACATCC 240 

TCATTGGGGA CAGGGACAAC CTCTACCGCG 300 

TGCGGTACCA GAGGAAGCTG ACCTGGAGAT 360 

TGAAGGGCAA ACAGGAGGGC GAGTGTCGAA 420 

AGTCCACGCT CTTTGTGTGC GGTTCCAACG 4 80 

TAGACACCCT GCAGCCCGTC GGAGACAACA 540 

CCAAGCACGC C2\ATGTTGCC CTCTTCTCTG 600 

ACTTCCTAGC CATTGATGCT GTCATCTACC 660 

CCGTGAAACA TGACTCCAAG TG6TTCAAAG 720 

GCAGCCATGT CTACTTCTTC TTCCGGGAGA 780 

TGGTGGTGTC CC6CGTGGCC CGAGTGTGCA 840 

TGGAGAAGCA GTG6ACGTCC TTCCTGAAGG 900 

CCCATTTCTA CTTCAACGTG CTGCAGGCTG 960 

CCGTGGTCCT GGCCGTTTTT TCCACGCCCA 1020 

CCTTTGACCT GACACAGGTG GCAGCTGTGT 1080 

CCGAGTCCAT CTGGACGCCG GTGCCGGAGG 114 0 

GCGCAGCCCC CGGGATGCAG TACAATGCCT 1200 

TTGTCAAGAC CCACCCTCTG ATGGACGAGG 1260 

TCCTGCGGAC CCTGATGAGG CACCAGCTGA 1320 

CCTGGGGCAA CCAGACCGTT GTCTTCCTGG 1380 

TCGTCCGGCC CAATGCCAGC ACCTCAGGGA 1440 

TTGAGACCTA CCGGCCGGAq AG6TGTGGAC 1500 

TGCTGA6CTT GGAGCTGGAC 6CAGCTTCGG 1560 

TGGTCCGAGT GCCTGTGGCT CGCT6CCAGC 1620 

GCAGTCAGGA CCCCTACTGC GG6T6GGCCC 1680 

GCACCAGAGC CGCCTTT6AG CAG6AC6TGT 1740 

GCACAGGACT CCTGCGGGCC AGCCTCTCCG 1800 

TGCTGGTAAC GTCGTCGGTG GOGGCCTTCG 1860 

TGGGCTGGTT CGTGGGCCTC CGTGAGCGGC 1920 

CCATCCTGGC GCACGGGGCG GGCGAGGCGG 1980 

GGGCGCAGGG TCCCGGGGGC CGGGGCGGAG 204 0 

AGGCCCTGCT GGCGCCCCTG ATGCAGAACG 2100 

GGCCCCACGA CCTGGACTCG GGGCTGCTGC 2160 

AGCGCCTGCC CACTCCGCAC CCGCACCCCC 2220 

GCCACCCCCT GCTCCCGGCC TCCGCTTCAT 2280 

CCCCCGAGCA GCCCCCCGCG CCTGGGGAGC 2340 

GGCCCGGCGG CGCCTCCCAC GGCGACTTCC 2400 

GGCGGGTGGT GTCCGCGCCC ACGGGCCCCT 2460 

CGCGGCCCTG GAGCCCGCCC CCGACGGGCA 2520 

CTCCGGCCGC CACCCTGCGC CGCACCCACA 2580 

ACCGCCACCG CGGCTGCCAC GCCCGGCCGG 2640 

GGGGG6CGGA CAGGACTGC6 CCCCCCQTGC 2700 

AGTGCCAGCC ACGG6AACCA 66AGCGAGAG 2760 

CTCCGAGT66 GT6CTCAAGT GCCCCCCGCG 2820 

CCACAAGGAA GCACAACCAG CTCGCCCTCC 2860 

GTTT6GGGGT GG6TGGGCG6 GAG6ACTTTG 2940 

AGGTTTTGGT TTTTTTTGCA GTTTTGGTTT 3000 

TCC6TTCTCG GGGT6GCG6C AGGCAGG6GA 3060 

ACAGCTGCAG ACCTAAGCCC TCCCCCACCC 3120 

TG6CGTGTGT GGGTGT6C6T GCGTGTGCGT 3180 

TGGGCGTGTG TGTGCGTGCC AGCGAAGGCT 3240 

CGTGCAGGGT GTGTGTCCAC GAGCGACGAT 3300 

TGAGCCGACG CTGGGGCTTC CAGAAGGCCC 3360 

GT^CCCCCCC CACTCTGCAG AGGGAAGCGG 3420 

CGAGGAGGGC CTGCCCGGAA GTCACATCGG 3480 
GGGGGCGGCG AAAG 



60 Seq ID No: 167 Protein secnience; 
Protein Accession #: NP_115484.1 

1 11 21 

65 MQTPRASPPR PALLIiLIilililj GGAHGIiFPED 
GADDLNIQRV IiRVNRTLFIG DRONIjYRVBIi 
KQEGECRNFV XVLLLRDBST XiFVCGSNAFN 
ANVALFSDGM LFTATVTDFL AIDAVIYRSIj 
VYFFPREIAM EPNYIiEKWV SRVARVCKND 

70 YFNVLQAVTO WSIiGGRPW LAVFSTPSNS 
IWTPVPBDQV PRPRPGCCAA P6MQYNASSA 
TLMRHQIiTRV AVDV6AGPWG NQTWFIiGSE 
YRPDRCGRPG GGETGQRIiLS LBIiDAASGGIi 
DPYCGWAPDG SCZFLSPGTR AAFEQDVSGA 

75 TSSVAAFWG AWSGFSVGW FVGLRERREZi 
GPGGRGGGGG GGAGVPPBAI. LAPLMQN6WA 



31 41 51 

III 

PPPLSVAPRD YliNHYFVFVG SGPGRLTPAE 60 

EPPTSTELRY QRKLTWRSNP SDZNVCRMKG 120 

PVCAHYSIDT LQPVGDNZSG MARCPYDPKH 180 

GDRPTLRTVK HDSECWFKEPY FVHAVEWGSH 240 

VGGSPRVLBK QWTSFItKARIj NCSVPGDSHF 300 

IPGSAVCAFD LTQVAAVFEG RFREQKSPES 360 

LPDDIIiNFVK THPLHDEAVP SliGHAPWILR 420 

AGTVZiKFLVR PNASTSGTSG I»SVFIjEEFET 480 

LAAFPRCWR VPVARCQQYS GCMKNCIGSQ 540 

STSGLGDCTG LLRASLSEDR AGIiVSVNLLV 600 

ARRKDKEAIL AHGAGEAVZiS VSRIiGERRAQ 660 

KATIiLQGGPH DIiDSGUliPTP BQTPLPQKRL 720 



259 



BNSOCCID: <WO 02079492A2_L> 



wo 02/079492 



PCT/US02/04915 



10 



20 



PTPHPHPHAI* GPRAWDHGHP LLPASASSSL LLLAPARAPE QPPAPGEPTP DGRbYAARPG 780 
RASHGDFPIiT PHASPDRRRV VSAPTGPLDP ASAADGLPRP WSPPPTGSLR RPLGPHAPPA 840 
ATLRRTHTFN SGEARPGDRH RGCHARPGTD LAHLLPYGGA DRTAPPVP 

Seq ID NO: 168 DNA sequence 
Nucleic Acid Accession #: AW205664 

Coding sequence: 1-13S (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

£SGCACGAGG AGAACAGGGG CCTCTGCCTC AGTTTGCCCG GGAGCCAGCC AGGGCCCATC 60 

CTAATTTGGA GCACAGTCTT CCCGGTGCCT AGACATGCCA AGGCCCCTCC CACGTGGTAC 120 

15 ACCCTCTCCG TTTAGTACCT GACCACCTGT TTCAAAACGC AGGTGTTTCT GGTTTAGAAA 180 

CTTGGAAGGC GGAATGTGTT TTCGTGTCTT CTAGGAAGGG TCTGCTGAGG ACCA6ACCAC 240 

GTAAGCCTGA GTGGATCCTG ACTCAGCTGC AGCCCTTACC TGCCTCGTGC TGATGATCTA 300 

TGCATGGCGT TATGTAGATC ACX5TGCGGCA GAGACAGCCA CTGTCCTGTG TGCGGGTTTT 360 
TAAAACAGCT GCCCTGGATG AAACGGAATA AACCAGTGAT GCTAAAAAAA AAAAAAAAAA 



Seq ID No: 169 Protein secaience: 
Protein Accession #: AW205664 



25 1 11 . 21 31 41 51 

I I I I I I 

RHEBNRGLCL SLPGSQPGPI LIWSTVFPVP RHAKAPPTWY TZ>SV 



30 Seq ID NO: 170 DNA sequence 

Nucleic Acid Accession #: AB033100 

Coding sequence: 32->2623 (underlined sequences correspond to start and stop codons) 



35 



40 



45 



50 



55 



60 



65 



70 



75 



1 
I 

AGGTCTGGGG 
GACGGTCTCG 
GCACTCCGTC 
CATCATCCCC 
GATCCATGAT 
TGAGCACTAC 
GGATGTCACT 
CCGGCAGGTG 
CAGGCGGGTC 
GCGGGAGGAA 
AGACAAGCAG 
CCTGGAGCTG 
CCATGTGTAC 
TGAGGACGAC 
CTACAGGTAC 
CGCCTTTGTC 
GCCTCCCCCA 
GGTCCTGGGC 
CCCCACGCAG 
CATGGTGCCC 
CGAGTTGCAT 
ACCGGAGAGC 
GAGCCTGGAG 
GCTGGCCTTT 
GCCCGTGACG 
CCTACGGGAG 
GGCCAACTTC 
GGCCCTGGGG 
CTGGGTGAGC 
GTGGCCTGGG 
CCATCTAAGC 
CCTTACCATG 
CCGCATCCCC 
GGCCCTGCGG 
CG6CCAGGGC 
AGGCTTCCCC 
GG6TGAATTT 
GAAGGAGGTG 
CCTGCGGGAG 
MT6CGGAGG 



11 
I 

TCCTGAGGCT 
GCAGGCACCC 
AGCATCCACT 
AACAAGGTGG 
GAGCTGCTCA 
CTGGTGCAAG 
GAGAAGATGG 
CAGGGTGGGC 
CTCCAGAAAC 
MCTGTGCTTT 
AACCTTCATG 
GCCATCCGGA 
CATAACACCG 
TTGCATGTGA 
CACCGCCTGC 
AGTGTTCTCC 
GCCCTCGTCT 
ACCCTCATCC 
GCCAAGCCCC 
CAGGGAAGGA 
GACCTGA/UVG 
CCAGCCCAGG 
CGATACTTCT 
GCCCTCAGTT 
CTGAGCTCAG 
GACGATCTGG 
CGGCGGGTGC 
AGCATCCTGG 
CTTCGGGAGG 
CCCCCTGTGG 
GAGCCTCCCC 
CAGGAGGTCT 
ATGCCGGACT 
GCCGCCCTCT 
CGTACCACAA 
6AGGTG66T6 
CAGGTA6TAA 
GACGCAGCGC 
ATCATCATCT 
CTGCA6CTGC 



21 
I 

GCTGGCAGAC 
CATTTGAGGG 
CCTTCCAGAG 
CCCCTGTTGT 
AGGCTCATTA 
GAGCTCAGGC 
ATGTGCTGGG 
TCACTGTGTT 
TCCAGAAGGA 
TCCTGCGTGC 
AGAACCTCCA 
AAGAGATCCA 
AGGACCTGTG 
CGGAGGAGGT 
CCCTGCCCGA 
GGGAGACCCC 
TCAGCTGCCA 
T6CTTCACCG 
TGCCTATGGA 
6GAT6GTGGA 
AAGTGGTCTT 
GAAGCGGCAG 
ACCTGATCCT 
TCAGCCGCTG 
CAGGCCCTGT 
TCTCCCCGGA 
CCCGCATGCC 
CCTACCTGAC 
AGGCCGTGTT 
CTCCTGACCA 
CAGGGAAGGA 
TCAGCCAGCA 
TCTGTGCCCC 
CCAAGGACCC 
CT6CGATGGT 
AGGAGGAGCT 
TGAAGGTGGT 
TGGACACTGT 
GCACCTACCG 
GGAGCCT6CA 



31 
I 

TATGGQTACA 
CCTACAGGGC 
CACTAGCTTG 
GATCACGTAC 
CACGTTGGGC 
CTTACCCCAG 
CACCGTGGGA 
CGGCATGGGA 
CGGACATAGG 
AGATGAGGAC 
GGGCCTTGGA 
CGACTTTGCC 
GGGGGAGCCC 
GTACAAGCGG 
GCAAGGGAGT 
CAGCCTGCTG 
GATGGGCGTG 
CAGT6GGACC 
6CAGTTCCAG 
A6AG6T6GAC 
GGAAAACCAG 
CCGACACAGC 
GTTTAACTAC 
GCTGTGTGCC 
GGCTCCGAGG 
CGCGCTCAGC 
CATCTACGGC 
GGACGCCAAG 
GGAGTGTGAC 
GCTGGAGACC 
GGGCCCCCTG 
CCGCAGGGCC 
CCGAGAGGAG 
AGGCACTGGC 
GGTGGCTGTC 
CGTGAGT6TG 
GCAGCTGCTA 
CAGC6AGACC 
CCAGGCGAAG 
GTACTTGGAG 



41 
I 

ACGGCCAGCA 
A6TGGCACGA 
CATAACAGCA 
AACTGCAAGG 
CGGCTCTCGG 
GGCCGCTACT 
AGCTGTGGGG 
CAGCCCAGCC 
GAGTGTGTCA 
TTTGTGTCCT 
CCCGGGGTCC 
CAGCTGAGCG 
CATGCTGTGG 
CCCCTCTTCC 
CCCCTGGAGG 
CAGCTCCGTG 
GGCAGGACCA 
ACCTCCCAGC 
6TGATCCA6A 
AGA6CCATCA 
AAGAAGTTAG 
GTCTGGCAGA 
TACCTTCAT6 
CACCCTGAGC 
GACCTCATCG 
ACTGTCAGAG 
ACGGCCCAGC 
AGGAGGCTGC 
GGGCACACCT 
CTGGAGGCCC 
ACCTACAGGT 
TGTCCTGGCC 
GACTTTGACC 
TTCGTGTTCA 
CTGGCCTTCT 
CCTGATGCCA 
CCCGATGGGC 
AT6ACGCCCA 
GCAGC6AAA6 
CGCTATGTCT 



51 
I 

CAGCCCAGCA 
TGGACAGTCG 
AGGCCAAGTC 
AGGAGTTCCA 
ACAACACCCC 
TCarGGTGCG 
CCCCCAACTT 
TCTTAGGGTT 
TCTTCTGTGT 
ACACACCTCG 
GGGTGGAGAG 
AGAACACATA 
CCATCCATGG 
TGCAGCCCAC 
CCCAGTTGGA 
ATGCCCAC6G 
ACCTGG6CAT 
CA6A6GCTGC 
GCTTTCTCCG 
CTGCCTGTGC 
AAGGTATCCG 
GGGCGCTGTG 
AGCAGTACCC 
TGTACCGCCT 
CCAGGGGCTC 
AGATGGATGT 
CCAGCGCCAA 
GGAAGGTTGT 
ACAGCCTGCG 
AGCTGAAGGC 
TCCAGACCTG 
TCACCTACCA 
AGCTGCTGGA 
GCTGCCTCAG 
GGCACATCCA 
AGTTCACTAA 
ACCGTGTGAA 
TGCACTACCA 
AGGCGCAG6A 
6CCTGATTCT 
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CTTCAACGCG 
GATGCAG6AG 
C6AGCTGGA6 
GAGCTGCAGC 
CCCCCCACCC 
GGCCCTGAGG 
TTG6GAGCCT 
CAAACCACCA 
GTGCACACTG 
CCCCCAGTTG 
TGGCAGCCCC 
CTGCTCTGCC 
GCCTGAGGTG 
ACCCAGTTTT 
CCTACCCAGC 
AGGGAGCCCC 
AGGCCTGCTC 
GCTGCAGGTC 
CCTGCTGCAG 
TGGACAGTGA 
GGCCATCCTT 
CT6CTGAGTT 
CAGGTGGCAG 
GTGCTTGCTG 
G6GACCAGSA 
GATGTGTCAT 
CA6TATTGGA 
GTTCACTACC 
CTTTGCAGAC 
GTGTTCTCAT 
CTCTCTGCCT 
CRGGGGTGTT 
GCATTAAAAA 



TACCTCCACC 
GTGGCATCGA 
AGCG6GGAGG 
CTCGAGCCCT 
ACAGGGCCCC 
GGTGCTGGCC 
TTTTAGAAAG 
AGGTGTGTGG 
CTGTGTGTAC 
CCAAACACTG 
TGGCACAGAG 
ATTGCCGCTC 
GGTGGAGGGG 
CTGGACTCTC 
CTGGTGGGGC 
TCTCATGGGG 
CACTTGTCTG 
CCCCGGCATC 
CAGGAGCCCC 
GGTGTGCAAG 
GCTGAGCATC 
AGAGGCTGCT 
A6AAGTGCCA 
AAACCCAGGA 
AAGCCTGTCT 
TGGTCATGAT 
AAATATTTGA 
TTTTCAGGTT 
AAGGTCTAGA 
GGTTGGCTCT 
CAGTTTCCCC 
GTGAGGATTC 
CAGCTAAATG 



TGGAGAAG6C 
AGGCTGGCAT 
ACCAGCCCTT 
CT6CCCCCGA 
ACGCAGGCCT 
TTGAAATGAT 
AACTTTTTAT 
CTGACCTCCA 
CTTGCAGACA 
TGGATCTCTC 
CAGACCCGGC 
CCCTTCTTGC 
ACAGTGTTCT 
ATGCCCCCAT 
TGGCAGGATG 
AGGAAAGAGC 
GGAACCTGGG 
TCTCTCTGTC 
TUVGGAGTGCT 
GGT6CACTGA 
TTTGAGCCTG 
GGGATCCACT 
TGTTTGCGTT 
6CTGAACAGT 
TTGGTTAGGC 
ATTTGAAAAG 
CCCCCTTG6C 
TATTGTTTTT 
TGCGGAGTCA 
GACTTTCAGC 
ATCTGTAAAA 
ATTTGTGATT 
TG 



CGACTCCTGG 
CTACGAGATC 
CTCCAGGCTG 
GGACTTGCTG 
GGGGT6TCTG 
TCCCCCACTT 
AGGACAGGGA 
GGGAGGAGCA 
GGCCGGCGTT 
TGTCCTCTTC 
CACTGGTAGC 
TGCCCAAGCA 
GGATAGATCT 
CTCCGACCTG 
GTGGAGGTTT 
TTCCAGGGGG 
CAGGAGGCAC 
CCGGCAGCCC 
AGCTGAGGGT 
GGGTGGTGGG 
CCTTCCGGTG 
GTTTCCACAC 
GA6CCTTGCA 
GAGGAGGCTG 
TOGTGTACTT 
GG6AGGAGGC 
TGAATTCTTT 
ATTTTTGCAT 
GAGATGGGAC 
TGTGTTGGGA 
TGGGAGAATA 

Trr ri TTTT T 



CAGAGGCCCT 
CTTAACGAGC 
C6CTACCGGT 
TAGG GGGCCT 
AGGT6CTCTT 
CCTGGAGAGA 
GACAGCACAG 
CTCACTGGAG 
CAGCCTCCAA 
TCCCCTCTCT 
TCCCCACTTC 
CTGCCCTCGG 
ATTATGTGAA 
GGAGACTTCA 
CTCAAGGAGC 
CGAACGCAGC 
AGAGGAAGCC 
AGGATGGCCT 
GGTTGCTGGG 
AGGGGATCAC 
6GA6CA6AAA 
A6CGGGAAG6 
GCTCTTCCAG 
TCCACCTTGC 
CT6CA6GAAA 
CX3AAGTTGTT 
TGCAGAACTA 
GAATTAAGAC 
TGAAT66GGA 
CCACTGGCTG 
ATACTTGCCT 
TTTTTGTACA 



TCAGCACCTG 
TGGGCTTCCC 
GGGAGGAGCA 
TACTCCCTGT 
6GCTGG6AGC 
CTGAGCGGAG 
CCATCCCTTG 
TGCTCACAAG 
GGGGCTCACT 
CAGATTGGCC 
CTTACTCCTG 
GCGTCTGGCA 
AGGCAGCTTC 
GGAATGACAA 
TGGAGACTTC 
ACAGAGGAAG 
AAGGCCTGGA 
GGTGCCCCCA 
GTGGTCCTCA 
CTGGGTTCCA 
A06CCAGACC 
CT6CTG6GAA 
CTGGGGACTG 
TTGGCTCACT 
AAAAAAAAA6 
CCCATTTATC 
CTGTGTGTCT 
GTTTTAATTT 
GGGATCCTTT 
ATCACATCAC 
ACCTACCTCA 
GAGCTTTTAA 
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Seq ID No: 171 Protein sequence ; 
Protein Accession #: BAA86588.1 
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MGTTASTAQQ 
ITYNCKEEFQ 
TVGSCGAPNF 
BDFVSYTPRD 
EPKAVAIHGE 
LLQIiRDAHGP 
FQVZQSFIjRM 
HSVWQRALHS 
PRDLIARGSIi 
AKRRIiRKWW 
PLTYRFQTCIi 
TGFVFSCLSG 
LLPDGHRVKK 
LERYVCLIIiP 
RliRYRWQEQS 



11 
I 

TVSAGTPFEG 
IHDELLKAHY 
RQVQGGIiTVP 
KQNI«HENIiQG 
DDLHVTEEVY 
PPALVFSCQH 
VPQGRRMVEE 
LERYFYLILF 
REDDI»VSPDA 
VSLRKEAVIiB 
TMQEVFSQHR 
QGRTTTAMW 
EVDAALDTVS 
KAYIaHIiBKAD 
CSLEPSAPED 



21 
I 

LQGSGTMDSR 
TLGRLSDNTP 
GMGQPSLLGF 
LGPGVRVESIi 
KRPLFLQPTY 
GVGRTNLGMV 
VDRAZTACAE 
NYYLHEQYPIi 
LSTVREMDVA 
CZX3HTYSLRW 
RACPGLTYHR 
AVLAFWHZQG 
BTMTPMHYHL 
SWQRPFSTNM 
IiL 



31 
I 

HSVSIHSFQS 
EHYLVQGAQA 
RRVLQICLQKD 
ELAIRKEIHD 
RYHRIiPXjPEQ 
LGTIiIIiLHRS 
IiHDIiKEWIiE 
AFALSFSRWIi 
NFRRVPRMPI 
PGPPVAPDQIi 
IPMPDFCAPR 
FPBVGEEEIjV 
REIIICTYRQ 
QBVASKAGIY 



41 
I 

TSLHNSKAKS 
LPQGRYFLVR 
GHRECVIPCV 
FAQLSENTYH 
GSPLEAQIiDA 
GTTSQPEAAP 
NQKKLBGIRP 
CAKPELYRLP 
YGTAQPSAKA 
ETLBAQLKAH 
EEDFDQIiLEA 
SVPDAKFTKG 
AKAAKEAQEM 
EZLNBLGFPE 



51 
I 

IIPNKVAPW 
DVTEKMDVIiG 
REBVLFIiRAD 
VYHNTEDLWG 
FVSVIiRETPS 
TQAKPZiPMEQ 
BSPAQGSGSR 
VTLSSAGPVA 
liGSIZiAYIiTD 
L5EPPP6KEG 
LRAAIiSKDPG 
EFQWMKWQ 
RRLQZiRSI^Y 
LESGEDQPPS 
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Seq ID NO: 172 DNA secruence 

Nucleic Acid Accession #: AK021806.1 

Coding sequence: 1-645 (underlined sequences correspond to start and stop codons) 
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ACT GTGCTTT 
AACCTTCATG 
GCCATCCGGA 
CATAACACCG 
TTGCATGTGA 
CACCGCCTGC 
A6TGTTCTCC 
GCCCTCGTCT 
ACCCTCATCC 
GCCAAGCCCC 
CAGGGAAGGA 
AGTTTTCTGG 
CCCAGCCTG6 
AGCCCCTCTC 



TCCTGCGTGC 
AGAACCTCCA 
AAGAGATCCA 
AGGACCTGTG 
CGGAGGAGGT 
CCCTGCCCGA 
G6GAGACCCC 
TCAGCTGCCA 
TGCTTCACCG 
TGCCTATG6A 
GGATGGTGGA 
ACTCTCATGC 
T6GGGCTG6C 
AT666GAGGA 



AGATGAGGAC 
GGGCCTTGGA 
CGACTTTGCC 
6GGGGAGCCC 
GTACAAGCGG 
GCAAGGGAGT 
CAGCCTGCTG 
GATGGGCGTG 
CAGT6GGACC 
GCA6TTCCAG 
AGAGGTGGAT 
CCCCATCTCC 
AGGATGGTGG 
AAGA6CTTCC 



TTTGTGTCCT 
CCCGGGGTCC 
CAGCTGAGCG 
CATGCTGTGG 
CCCCTCTTCC 
CCCCTG6AGG 
CAGCTCCGT6 
GGGAGGAGCA 
ACCTCCCAGC 
GTGATCCAGA 
A6ATCTATTA 
6ACCTGGGAG 
AGGTTTCTCA 
AGGGGGCGAA 



ACACACCTCG 
GGGTGGAGAG 
AGAACACATA 
CCATCCATGG 
TGCAGCCCAC 
CCCAGTTG6A 
AT6CCCACGG 
ACCTGG6CAT 
CAGAGGCT6C 
GCTTTCTCCG 
T GTGAA AGGC 
ACTTCAGGAA 
AGGAGCTG6A 
CGCAGCACAG 



AGACAAGCAG 
CCTGGAGCTG 
CCAT6TGTAC 
TGAGGACGAC 
CTACAGGTAC 
C6CCTTTGTC 
6CCTCCCCCA 
G6TCCTGGGC 
CCCCAC6CAG 
CATGGTGCCC 
AGCTTCACCC 
TGACAACCTA 
GACTTCAGGG 
AGGAA6AGGC 
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CTGCTCCACT 
CAGGTCCCCC 
CTGCAGCAGG 
CAGTGAGGTG 
ATCCTTGCTG 
TGAGTTAGAG 
TGGCAGAGAA 
TTGCTGAAAC 
CCAGGAAAGC 
TGTCATTGGT 
ATTGGAAAAT 
ACTACCTTTT 
GCAGACAAGG 
TCTCATGGTT 
CTGCCTCAGT 
GGTGTTGTGA 
TAAAAACAGC 



TGTCTGGGAA 
GGCATCTCTC 
AGCCCCAAGG 
TGCAAGGGTG 
AGCATCTTTG 
GCTGCTGGGA 
GTGCCATGTT 
CCAGGAGCTG 
CTGTCTTTGG 
CATGATATTT 
ATTTGACCCC 
CAGGTTTATT 
TCTAGATGCG 
GGCTCTGACT 
TTCCCCATCT 
GGATTCATTT 
TAAATGTG 



CCTGGGCAGG 
TCTGTCCCGG 
AGTGCTAGCT 
CACTGAGGGT 
AGCCTGCCTT 
TCCACTGTTT 
TGCGTTGAGC 
AACAGTGAGG 
TTAGGCTCGT 
GAAAAGGGGA 
CTTGGCTGAA 
GTTTTTATTT 
GAGTCAGAGA 
TTCAGCTGTG 
GTAAAATGGG 
GTGATTTTTT 



AGGCACAGAG 
CAGCCCAGGA 
GAGGGTGGTT 
GGTGGGAGGG 
CCGGTGGGAG 
CCACACAGCG 
CTTGCAGCTC 
AGGCTGTCCA 
GTACTTCTGC 
GGAGGCCGAA 
TTCTTTTGCA 
TTGCATGAAT 
TGGGACTGAA 
TTGGGACCAC 
AGAATAATAC 
TTTTTTTTTT 



GAA6CCAAGG 
TGGCCTGGTG 
GCTGGGGTGG 
GATCACCTGG 
CAGAAAAGGC 
GGAAGGCTGC 
TTCCAGCTGG 
CCTTGCTTGG 
AGGAAAAAAA 
GTTGTTCCCA 
GAACTACTGT 
TAAGACGTTT 
TGGGGAGGGA 
TGGCTGATCA 
TTGCCTACCT 
TGTACA6A6C 



CCTGGAGCTG 
CCCCCACCTG 
TCCTCATGGA 
GTTCCAGGCC 
CAGACCCTGC 
TGGGAACAGG 
GGACTGGTGC 
CTCACTGGGA 
AAAAAGGATG 
TTTATCCAGT 
GTGTCTGTTC 
TAATTTCTTT 
TCCTTTGTGT 
CATCACCTCT 
ACCTCACGGG 
TTTTAAGCAT 



900 
960 

1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 



20 



Seq ID No: 173 Protein sequence ; 
Protein Accession #: AK021806.1 
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I I I I I I 

TVLFLRADED FVSYTPRDKQ NIjHENLQGLG PGVRVESLEL AIRKEIHDFA QliSENTYHVY 60 

HNTEDLWGEP HAVAIHGEDD LHVTEEVYKR PLFIiQPTYRY HRLPIjPEQGS PLEAQIjDAFV 120 

SVIiRETPSI*Ij QliRDAHGPPP ALVFSCQMGV GRTNUSMVLG TLILLHRSGT TSQPBAAPTQ 180 
AKPLPMEQFQ VIQSPLRMVP QGRRMVEEVD RSIM 
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Seq ID- NO: 174 DNA sequence 
.Nucleic Acid Accession #: NM_0165B0.2 

Coding sequence: 1212-4766 (underlined sequences correspond to start and stop codons) 
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GGGAAGC6GG 
AGACCATGGG 
TCCAGAGGTA 
AACGTCACTG 
TTTAATCCTC 
AGATGAG6TA 
AGTTGCAAAG 
CAGCTGCTTT 
ACAAGATTGG 
GAGAGTTGGG 
ACGTGCAAGC 
CCCTCAGCAG 
CCGGCCTCAT 
TTGGCATTGA 
TGCTTCTCTG 
ACTGGGTCAG 
GTTCCCCAGT 
CTTCCAATGA 
GACT^GGTTG 
AACACTGGAG 
1GGCG6TAAG 
ACTTATTTCT 
CAGA6GAA6T 
AGAGGCG6AG 
TTCAGGTGGA 
TGTGCCGACA 
CTCTGATCCA 
AAGGCGAGCA 
ACAGAGCTCT 
GTGAGCACTT 
TAGTGGTGAA 
ATGACAATGG 
CCAATGACAA 
CTGCACCTGG 
GGGAGGTGGA 
TTGATGCCAA 
CCTACGAGGT 



11 
I 

AGGAGAGCPV 
CACCCTCATA 
GCCATAGGGT 
CCTGTGACTC 
ACAGTTTCCT 
ATGGAGGCCC 
TCAGAATTTG 
CCAGTGAGAC 
GAAAAAGACA 
GAAGGGGAGA 
AGGTG6AGGC 
CCGCCTTGCT 
CCTCCTCAGC 
AACTCACAGC 
TTTCCCCAGC 
GTCCCTCTTA 
AG6GG6TCTC 
AGATGGCCTT 
TGCCGGCAGC 
GACTTGTCCA 
CATGA TGCAA 
TTTAGGGGAT 
GCCATCTGGT 
GCAAGCTGGG 
CTCTGAGGAA 
GTGGGATCCC 
TGTGGAGATC 
GGAGCTGGAA 
T6ACCCAGAC 
TGCCTTGGAT 
GGAGCTGGAC 
GAACCCCCCC 
TAGCCCTGCG 
TACGCTTCTC 
GTTCTTCCTC 
GACAGGCCAG 
GGATGTTCAG 



21 
I 

CACGGTCAAG 
AGTCAGTGTG 
GTGACAAGTT 
GGGGCCA6GC 
GCTGAAAGGG 
AGGA71A6TTA 
AACTCAGGCA 
AAAAACGGGT 
GGAATGAGAG 
GAAACAGGAA 
TCAAGTTTTC 
GCGTGGACAG 
GGCAGGCCAC 
CCTCCCTTTT 
CTAGCAACTG 
GACCAGCTCT 
CCCTG6CAAT 
GCCTTGGGGT 
TCTGGGGGAA 
GCCTTGAAAG 
CTTCTGCAAC 
TGTCAGGAGG 
ACA6TGATCG 
GCTGCCTTCC 
GGCTTGCTCA 
TGCCTGGTTT 
CAAGTGCTGG- 
ATCTCTGAGA 
ACAGGCCCTA 
GTCATTGTGG 
AGGGAAATCC 
AAGTCAGGTA 
TTTGCTGAGA 
ATAAAACTGA 
AGTAAGCACA 
GTCATTCTGC 
GCAAGGGACC 



31 
I 

TTGCACAGGT 
GGCAGG6ACT 
GTGCAGATTA 
CCAGGCCAAA 
CTACTATTCT 
AGTGACTTGT 
GTTTACCTCT 
GATCAGGGCA 
GGGAACAATG 
ACATGACTTG 
TGCTCACTTG 
CAGCTTCCCA 
TTAGCTTCAC 
CCTGTAGGTG 
TTTGGCAGTC 
TGTCCATCAT 
TCTTGATC6G 

G6AGCACGG6 
AACTCTAGTG 
TTCTGCTGGG 
TGACCACTCT 
GGAAGCT6TC 
AGGTGTTGCA 
GCACAGGCAG 
CCTTTGATGT 
ACATCAATGA 
GCGCCTCTCT 
ACACCCTGCA 
GCCCTGATGA 
ATTCATTTTT 
CCAGCTTGGT 
GTTCACTGGC 
CCGCCACAGA 
TGCCTCCAGA 
GTCGACCTCT 
TGGGTCCCAA 



41 

1 

TCTTGCAGCT 
6CCCCAGGGC 
CAACACTCAC 
6CCCTTCCTA 
TACTCCCATC 
CCCAGATGAC 
GATGGCTGCT 
GAGTCAAGAC 
GGGGAAAAGA 
CCCGGGAGGG 
GTGATGCAGA 
TCTGGCCTGT 
AGGAAATGCT 
GGGTTTCCAT 
AGAGTCCCAC 
TTGCTGAAGT 
CX5TTTGGACA 
TCATAATCAT 
6CTGATCAAG 
GTTTCTGAAT 
GCTTTTGGGG 
CACGGTGAAA 
CCAGGAACTG 
GCT6CCTCA6 
GCGGCTGGAT 
GCTTGCCACA 
CCACCAGCCA 
GCGAACCCGG 
CACCTACACT 
GACCAAACAT 
TGATCTGGTG 
CAAGGTCAAC 
ACTGG/^AATC 
CCCTGACCAA 
GGTGCTGGAC 
AGACTATGAA 
TCCTATCCCA 



51 
I 

TCTGGAATCA 
CAATCCAAGA 
COCTTGCAAT 
CATCATTTCG 
CCCACTCTAC 
ACCGCTGGTA 
CTGTTAATCA 
AGAGAGGTAA 
TAGGAACAAA 
GCATCAGTCC 
GG CTCCCTTT 
CCCCGGAGCC 
CTTTCTCTAA 
AGGAAAAAGC 
ATCCTGCTCA 
GGACCAACTA 
TCTCAGATCG 
CTAACTATG6 
CCATCCAGGA 
CTAGCCCACT 
CCAGGTGGCT 
TACCAAGTGT 
GGCCGGGAGG 
GCGCTCCCCA 
CGAGAGCAGC 
GGGGATTTGG 
CGGTTTCCCA 
ATCCCCCTGG 
CTGTCTCCCA 
GCAGAACTCA 
TTAACTGCCT 
GTCTTGGACT 
CAAGAAGATG 
GGCCCCAATG 
ACCTTCAGTA 
AAGAACCCTG 
GCCCATTGCA 
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AAGTTCTCAT 

CCTCCCAGCC 

TGGCAGATGA 

TGGGCCACTT 

CACTGGACAG 

TCCAGCCCTT 

CACCTGTGTT 

TTCACCTCAT 

ACCGCATCCA 

TCACTGCTCA 

CAGAGGACAG 

ATGCCAAT6A 

TCTCCGTGCT 

GCTTGGGCCC 

TTTTGACAAC 

6CATCCGCAG 

TCGTCAATGT 

TAGAGGACCA 

CCAGTGTGGA 

TGACX5GTGAT 

TGTCCATCTG 

CCACCTACCG 

TCGTGCCTGT 

ATGTGGACAA 

ACCTCACCCC 

AGAGCCGAGA 

ATGCCTCCCG 

CCAGGCCTCT 

AGGAAGCCCC 

ATGGCAAAGT 

GGCTGTCTGT 

CTCCTGTTCA 

AACCAAACCA 

CAGACACAGA 

GGCCTTTGGA 

GTCTGCTGGA 

6GATGGC6AG 

ATGCTGCAQC 

AGCTOAGCCC 

TGGAGATGCT 

GGCGGCTCTC 

GCATGAAAGT 

GCAGCAGCAG 

GGGCCTGAGG 

GCGGCCTGAG 

ACTAACAGCT 

GGGACGGTTT 

CAGTCCTCTG 

GGTAGCAGGA 

ATAAAGGAAA 



CAAGGTTCTG 

ATCACTGGTG 

CTTGGATTCA 

CAGGCTGAAA 

AGAGCAGTGG 

ATCAGCCAAG 

TGAGAAAAGC 

TACCATCAA6 

G6ACTCCCCA 

GAGGTCACTG 

CX36GCAACCC 

TAATGCCCCA 

TGTGAATGCC 

AGCGGGCACT 

CATTGTG6CA 

TGGAAATGAA 

CACCAATGCC 

GGGAAGCCCC 

CCACCTGAGG 

CTGCCTGGCT 

CCGGACAGAA 

CCAGCAGCCC 

GCTCAGGGGT 

GGAGGCGATG 

GACCCTGTAC 

GGTGCTGCAA 

GGAGAACCTG 

GAAGGTTGCA 

ACAGAGGCCA 

GTCCCCTGAG 

GGCT6CCTTC 

GCAAATCTCC 

CCGA6GAAAT 

TGGCCCAAGT 

TCCTGAAGA6 

CCCCAGCACA 

ACTCTCTTTG 

CACGGAGGAG 

AACAGGCACG 

GCTGGAACAG 

GGTCTGCGGG 

GCAAGGGGAC 

CAGCAGGTGC 

ATCTGTGGAC 

AACTTTAGGG 

GACTGACCAA 

GTGGCTGAGA 

GCAGAACAGA 

GTCAG6GGGC 

AGCAGTGATT 



GATGTCAATG 

TCAGAAGCTC 

GGACACAATG 

AGAACTAATG 

CCCAAATATA 

AAACAGCTCA 

AGGTATGAA6 

GCTCATGATG 

GTTGCTCACT 

AACTATGAAG 

ATGCTTGCAT 

GAGGTGGTCC 

TCCACAGGCC 

GACACACCTC 

AGAGATGCAG 

GCCCACCTCT 

AGCAGCCTCA 

CCCTTACAGA 

GACTCAGCCC 

GTACTGTTGG 

AAGAAGGACA 

AAGAGGCCCC 

CAGGCAGGTG 

ATGGAAGCAG 

AGGACGCTGC 

GACACGGTCA 

AACCTTCCCG 

GGCAGCCCCA 

CCAGCCTCCT 

AAAGAATCAG 

6CCGAGCGGA 

CAGCTGCTGT 

AAGTACTTGG 

GCAAGGGCT6 

GACCTCTCTG 

GGTCTGGCCC 

CCCCTCACCA 

CCAAGGACCT 

AGGCTGGCCA 

CGCTCCAGCA 

AGGACCCTCA 

CCAGGTGGAA 

CTG TGAA CAT 

AAGAGCTGGT 

TGACTGATGC 

AGCAGCCCCT 

TAAGTGTTTC 

TGCCACGGAG 

TGTACCCTGG 

CAAAAAAAAA 



ACAACATCCC 
TTCCCAAGGA 
GTTTGGTCCA 
GCAACACATA 
CCCTCACTCT 
GCATTCAGAT 
TCTCCACGCG 
CAGACTTGGG 
TAGTAGCTAT 
AGATGGCCGG 
CCAGTGTCTC 
AGCCTGTGCT 
ACCTGCTG6T 
CACTGGCCAC 
ACTCGGGGGC 
TCATCCTCAA 
TTGGGAGTGA 
CCCGAGCCCT 
GCAAGCCTGG 
GCATCTTCGG 
ACAGGGCCTA 
AGAAACACAT 
AGCCTTGTGA 
GCTGGGACCC 
GTAATCAAGG 
ACCTCCTTTT 
AGCCCCAGCC 
CAGGGAGGCT 
CTGCAACCCT 
GGCCCCGTCA 
ACCCCGTGGA 
CCTTGCT6CA 
CCAA6CCAGG 
GAGGCCAGAC 
TGAAGCAACT 
TGGACCGGCT 
CCAACTACCG 
TCCAGACGTT 
GCACCTTTGT 
TGCCCGTGGA 
GTTTAGACTT 
AGACGGGGAC 
ACCTCAGACG 
TTCTAAAATC 
TACCCCCACA 
TGTAAGCAGC 
CTGGCAAAAC 
TATCACAGGC 
G6GTGCCAG6 
AAAAAAAAAA 



Seq ID No: 175 Protein sequence; 
Protein Accession #: NP_057664.1 



11 



21 



31 



AAGCATCCAC 
CAGTTTTATT 
CTGCTGGCTG 
CATGTTGCTA 
GTTAGCCCAA 
CAGTGACATC 
GGAAAACAAC 
CATTAATGGA 
TGACTCCAAC 
CTTTGAGTTC 
TGT6TGG6TC 
CAGCGATGGA 
GCCCATC6AG 
TCACAGCTCC 
AAATGGAGAG 
CCCTCATACG 
GTGGGAGCTG 
GTTGAGGGTC 
GGCCTTGAGC 
GTTGATCCTG 
CAACTGTCGG 
TCAGAAGGCA 
AGTCGGGCAG 
CTGCCTGCAG 
CAACCAGGGA 
CAACCATCCC 
T6CCACA6GC 
GGCTGGAGAC 
GA6AC6GCAG 
GATCCTGCGG 
GGAGCTCACT 
TCAGGGCCAA 
AGGCAGCAGG 
AGACCCAGAA 
6CTAGAAGAA 
GAGCGCCCCT 
TGAC7VATGTG 
CX3GCAAGGCA 
CTCGGAGATG 
GGCCGCCTCC 
GGCCACCAGT 
TGAGGGCAAG 
CCTCTGGATC 
TTGTAACTCA 
GAGGAGGCAA 
TCTGAGTCTT 
ATATGTGGAG 
AGGAAAG6GT 
AAATGCTCTC 
AAAAAAAAAA 



41 



GTCACATGGG 
GCTCTTGTCA 
AGCCAA6AGC 
ACCAATGCCA 
GACCAAGGAC 
AACX3ACAATG 
TTACCCTCTC 
AAAGTCTCAT 
ACAG6AGA6G 
CAGGTGATCG 
AGCCTCTTGG 
AAAGCCA6CC 
ACTCCCAATG 
CGGCCATTCC 
CCCCTCTACA 
GGGCAGCTGT 
GAGATAGTAG 
ATGTTTGTCA 
ATGTCGATGC 
GCTTTGTTCA 
GAGGCCGAGT 
GACATCCACC 
TCCCACAAAG 
GCCCCCTTCC 
GCACCGGCGG 
AGGCAGAGGA 
CAGCCAC6TT 
CAGGGCAGTG 
CGACATCTCA 
A6CCTGGTCC 
GTGGATTCTC 
TTCCA6CCCA 
AGTGCAATCC 
CAGGAG6AAG 
QAGCTGTCAA 
GACCCGGCCT 
ATCTCCCCGG 
GAGGCACCAG 
AGCTCACTGC 
GAGGCGCTGC 
GCAGCCTCAG 
AGCAGAGGCA 
CAAGAACCAG 
CTAGCTAGCG 
GAGCCCCAGG 
TTGGAGGACA 
CACAAAGGGT 
GGCCTTCTTG 
TGACCTATCA 



22B0 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 



51 



60 



65 



70 



75 



MMQIjIiQLLIjG 
QAGAAFQVLQ 
VEIQVIjDIND 
ALDVIVGPDE 
SPAFAESSIiA 
TGQVILRRPIj 
SLVSEALPKD 
EQWPKYTLTIi 
TIKAHDADLG 
GQPMLASSVS 
AGTDTPPLiAT 
TNASSLIGSE 
CLAVLLGIFG 
IjRGQAGEPCB 
VliQDTVNLLF 
QRPPASSATIi 
QlSQhhShhH 
PEEDIiSVKQL 
TEEPRTFQTP 



LLGPGGYIiFL 
LPQAIiPIQVD 
HQPRFPKGEQ 
TKHAELIWK 
LEIQEDAAPG 
DYEKNPAYEV 
SPIALVMADD 
LAQDQGLQPL 
INGKVSYRIQ 
VWVSLLDAND 
HSSRPFliLTT 
WELEIWEDQ 
LIIiALFMSIC 
VGQSHKDVDK 
NHPRQRNASR 
RRQRHLNGKV 
Q6QFQPKPNH 
LEEELSSIiLD 
6KAEAPBLSP 



liGDCQEVTTL 
SEEGLLSTGR 
ELEISESASIi 
ELDREIHSFF 
TLblKIiTATD 
DVQARDIiGPN 
LDSGHNGIiVH 
SAKKQIjSIQI 
DSPVAHIiVAI 
NAPEWQPVL 
IVARDADSGA 
GSPPLQTRAL 
RTEKKDNRAY 
BAMMBAGWDP 
ENLMIiPEPQP 
SPEKESGPRQ 
RGNKYIiAKPG 
PSTGIiAIiDRZt 
TGTRIASTPV 



TVKYQVSEEV 
RLDREQDCRQ 
RTRIPLDRAIi 
DLVLTAYDNG 
PDQGPNGEVE 
PIPAHCKVIil 
CWLSQELGHF 
SDINDNAPVF 
DSNTGEVTAQ 
SDGKASIiSVL 
NGEPLYSIRS 
IiRVMFVTSVD 
NCREAESTYR 
CLQAPFHLTP 
ATGQPRSRPL 
lIiRSLVRIiSV 
GSRSAXPDTD 
SAPDPAHMAR 
SEMSSLIiEML 



PSGTVIGKIiS 
WDPCLVSFDV 
DPDTGPNTLH 
NPPKSGTSLV 
FFLSKHMPPE 
KVIiDVNDNIP 
RLKRTNGNTY 
EKSRYBVSTR 
RSLNYEEMAG 
VNASTGHLLV 
GNEAHLFIIiN 
HLRDSARKPG 
QQPKRPQKHI 
TLYRTIiRHQG 
KVAGSPTGRIi 
AAFAERNPVE 
GPSARAGGffT 
LSLPLTTNYR 
LEQRSSMPVE 



QELGREERRR 
LATGDIiALIH 
TYTIiSPSEHF 
KVNVLDSNDN 
VliDTFSIDAK 
SIHVTWASQP 
MLLTNATLDR 
ENNXiPSIiHIiI 
FEFQVIAEDS 
PIETPNGLGP 
PHTGQLFVNV 
AliSMSMLTVI 
QKADIHLVPV 
NQGAPAESRE 
AGDQGSEEAP 
EliTVDSPPVQ 
DPEQEBQPhD 
DNVISPDAAA 
AASEALRRIiS 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
760 
840 
900 
960 
1020 
1080 
1140 
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VCGRTLSLDIj atsaasgmkv qgdpggktgt egksrgssss srcl 

Seq ID NO: 176 DNA Sequence 
Nucleic Acid Accession #: AL109712.1 
5 Coding sequence: 2-128 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

in ' I I I I I 

JIU GAGTCTCTTT GGGCCAGCCG GGCTGCTGCA GACAGACAGG AAGCACGCCT GACGCTCCTC 
TACCCTCGGG CAGCACAGCG GGGCTGGGAC TCACTCTAGC TTGCCCAGCA ACTTGCTTTC 
CTGT GTGAA C TCTGGCAGGC TGCCCTCTCT GTGCAAAGCT GCCACTGGGG CCTGCTCAGG 
GTGGCCTGGA ACTTGGAGGT GGGCAGTCAG GGCCTAGGAT GGGCCTGTGT CACCAGGGCA 
TGTGCCCTTG GGCCAGTTAC TTCCTCTCAG AGCCTTGGGC TCCTCCTCTG AGGATGGGGC 

15 TTGTTGGTGT GAAATGAGGT GAGCATGTTG AGTTGGGGAG CAGCAGGACA CGCACCTGCA 
GGCAGCCGCC CTGGCCACGC TCCCTCCCTA CCTTCCGAGT CCTGGGACAG ACACAGTAGA 
6CACAGCGGG CCAGCCTGCT CTCTTCTCTG TCTACTTTTT GCAGAAGAGT CAACAGATAC 
7UVCAGGCCCA GGGAGGTGCC CCTGGGGGCC CCAGTCCCCA TCACTCCAAG GGGCAGTCCT 
GCAAGTGACA AGGTGGGCCC AATCCCTGTG GAACAGGTCT CTGAGGACCA CAGAGTCGGG 

20 CCCCAGGGAA AGCTGGGAGC CGAGCTAGAG GCAGGCAGCA A6TAAGGGCA AAGCT6TGCC 
CCTGCCCGGA AGACCTTCCT GCCCCCAGAA CCCX3ACCCTC CGCAGATAGC CCTCCCTGG6 
CAGCAGCCCC CCAGCTTCCA AGGCCCGTGC CTCACCAGAC GCCATGCTCT CACGGACTTG 
TTTGCTGCTC TGTACCCTGC AGATCTGCCC CAGAGGAGCA GGTGAAAAGC CGCGCCTGCC 
GAGGTGCTGT GGCGGTGGAG TTTTGGGCAG AGGAGTGGGG GGAAGAGTTT CTCACTTTTA 

25 AGATTCTCCA AATCCAAGAT GAAGTCATGC TGTGCTTTGG AATGGTAGAT GCTCATTTAT 
GTAAAATCAT AATAAATGTT ACACAAACTG TTAAAAAAAA AAAAAAAAAA AAAAAA 

Seq ID No: 177 Protein sequence: 
Protein Accession #: AL109712.1 

30 

1 11 21 31 41 51 

I I I I I I 

VSLGQPGCCR QTGSTPDAPIi PSGSTAGLGL TIiACPATCFP V 

35 seq ID NO: 178 DNA sequence 

Nucleic Acid Accession #: none found 
Coding sequence: 3-107 (underlined sequences correspond to start and stop codons) 

40 1 11 21 31 41 51 

I i I I I I 

AATGGAGCAC TCCAAAGAAC (3ATTTGACCA ATAGCATTTC TTCTCTGGGG GTTGTATTTC 

AAAGCATGCA ACTCTCCAGG GAACCAGAAC TAAATTGCTT TU^AATGTVAGT CATTCCTCAG 

ATTAACTTCC TCAGATAAAG TGTCAGCGGT CTGCAGAAAC GAAGAAGACA AAACTGAGAT 

45 TATCACTCAT AATTCTCTTA CTTACTATGT CAGTGAAACA ATGAGTTTGC ATTTTTGCAA 

TCCTAGAACA TTCTTCATTA GCCCTGGGTC ATGACCTCTT CCAGTTAATT CTCTTTCACA 

CCTTTAGGAA AGATTTAAGA TGAACCTTCA ATAGGATATT AACATAACTC ATAGCCAATA 

CCACAGCTGC CTTTCAAATT AATGAGGTTA ATTGTTCTCC AGCAAACATG AGTTTGTCTT 

TGGCATTTTA AATGCTTCCC ATTGATCTGA CATTTTGCTG TTTCAA6TTT TAAAGGGCTC 

50 AAATCAAAGA CTATTGATAA CTGAGCAAAG AGCGAAGATC CAGAAATACG AAAACATTGT 

CTTTTTTTTT CCATGATUU^ CAATC^^TAGC CTTTTGAATT CAATCGT^GT TTCTACATTA 

GCCyiTCTAAG ACTTATTTAA TTATTTCTGT TCTCAGTCAA GCTAATTCAA GTGAATCAAC 

AGTATTCSACT TTTAAAATCT TTTTTAAATT TTTTTAAATC TTTAGTTTAT TAAGTTTGTA 

GAAAAGCTCT GGGGCCATGA CCACTTACGT AAATGTTTCa^ GTTTAAAAAC AAAAGATTCA 

55 GGCCTCTAAT TTGAGCCAAA TCCAGGTGAT CTTGTTTGAA ATTTTTGATG AATTTG/WVA 

GATGAAAGTG GAACTTTTAA CATTCATGTT CCCCAAATTT TTCACTGGGA AGGGATGCTA 

ATTGCCTACT TAAGATATAA GTTCAAGAAT AACaVTTTTCA TAGAAAATTC ACaU^CTTGC 

TTGACACAGC AGTGAC3VTAG TTAGATGTGG CTCA<3ATGCC TTCCAAACCT GAG(K3TCCCC 

AAAGATTTCT TTACCAGTTG TTTTTAACTA TGAATCTTAA TCTTGTTCAT TCCCCTGCCS^ 

60 AAACAAATTT AAAAG 

Seq ID No: 179 Protein sequence: 
Protein Accession #: none found 

65 

1 11 21 31 41 51 

I I I I I I 

WSTPKNDLTN SISSLGWFQ SMQLSREPEL NCIiK 

70 Seq ID NO: 180 DNA sequence 

Nucleic Acid Accession #: none found 
Coding seq[uence: 2-176 (underlined sequences correspond to start and stop codons) 

75 1 11 21 31 41 51 

I I I I I I 



264 



3NSOOCID: <WO 02079492A2_I_> 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
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960 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



wo 02/079492 
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10 



15 



CCGSGTGGGG 
CAGAAGGGCT 
TGTCCGTCCT 
TGGGGGCTAC 
CAACTTGGTG 
CGTGGTGGCC 
ACCGAGGCCC 
CGATGTCTAC 
AACCCCGCCG 
TAATAGTTCC 
ACCTTGACAT 
TAAATCACAA 
CC6TCCCTAA 



CCTCGGGATG 
TTCGAGGAGG 
CATAGTCACC 
TACCCAGGGA 
GAGAATAGAA 
GCCTTCTGCT 
CTCACCACGG 
CAGACAGAGG 
ACCCCACGCG 
TGGGGGTCAC 
TTCGATGTGC 
CAGTCTCAGA 
GAGTTATCCC 



CAGGCGCCGG 
AAGAAGACGT 
GTCGGGCTGG 
TCATTCTCGG 
GGCAAATGCT 
GCGCCATCGT 
GAAGATGCCA 
TGAGCAGGAG 
GTTTTCCCTG 
CCCT6CAGCT 
TGTATTTCAC 
AAACAACCGC 



TGCCCGGGCC 
CGCTCTGGTT 
CTGCATCAGC 
CTTTGGATCT 
GGTGGCAGCG 
GGACGGCGTA 
GTTTTACTCC 
CACTGAGATT 
CACATAGGC6 
GGTGAACCGT 
TCTG6AGTCA 
ACCACCCCGC 



CCTGGGCCTG 
TGTGGGGTCT 
AGGACGGAGA 
TTCTTAGGAA 
ATCGTGTTTA 
TTTGCAGCAC 
A6TGGGGT6G 
CATGTGGGTT 
TGGTCTGAAT 
TGAT6CCCCC 
GAGTTCTGGA 
AATCCCACCA 



CTGGACCCCG 
CTGCTGCTGG 
ATGTGACCGT 
TTATTGGCAT 
TCAGTTTTGG 
AGCACATTGA 
GGTACTTGTA 
TTGCTCAGCT 
ATTTG6ATTC 
TGTGTAAGGG 
CTTGCTTCAT 
AAG66GCX3CG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



Seq ID No: 181 Protein sequence; 
Protein Accession #: none foiind 



20 



1 11 21 31 41 51 

I I I I I I 

RVGPRDA6AG ARAPGPAGPR RRAFEEBBDV AIiVCGVSAAG VRPHSHRRAG CISRTEHV 



25 



Seq ID NO: 182 DNA sequence 

Nucleic Acid Accession #: AK001579.1 

Coding sequence: 1150-2637 (underlined sequences correspond to start and stop codons) 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



1 
I 

TTTTCTCTGC 
CCCCCATCCC 
GCCACCAGCT 
CCCATACAGC 
ACCTCTTCCT 
TGCG6CG6CT 
ATTTGGTCCT 
TCACGGCATG 
AGCA6CAGAT 
CCCAGCATGG 
TGAGACTCCT 
ACTTTGTGGA 
TGACCTCTGC 
AGAGCCAAGG 
CTCCGCCCCA 
GGAAGGGGGC 
AGCTGCCACT 
CCCCAGAAGA 
AACCGCCGCA 
CTAAACCAGa. 
GATGGGCGAG 
GTCTTTGATA 
ACCTGGAAGG 
CAGCAGCTCC 
CTGACTAACC 
TTT6A6ATTC 
6A6CAGGCTT 
AAAGTCCCCC 
GTGG6GCTGT 
AGGTTCTTTC 
CCAGAACGGG 
AAGCCCCCAA 
TGCACTGACG 
GACCAGCAGC 
GGCACTATGC 
AATCAGACCC 
TCATCCCAGG 
GTGTATGAGG 
ACCACACGGG 
CAACCCTTTC 
CCAGGCCCCC 
CAGGAGCTCA 
TCCCAGCCAT 
TTCCCCACCC 
TAG6ACCAGC 
GTTGCAGCTT 
GA6CACT6GA 
G6GCCCA6CC 



11 

I 

TTTTCGCTAC 
CCTTTCTCCT 
6CTGGGCCCC 
CCCGGCCCCT 
GTGCTCAGCG 
ACA6GAGATC 
GGT6GAGACA 
GAACX3CAGCC 
6AGCCGGGGT 
6CTCCGGCT6 
GGCTGAGTTC 
GGATGTCACT 
ACGGTTGCTG 
CCCAACCAGG 
TCACTTCCCA 
AGAGACACAT 
CTTTCTTCCC 
ATCAGCGCCT 
CACTGGCCAC 
TGTGCACGCG 
GGGAGCACGA 
TCGATTCTGA 
ACGTGCAGCT 
CAGACAACT6 
AGGTACTGGA 
GCGAGCATGG 
TACAAT66T6 
TGGCCCAAGC 
TGCGGTGTCG 
TGCTGCGTGG 
AGTGGCCTTT 
CACCGTGGGG 
AGGATGAAAT 
CAGTGGTCTT 
CTTTGCTGCC 
TGCGGCGACT 
GGTCTGTGGA 
AAGTAGGGGC 
AGTGGACAGT 
TCTCCAAGTC 
CTTCAAAGAG 
GCAGCCTCAT 
CCAGCCCCCA 
AACCCCCAT6 
AGTCTGAGA6 
CCTCTGCCCT 
CTAAAGGCTT 
CATTTATCTA 



21 
I 

CCCGGTCACT 
GTCCTCCCCC 
GGGCTGCT6C 
GGTCTCTGGC 
CCGGGCCCAG 
AGTGTGGTTT 
GGAAGGACCC 
ATTGGGGGCG 
GACATCCCCA 
GAAGGTGTAT 
CGTCGGGATG 
GACACACTCA 
CCTCGCTGGA 
ATCTCTGCCT 
CCCTGGGGGA 
CCATCCTGCA 
GGGTCCTCCC 
GGAGAAATAT 
CCTCATTGGG 
GAACTTGGCT 
GGTGCGAGTG 
CCAGGTAGCT 
GTCTCAGGCT 
TGTCACCCTG 
GATGCGGGGG 
GGAGCTGGAG 
CCAGCTCCCA 
TGGCTGCCTC 
TGAGGAGCCA 
CCGCTGCCTG 
GGAAGGTGCC 
CTTCACATTG 
GTGGGATTGG 
ACGACGCCAT 
TATCCGTGGG 
ACACAACCGG 
GGAGCAAGAG 
CTTCCCTGAG 
GAAGCCAGAG 
T^GCACCCTT 
CAGTCCCCAG 
CCTGAGGAAA 
ATCCCCCAGC 
CACTTCCAGT 
GGTAGGTACC 
GGCTGGAAAG 
CA6TCK3CT6C 
TACCATGAGG 



31 

1 

CTCATTTCTC 
TGCCTCTACA 
GGCTGGGCCG 
TGTCAGGGTT 
GCCCCCCAGC 
CTGCAGCTGA 
TGTATCTGCA 
CGGCTGGTGG 
TCATCGTGGA 
ACCGGAAAGG 
CCCGGTCGGT 
AACGCTTCTT 
GGGAGGCTGC 
TCCCCCACCA 
TCATCCAGAG 
TTTGTGCCTA 
CAACCCTCCT 
AAAGATGTGA 
CATCTCTATC 
CTGCTGTTTG 
CTGCAAGAGC 
CAGATTGACT 
G6AGACCTCA 
AAGGTGTCCC 
ACA6CAGCTG 
CXMCCACTGC 
GAGCCCTGCT 
TTCACAGGTA 
CCTCGCTT6C 
CTGCTGCTCA 
AAGGTCTACC 
ATACTAGAGA 
ACCACCAGCA 
TCCTCCTCTG 
GATGACAGTG 
AGGACCCTGT 
GAGCTGGAGG 
TTGATCCAGG 
T^CCCCCTCA 
GGCCAGGAGG 
GCACGGGGGT 
GGAGAGACCA 
CCCACTGGCC 
CCACCCTCCA 
AGAAGACCCA 
ACTCCA6AAT 
GTGTCCCAGG 
TAACTGAAGT 



41 
I 

TCCCCTATTC 
GTGGTTCTCC 
CCTATGGCT6 
TGGCCTCCTT 
CCCTGAGGAC 
CACCCCAGAT 
AGGAGA6GGC 
GGGCG6CACA 
TGCCTGCATC 
GGGCGCTCGT 
GAAGCTCCGA 
TCGTGAGCTC 
TGGTATTCCT 
GAATCCATGG 
ACTTGGCTCA 
AAAATCCCTC 
CCATTCCATC 
TTGGCTGCCT 
GGGTGCAGAA 
CACCCAGCGT 
TCATT6ATGG 
TGGAG6TCAG 
TCATGQAAGT 
CAACCCTGAC 
GGATG6ACTT 
ATCCCAAGGA 
CAGCTTCCCT 
TCCGACGTGA 
TGGGAAGCCG 
AGGAGAAGAA 
TGGG/U^TCCG 
AGATGCACCT 
TCCTTATIAGC 
ACCTTGCCCG 
GAGCCACCCT 
CCATGTTCTT 
AGCCTGTGTA 
ACACTTCTAC 
CCAGCCAGAA 
AGAGGCCACC 
CCCTAGAGGA 
CTGCA6GCCT 
TTCCAACACA 
GCCAGCCCCT 
GAAACTCTTA 
CCAGTGTGGT 
ACA66TCAT6 
AAGGAGA6CA 



51 
I 

CTTGTCTCTT 
CCGCTGAGCT 
CGGTCCCCCT 
CGTGGTGACC 
ATGGTGCATC 
AAGAAAGAGC 
CGGCTGGACT 
GGGCTGCAGG 
AGTTTTGTTA 
GCCCGCAGCC 
CCAGGGGAGC 
GATGACCCTG 
AAGATCCCTG 
TTTGGCAGCC 
GGGGGAGGTG 
CCTCTGTACC 
CCCAGAGCTG 
GCCGCGGGTC 
ATGTGCGGCT 
GTTCCAGACG 
CTACATCTCT 
TCTTATCACC 
TTATATA6AG 
TGCTGAGGAG 
6TGGGTGACT 
AAAGGTCTTA 
GCTCTTGAAA 
6AGCCCACGG 
CTTCCAGGAG 
AAGCTCTAAA 
CAAGAAGTTA 
CTACTTGTCC 
CCAGCACGAT 
TCAGAAGTTT 
CCTCTCTGCC 
TCCAATGAAG 
CGAGGAGCCA 
CTCCTTCTCC 
GTCATTGGAT 
T6AGCCCCCT 
ACAGCTGCTC 
GGGAAGTCCT 
GACACCTGGC 
CACATGACCC 
TCGTGGCACT 
GCT6T6GAAG 
GCCCCTCTCT 
GTGAATGTCA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
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AACTGTGTTT CTTAGAGCCA TAAGCCCCAC ATATTATCCC T6AACAAGGG CAGCTCCTGC 2940 

TTTATATATT TGATACGTAG GGGTTCCATG AGAGATTTTG G6TTTTAAAG GAATG6TTTT 3000 

ACTGCATTAA AGAAAAAAAA TGCTTTGGAA ACCAGAGGCC TGGGTGATGT TAAAGTCTAT 3060 

CCTGTCCCAC TTCCTACATT CTGGGACTAC CGTGAAGCCT GGAGTAGGGA GAGCGAGTTT 3120 
3 GGGAGCTGGG ACTCGGGGAG TCAAAAATAG ATGAGTAATT GTCAATAAAC CTGGGAACC 

Seq ID No: 183 Protein sequencer 
Protein Accession #: ;aC0O1579.1 

10 

1 11 21 31 41 51 

I I I I I I 

MSLTHSNASF VSSMTLPLHG CCLAGGRLIiV FLRSLRAKAQ PGSLPSPTRI HGLAALRPIT 60 

SHPGGSSRDL AQGEVGRGQR HIHPAPVPKN PSLCTSCHSP PPGPPQPSSI PSPELPQKNQ 120 

15 RIiEKYKDVIG CLPRVNRRTL ATLIGHIiYRV QKCAALNQMC TRNIiALLFAP SVFQTDGRGE 180 

HEVRVLQELI DGYISVFDID SDQVAQIDI*E VSLITTWKDV QLSQAGDLIM EVYIEQQLPD 240 

NCVTLKVSPT LTAEELTNQV LEMRGTAAGM DLWVTFEIRB HGELERPIiHP KEKVIiEQAliQ 300 

WCQLPBPCSA SldjhKKVPhh QAGCLPTGIR RESPRVGLLR CRBEPPRMiG SRFQERPPLI* 360 

RGRCUiLIiKE KKSSKPEREW PLEGAKVYLG IRKKLKPPTP WGFTIiIIiBKM HLYIiSCTDED 420 

20 EMWDWTTSIL KAQHDDQQPV VLRRHSSSDL ARQKPGTMPL IjPIRGDDSGA TLIiSANQTIiR 460 

RLHNRRTLSM FFPMKSSQGS VBEQEBLEEP VYEBPVYEEV GAFPELIQDT STSFSTTREW 540 

TVKPENPLTS QKSLDQPFLS KSSTLGQBER PPEPPPGPPS KSSPQARGSIi EEQIiLQEItSS 600 
LILRKGETTA GLGSPSQPSS PQSPSPTGIiP TQTPGFPTQP PCTSSPPSSQ PLT 

25 Seq ID NO: 184 DNA sequence 

Nucleic Acid Accession #: none found 

Coding sequence: 1-81 (underlined sequences correspond to start and stop codons) 

30 1 11 21 31 41 51 

i - I I I I I 

GTAGAGTTAG TGTCAATGTG CTTAGAATAT ACCAAATTCA TAAACATTTT CTCTAAAAAA 60 

GTATTAAGCT TAAAAAGT TA A TTCAGTTTA AGGAATATAA ACCAAATTAT TTTATATTTG 120 

AATC TCAACA TAAGAAGTCA AAATGTAATG CTGCCAGATA ACAATATCAA AGGTATTTTT 180 

35 CTTTCTCTAT AATTTCATCA GTATGTCCTC TCCCTTTTCT CCTATTTGTC AAATTTTAGC 24 0 

AACCCTAACT CTGCTAATTA TAAGCTAGGC AAGTAATCTT GGACAAGTTA TTTGACCTCT 300 

CACTGCACCA GCTTTGTTAT CTGTAAAATG ATGATAATAC C7VACACCTTC TTCTTGGGGT 360 

ACTGAAGATG AGAGAACATG ATATGTGTAA AGTGCCTTCC ACAATACCCA GAACATAGCA 420 

AACATGTAAT GAATGTAGTA ATAGTAATTA TTTTATTTTC TTTTGATTCA GTTGGGACTA 480 

40 TGTTCAGCTG TAACAGAATA CCCAAAATAA CAGTTTTAAA CAAATTT^G TTTTGTTGTG 540 

AAGTTTTGTT ACGAATTCAG ACAATCCAGG GCTTTTATAG AT6CACCAGG ATCAGCAGGT 600 

ACAAAGGCAT GTTTCCTGAT TTCTGCCAGT CTCAAT6CAT GGGTTGCAAT CCAGAGTCCA 660 

GGATGGCAGT TCCAGCCCTG GTTACGCCCA TATTAGCACA CAGAAAG3WI GA6AAAGGGA 720 

TGTGCCTCTT CACTTTAATC ATAGCTCCCA CTAGATGCAC CCACTACTTC TGCTGATACT 780 

45 CCATTAGCTA ATGCTTGCTT ACATGGTCAC ACTTAGTTTC CAGAGAGACA TGTCTGGACA 84 0 

GTCATGTGCT CAATTAATAT CCAAGTGTCC AATTACTGAG AAAAA7AGAA ACTAGCACCT 900 

TTGCTTGGTT GCATTCTTCT TAGCATAAGC CACATTCTTT TTATGAAGTT GTCCTCAGTT 960 

ACTTGGATGC CTCAGTTGTC CTTTCATTTA GAAATGCTCC TTGGACATCC TGAATCTGAC 1020 

TTCTTTTGTC ATCAGCACCA TCACTACCAC TGCCTTCTTC AAAGCCACCA CGTTCTGTCC 1080 

50 CAGGATGGTT GCAACAACCA CCATAGGGAC TTTTTGCTTC TACTTCCACA CAATAGCCAG 1140 

AGTAAGCTTT TGAAT^TGTA GGTCAGATCA TGTCTCTCTC TTCTCTTCT^ AACCCTCCGA 1200 

TGGCTTTTCA TATTACTCAA- AAGAAAACCT AAAACTTTGC TGTGAGATCT ATGTGACCCG 1260 

GCTTATTCTT CCTCTTACTT TATCTCTGTA TTGCTCTTCC TCACTCTACT CCAGCCATCC 1320 

CACCTCCTTG CTGCTTGTCC TATACTCCTA AAAGAAGTTC AGTCTTCCCT TATGATATTT 1380 

55 GCACTTAAAA TAGAAAAAAA AAAAAAAAAA AGCTCAGAGA GGCTGAGTTG TCCAAGGTCA 1440 

TGCAGGTTAG AAGTCATGGA -GCTGCGATCT AAATCCATGT CAGTCTGACT ATGAGTTCTG 1500 

CACC6TTCTA TTCAACCCCA TTGCCTAGAG GTGCTTGATT GCTCAATAAT AGATTCCATG 1560 

GACACAGTCA GCTCTTTCTG AGAAAAGGCA GCTCAGCATT TCCATGAGAT CCGCACATCC 1620 
TTTTGCAGAA GAAAAC 

60 

Seq ID No: 185 Protein sequence ; 
Protein Accession #: none found 



65 1 11 21 31 41 51 

I I I I I t 

VELVSMCLEY TKPINIFSKK VLS1.KS 

Seq ID NO: 186 DNA sequence 
70 Nucleic Acid Accession #: NM__002203.2 

Coding sequence: 43-3588 (underlined sequences correspond to start "smd stop codons) 

1 11 21 31 41 51 

I I I I I I 

CTGCAAACCC AGCGCAACTA CGGTCCCCCG GTCAGACCCA GGATGGGGCC AGAACGGACA 60 
GGGGCCGCGC CGCTGCCGCT 6CTGCTGGTG TTAGCGCTCA GTCAAGGCAT TTTAAATTGT 120 
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TGTTTGGCCT 

CAGTTTGGGT 

TCACCCTGGA 

CTATCCACTG 

ACTGAGATGA 

GGTTTTCTCA 

GGTGTGTGTT 

CAGCCCTGCC 

CCTTGGGATG 

ACAAAGACAC 

AACACATATA 

GGGGACCTCA 

GCTTCTGGTG 

CATGATGGTT 

TTTGGCATAG 

AAAGAAATAA 

GAAGCAGCTC 

ACTGTTCAAG 

TACTCTTCTC 

ACCATTGTCC 

ATTCT6CAGG 

G6AGAAA6CA 

CTATATAGTG 

ATTGGCTCCT 

GACGTGCTCT 

GTCTACCTGT 

GAGGGCATTG 

GATGGCTTTA 

TACATTTACA 

TCCGATGGAG 

GATTTAAATG 

CTCTGGTCAC 

ACTTTGGTCA 

CCTACTAAGC 

TTTTCATCCA 

AAGAATATGG 

CCCTCTGATG 

ACTAGCCCTG 

AAA6ACT6TG 

CCA6CTGCTC 

6TAACACTGA 

6AAAACTTGT 

GTGGCTGCAT 

CAACAGGT6A 

TCTCTCAG TT 

AACCTCAAAA 

AATTTTTATG 

GTTGGTCCAA 

GCAACTGTAA 

ACTGGGGTGC 

AAAATAGGAC 

GAATTGAACT 

ATGAAAGGAG 

TCAACGTTCC 

ATATATGTGA 

AAAGCCGAAG 

TTAGCTCTGG 

ACCAAAAATC 

ACCT6CAGTG 

TTTTTAAATC 

AAAACTGCAG 

GTAG6GAAAT 

AACTGGCTG6 

CAAGCATGAC 

TCTCTTTAAA 

GTACTTCCAC 

ACA6GTTTTT 

TAATTTTATT 

CTTCCACACC 

TACCTCCTAT 

TTTTGTTTAA 

TGTGCCAGAG 

CTTCTAGGAT 

TTCTTTGGCA 

TATTATAGAA 

TTACCCCTCA 



ACAATGTTGG 

ATGCAGTGCA 

GTGGCTTTCC 

CCACATGTGA 

AAACCAACAT 

CATGTGGTCC 

CTGACATCAG 

CTTCCCTCAT 

CAGTAAAGAA 

AGGTGGGGTT 

AAACCAAAGA 

CAAACACATT 

GGCGACGAAG 

CAATGTTGAA 

CA6TTCTT66 

AAGOSATCXSC 

TACTAGAAAA 

GAG6AGACAA 

AAAATGATAT. 

AGAAGACATC 

ACAGAAATCA 

CTCACTTTGT 

TGAATGAGAA 

ATTTTGGTAG 

TGGTAGGTGC 

TTACTATCAA 

AAAACACTCG 

ATGATGTGAT 

ATGGTCATCA 

CCTTTAGGAG 

GGGATTCCAT 

AAAGTATTGC 

ACAAGAATGC 

AAAACAATCA 

GAGTAACCTC 

TAGTAAATCA 

TT6TCAACTC 

CCCTTGAAGC 

6TGA6GATG6 

AA6AACAACC 

AAAATAAAAG 

TTTTTGCATC 

CTCA6AAGTC 

CTTTTACTAT 

TCCAAGCCTT 

TTCCTCTCCT 

AAATCTCTTC 

AATTCATCTT 

TCATCCACAT 

AAACAGACAA 

AAACATCTTC 

GCAGAACTGC 

AATACTTTGT 

AGACAGTACA 

TTGAAGATAA 

TACCAACAGG 

TTGCAATTTT 

CAGATGAGAT 

6GAACCGGCA 

CCATATTTTT 

GTCA6TTT6G 

AATA6G6AAA 

CCCAGAGTTT 

AACTTTTAAA 

ATATTTGTCT 

TTGTGTATAT 

TC7VATTTATG 

TATAAACTAG 

CCATCTTGCT 

ATGTCCATTT 

AACTCAGAAT 

GAAGGAAAAG 

TTGTTTGGCT 

ACCTTCCTCC 

GCCCTCTACA 

TCCAAAGTTC 



TCTCCCAGAA 

GCAGTTTATA 

TGAGAACCGA 

AAAACTAAAT 

GAGCCTCGGC 

TCTGTGGGCA 

TCCTGATTTT 

AGATGTTGTG 

TTTTTTGGAA 

AATTCAGTAT 

AGAAATGATT 

CGGAGCAATT 

TGCTACGAAA 

A6CT6TGATT 

GTACTTAAAC 

TA6TATTCCA 

GGCTGG6ACA 

CTTTCAGATG 

TCTGATGCTG 

TCATGGCCAT 

CAGTTCATAT 

TGCTGGTGCT 

TGGCAATATC 

TGTGCTGTGT 

ACCAATGTAC 

AAAGGGCATT 

ATTTGGTTCA 

TGTTGGTTCA 

GGGCACTATC 

CCATCTCCAG 

CACCGATGTG 

TGATGTAGCT 

TCAGATAATT 

AGTGGCCATT 

CAGGGGGTTA 

AGCACAGAGT 

TTTGGATTTG 

CTATTCTGAG 

ACTTTGCATT 

CTTTATTGTC 

GGAAAGT6CA 

ATTCTCCCTA 

TQTTGCCTGC 

TAACTTTGAC 

AAGTGAAAGC 

GTATGATGCT 

GGATGGGAAT 

CTCCCTGAAG 

CCCTCAGTAT 

GGCTGGTGAC 

TTCTGTATCT 

TTCCTGTAGT 

TAATGTGACT 

GCTAACGGCA 

CACTGTTACG 

AGTTATAATA 

ATGGAAGCTC 

TGATGAGACC 

GCATCCCAGC 

TTTATCATGT 

ATGAA6AAAT 

ATACCTATTT 

ACATTCTAAT 

GAAAAATAT6 

TTAAACAGCA 

TTTAATGAAT 

CTGCTCATCC 

GTAAAATTTG 

CTAATGATCA 

AAGTTAGGAG 

ATAACATTTA 

GAGGAAATTT 

GACTGGCAGT 

TCCCTTACTG 

GCCTGACTTT 

CCACTCCTTC 



AATCCAAAAG 
ATGGGAGATG 
TTGCAAACTT 
TTGATCCTCA 
CAGCAATGTG 
CAGCTCTCAG 
GTTGTGTGTG 
AAATTTGTAC 
GCCAATAATC 
GTAGCAACAT 
CAATATGCAA 
GTAATGGTAiG 
GATCAATGCA 
AGAAAC6CCC 
ACAGAAAGAT 
TTAGGAGAAC 
6AAAT6TCAC 
6GTGCAGTG6 
TTGATCTTTC 
TTAGGTTACT 
CCTCGGGCAA 
ACGGTTATTC 
TCAGTTGATG 
ATGAGTGACC 
TTGGGTCAGC 
GCAATTGCAG 
CCACTAGAAA 
CGCACAAAGT 
TACTTTGGGA 
TCTATTGGTG 
ATAGAAGCTT 
CTCAAACTCT 
GTATATAACA 
TTTAAAGAAA 
TGCCCCGAGC 
CGT6TGGACA 
ACTGCCAAGG 
TCTGATCTAG 
A6CAACCAAA 
TACAACACTG 
CCGGTT6ATG 
GATGTAGGCT 
TTCAATCTTC 
CAAGAAGAAA 
GAAATTCACT 
GTTCCTTCAA 
GTAACAACAG 
ACCAAAGAAA 
ATCAGTTGTA 
TTCAAAAGTG 
AATGTTACCT 
ACCAGAATTT 
GCTGCAGAAA 
ATTCCCCTGA 
GGAAGTATAA 
G6CTTCTTCA 
ACAGAGCTCA 
CAGG6TTTGC 
C6TAGGTAAA 
TGTG6GGGGT 
TATATGATGG 
TTGCATTGTG 
ATACTCTCA6 
ACTACAGAA6 
ATTGATGTTA 
A7UVGTTGCCA 
TTGTTGGTTC 
AAACATGCTT 
AGGGGGCGAT 
TGTAAAATCC 
CCTTTCTCTT 
AACCTAGTGA 
AACCACTCTC 
CTCTCCAGCG 
AGGACAGCTG 



TTTCCGGTCC 

GCAACTGGTT 

TGTATAAATG 

CAACAAGCAT 

CCAGGAACAT 

GGAATCAGTA 

CCAGCTTCTC 

ATGAATCAAA 

7AGGCCTTGA 

CAAGAGTTGT 

CCCAGACATC 

GAAAATATGC 

TTGTAACTGA 

ACCATGACAA 

TTGATACTAA 

ACTTTTTCAA 

AAATTTTCAG 

AAGTGGGATT 

GAGCTTTTGG 

CTAAACAAGC 

CTGT6GCTGC 

ATTATACCGG 

AGGCTCACCG 

TGGATAAAGA 

TAAAGAAAGA 

ACCAATTTCT 

CTCTTTCAGA 

ATCAGAATTC 

ATTCCCAGAA 

GGTCCTTGGA 

CCTTTGGACA 

CATTCACACC 

GCTTCAGTGC 

TCACACTTGA 

ACAATGAAAG 

ACATCATTTA 

TCAGTCTGGA 

TCTTCAGTAT 

TCCTAGATGrT 

ACAAAA66TT 

GAATTGTTGT 

GGACAGAA6T 

ACCCTGCTTT 

AAAACCTTCA 

ACAAGGCTGA 

TAACAAGATC 

TCGTGCACAG 

GAAGTGTTCC 

AGAACCCACT 

ATGCAGATAT 

AAAATTTCAG 

GCTGGTTGAA 

GGAACGGGAC 

TCAACACCTA 

TGATAATGAA 

TTGCTGGAAT 

AAAGAAAATA 

GTAGCTGAAC 

TGTTTGCGTG 

CTAACCTGGT 

GGGGGAGGT6 

GGGAAAAAAA 

TCAGAAACAT 

ATTTTAAGGG 

TGGAA6TGCT 

ACAAGAGGGG 

CAGATGATAC 

CTTTTATACC 

GAATAACTGA 

ATAGAGACTA 

CATCTGCTAG 

TTAGGAGGCA 

ATTTTTGAAA 

CCACCTCCTG 

GTCCAAAGTT 

CTGTGCATTA 



TTCAAGTGAA 
ACTGGTTGGT 
TCCTGTTGAC 
TCCAAATGTT 
GGGAACTGGA 
TTACACAACG 
ACCTGCAACT 
TAGTATTTAT 
TATAGGCCCC 
GTTTAACTTG 
CCAATATGGT 
CTATTCAGCA 
CG6TGAATCA 
TATACTGAGG 
AAATTTAATA 
TGTGTCT6AT 
CATTGAAGGT 
CA6TGCAGAT 
CT66AGTGGG 
CTTTGACCAA 
AATTTCTACT 
CCAGATAGTG 
AGGTGACCAG 
CACCATTACA 
GGAAGGAAGA 
TGAAGGCCCC 
CATCAACATG 
TGGAGCTGTA 
AATCTTGGGA 
TGGCTATGGA 
AGTGGTTCAA 
AGAAAAAATC 
AAAGTTCAGA 
TGCAGATGGA 
GT6CCTGCAG 
TATACAGGAG 
AAACCCTGGC 
TCCTTTCCAC 
CCX3ACAAATA 
AACATTTTCA 
TGATTTTTCA 
AACATGCCAG 
AAAGAGAGAA 
GAATCAGGCG 
TAATTTGGTC 
TACC/ikACATA 
TTTTGAAGAT 
AGTAAGCATG 
GATGTACCTA 
CAATCCACTG 
GCACACCAAA 
AGACGTTCAC 
TTTCGCATCA 
TAACCCTGAG 
ACCTGATGAG 
CCTTTTGCTG 
TGAAAAGATG 
CAGCAGACCT 
CATGGATTTC 
ATTTTAA6AG 
C6GGGGGCAG 
GTAATCTTTA 
GAAATGCTTC 
GGAAAACTGT 
TGATATGTAA 
AAAACAT^AAC 
TTCCAAGTGA 
ACGGCTGCCC 
GCTTAGAGTA 
A6GCACAAAA 
AAGCCCATCC 
CAACAGTTCT 
GATGAGTAAT 
GTGGTACCAT 
ATCCCCTCCT 
GATATTAGGG 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
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900 
960 
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1080 
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1500 
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2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2860 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
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GGGAAAGTCA 
TTCAGGGAGC 
CTAAATGTTG 
CCAGAAGTTA 
GCTGTCTTGT 
GTTCAAAAGG 
TTTTAACAAG 
ATTTCTACTT 
CTTAGATTAA 
AAACTTCAGA 
TTTAGTTTTA 
CAAGAATTTG 



TCTGTTTAAT 
TATTTTCATT 
GAATGTTATG 
CAGATGAGGC 
TTCTGAAGTA 
TAGATCCTGA 
CACCCCAGTC 
TTTGCACCTT 
AATTCACAGA 
GTCCTCATTA 
AAAGTCTATG 
ACTT6GAAAA 



TTACACACTT 
TAGTGCTAAA 
GGATGTAAAC 
ACTGGAAACC 
CTTTTTCTTC 
GATGATTTGG 
ACTAGGATGC 
ATTTTCTCTG 
CACTACATAT 
TAAAATGG6A 
ATCTGATCTG 
G 



GCATGAATTA 
CAAGTAAGAA 
AATGTAAAGT 
ACCACCAAAT 
CACAAGAGTG 
TCAGATTGGG 
AGATGGACCA 
TTCCTGAGCC 
CTAAAGCTTT 
AGACTGAGCT 
GACTTCCTAT 



CTGTATATAA 
AAATAAGCTA 
AAAACACTCT 
TAGCAGGTGC 
AATTTGACCT 
ATAAGGCCCA 
CACTTTGAGA 
CCCACATTCT 
GACAAGTCCT 
6GA6TTCAGC 
AATACAAATA 



ACTCCTTAAC 
GAGTGAATTT 
CAGGATTTCA 
ACCTTCTGTG 
AGGCAAGTTT 
GCAATCTGCA 
AACACCACCC 
CTA6GAGAAA 
TGACCTCTAT 
AGTGATGCTT 
CACAATCCTC 



4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
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Seq ID NO: 187 Protein aeouence: 
Protein Accession #: NP 002194 .1 
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MGPERTGAAP 
NWLLVGSPWS 
RNMGTGGPLT 
ESNSIYPVJDA 
QTSQYGGDLT 
HDNILRFGIA 
IFSIEGTVQG 
KQAFDQILQD 
AHRGDQIGSY 
QFLEGPEGIE 
SQKIL6SDGA 
FTPBKITLVN 
NERCLQKNMV 
FSIPFHKDC6 
ZWDFSENIiF 
NIiQKQASLSF 
VHSFEDV6PK 
ADZNPIiKIGQ 
NGTFASSTFQ 
AGILLLIiAIiV 



11 

I 

LPLLIiVIALS 
GFPENRMGDV 
CGPLWAQQCG 
VKNFIiEKFVQ 
NTFGAIQYAR 
VIiGYIiNRNAL 
GDNFQMEMSQ 
RNHSSYLGYS 
FGSVLCSVDV 
NTRFGSAIAA 
FRSHLQYF6R 
KNAQIILKIjC 
VNQAQSCPBH 
ED6LCISDLV 
FASFSIiPVDG 
QAIjSESQEEN 
FIFSLKVTTG 
TSSSVSFKSE 
TVQIiTAAAEI 
AILWKLGFFK 



21 

I 

QGILNCCIjAY 
YKCPVDLSTA 
NQYYTTGVCS 
GliDIGPTKTQ 
KYAYSAASGG 
DTKNLIKEIK 
VGFSADYSSQ 
VAAISTGEST 
DKDTITDVLL 
LSDINMDGPN 
SIiDGYGDLNG 
FSAKFRPTKQ 
IIYIQEPSDV 
LDVRQIPAAQ 
TEVTCQVAAS 
KTUSNLVNIiKZ 
SVPVSMATVI 
NFRHTKELNC 
NTYNPEIYVI 
RKYBKMTKNP 



31 
I 

NVGLPEAKIF 
TCEKLNLQTS 
DISPDFQIiSA 
VGIilQYANNP 
RRSATKVMW 
AIASIPTERY 
NDILMIiGAVG 
HFVAGAPRAN 
VGAPMYMSDL 
DVIVGSPIiEN 
DSITDVSIGA 
NNQVAIVYNI 
VNSIiDLRVDZ 
EQPFIVSNQN 
QKSVACDVGY 
PLLYDABIHL 
IHIPQYTKEK 
RTASCSNVTC 
EDNTVTIPIiM 
DEZDETTELS 



41 

I 

SGPSSEQFGY 
TSIPNVTEMK 
SFSPATQPCP 
RWFNLNTYK 
VTDGESHDGS 
FFNVSDBAAL 
AFGWSGTIVQ 
YTGQIVIiYSV 
KKEEGRVYIiF 
QNSGAVYIYN 
FGQWQLWSQ 
TLDADGFSSR 
SIiENPGTSPA 
KRIiTFSVTIiK 
PAIiKREQQVT 
TRSTNINFYE 
NPLMYLTGVQ 
WIiKDVHMKGE 
IMKPDEKAEV 



51 

I 

AVQQFINPKG 
TNMSLGIiir.T 
SLIDWWCD 
TKEEMIVATS 
MLKAVIDQCN 
LEKAGTLGEQ 
KTSHGHLIFP 
NENGNITVIQ 
TIKKGILGQH 
GHQGTIRTKY 
SIADVAIEAS 
VTSRGLFKEN 
LBAYSETAKV 
NKRESAYNTG 
FTINFDFNLQ 
ISSDGNVPSI 
TDKAGDISCN 
YFVNVTTRIW 
PTGVIIGSII 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



Seq ID NO: 188 DNA sequence 

Nucleic Acid Accession #: NM_002210.1 

Coding sequence: 42-3188 (underlined sequences correspond to start and stop codons) 



GGCTACCGCT 
GACGGCTGCG 
TGTGCCGCGC 
GTTACTTCG6 
TCGTG6GA6C 
TCAAATGTGA 
ATAGAGATTA 
CTGTGAG6TC 
AGATGAAACA 
TTGAGTATGC 
GAGGATTCAG 
TTTATTGGCA 
CCAATGTTTA 
TTGATGACAG 
ATGACTTTGT 
ATGGGAAGAA 
GATTTTCTGT 
CACCTCTCTT 
TGTCTCTACA 
TTGCACGGTT 
ATATTGCAAT 
ATGGAAGATC 
CTCGAMCAT 
ATGGATATCC 
CCAGACCAGT 
ACAATAAAAC 
TCT6CTTAAA 
TTCTTTTGGA 
66TCCCCAAG 
AATTGATAGC 
TTTTTATGGA 



11 

I 

CCCGGCTTGG 
CCTCGGTCCC 
CTTCAACCTA 
CTTCGCCGTG 
TCCCAAAGCA 
CTGGTCTTCT 
TGCCAA6GAT 

gaaacaggat' 
ggagcgagag^ 

TCCATGTAGA 
CATTGATTTT 
AGGTCAGCTT 
CAGCATCAAG 
CTATTTGGGT 
TTCAGGAGTT 
CATGTCCTCC 
AGCTGCCACT 
CATGGATCGT 
GAGAGCTTCA 
TGGCAGTGCC 
TGCTGCTCCA 
AACAGGCTTG 
GCCACCAAGC 
AGACTTAATT 
TATCACTGTA 
CTGCTCACTG 
GGCAGATGGC 
TAAACTCAAG 
TCACTCCAA6 
GTATCTGCGG 
ATATCGGTTG 



21 
I 

CGTCCCGCGC 
CGCGGCCTCC 
GAC6T6GACA 
GATTTCTTCG 
AACACCACCC 
ACCCGCCGGT 
GATCCATTG6 
AAAATTTTGG 
CCTGTTGOAA 
TCACAAGATA 
ACTAAAGCTG 
ATTTCGGATC 
TATAATAACC 
TATTCTGTGG 
CCAAGAGCAG 
TTATACAATT 
GACATTAATG 
GGCTCTGATG 
GGAGACTTCC 
ATAGCTCCTT 
TATGGGGGTG 
AACGCAGTCC 
TTTGGCTATT 
6TA6GAGCTT 
AATGCTGGTC 
CCTGGAACAG 
AAAGGAGTAC 
CAAAAGG6A6 
TACATGACTA 
GATGAATCT6 
6ATTATAGAA 



31 
I 

GCACTTCGGC 
CGCTTCTTCT 
GTCCTGCCGA 
TGCCCAGCGC 
AGCCTGGGAT 
GCCAGCCAAT 
AATTTAA6TC 
CCTGTGCCCC 
CATGCTTTCT 
TTGATGCTGA 
ACAGAGTACT 
AAGTGGCAGA 
AATTAGCAAC 
CTGTCGGAGA 
CAAGGACTTT 
TTACTGGCGA 
GAGATGATTA 
GCAAACTCCA 
AGACGACAAA 
TGGGAGATCT 
AAGATAAAAA 
CATCTCAAAT 
CT^TGAAAGG 
TTGGTGTAGA 
TTGAAGTGTA 
CTCTCAAAGT 
TTCCCAGGAA 
CAATTCGACG 
TTTCAAGGGG 
AATTTAGAGA 
CAGCTGCTGA 



41 
I 

G ATG GCTTTT 
CTCGGGACTC 
GTACTCTGGC 
GTCTTCCCGG 
TGTGGAAGGA 
TGAATTTGAT 
CCATCA6TGG 
ATTGTACCAT 
TCAAGATGGA 
TGGACAGGGA 
TCTTGGTG6T 
AATCGTATCT 
TCGGACTGCA 
TTTCAATGGT 
GGGAATGGTT 
GCAGATGGCT 
TGCAGATGTG 
AGAGGTGGGG 
GCTGAATGGA 
GGACCAGGAT 
AGGAATTGTT 
CCTTGAAGGG 
AGCCACAGAT 
TCGAGCTATC 
CCCTAGCATT 
TTCCTGTTTT 
ACTTAATTTC 
AGCACTGTTT 
G66ACTGAT6 
CAAACTCACT 
TACAACAG6C 



51 
I 

CCGCCGCGGC 
CTGCTACCTC 
CCCGAGGGAA 
ATGTTTCTTC 
GG6CA66TCC 
GCAACAGGCA 
TTT6GAGCAT 
TG6AGAACTG 
ACAAAGACTG 
TTTTGTCAAG 
CCTGGTA6CT 
AAATACGACC 
CAAGCTATTT 
GATGGCATAG 
TATATTTATG 
GCATATTTCG 
TTTATTGGAG 
CAGGTCTCAG 
TTTGAGGTCT 
GGTTTCAATG 
TATATCTTCA 
CAGTGGGCTG 
ATAGACAAAA 
TTATACAGGG 
TTAAATCAAG 
AATGTTAGGT 
CAG6TGGAAC 
CTCTACAGCA 
CAGTGTGAGG 
CCAATTACTA 
TTGCAACCCA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
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TTCTTAACCA GTTCACGCCT GCTAACATTA GTCGACAGGC TCACATTCTA CTTGACTGTG 1920 
GTGAAGACAA TGTCTGTAAA CCCAAGCTGG AAGTTTCTGT AGATAGTGAT CAAAAGAAGA 1980 
TCTATATTGG GGATGACAAC CCTCTGACAT TGATTGTTAA GGCTCAGAAT CAAGGAGAAG 2040 
GTGCCTACGA AGCTGAGCTC ATCGTTTCCA TTCCACTGCA GGCTGATTTC ATCGGGGTTG 2100 
5 TCCGAAACAA TGAAGCCTTA GCAAGACTTT CCTGTGCATT TAAGACAGAA AACCAAACTC 2160 
GCCAGGTGGT ATGTGACCTT GGAAACCCAA TGAAGGCTGG AACTCAACTC TTAGCTGGTC 2220 
TTCGTTTCAG TGTGCACCAG CAGTCAGAGA TGGATACTTC T6TGAAATTT GACTTACAAA 2280 
TCCAAAGCTC AAATCTATTT GACAAAGTAA GCCCAGTTGT ATCTCACAAA GTTGATCTTG 2340 
CTGTTTTAGC TGCAGTTGAG ATAAGAGGAG TCTCGAGTCC TGATCATATC TTTCTTCCGA 24 00 
10 TTCCAAACTG GGAGCACAAG GAGAACCCTG AGACTGAAGA AGATGTTGGG CCAGTTGTTC 24 60 
AGCACATCTA TGAGCTGAGA AACAATGGTC CAAGTTCATT CAGCAAGGCA ATGCTCCATC 2520 
TTCAGTGGCC TTACAAATAT AATAATAACA CTCTGTTGTA TATCCTTCAT TATGATATTG 2580 
ATGGACCAAT GMCTGCACT TCAGATATGG AGATCAACCC TTTGAGAATT AAGATCTCAT 2640 
CTTTGCAAAC AACTGAAAAG AATGACACGG TTGCCGGGCA AGGTGAGCGG GACCATCTCA 2700 
15 TCACTAAGCG GGATCTTGCC CTCAGTGAAG GAGATATTCA CACTTTGGGT TGTGGAGTTG 2760 
CTCAGTGCTT GAAGATTGTC TGCCAAGTTG GGAGATTAGA CAGAGGAAAG AGTGCAATCT 2820 
TGTACGTAAA GTCATTACTG TGGACTGAGA CTTTTATG7A TAAAGAAAAT CAGAATCATT 2880 
CCTATTCTCT GAAGTCGTCT GCTTCATTTA ATGTCATAGA GTTTCCTTAT AAGAATCTTC 294 0 
CAATTGAGGA TATCACCAAC TCCACATTGG TTACCACTAA TGTCACCTGG GGCATTCAGC 3000 
20 CAGCGCCCAT GCCTGTGCCT GTGTGGGTGA TCATTTTAGC AGTTCTAGCA GGATTGTTGC 3060 
TACTGGCTGT TTTGGTATTT GTAATGTACA GGATGGGCTT TTTTAAACGG GTCCGGCCAC 3120 
CTCAAGAAGA ACAAGAAAGG GAGCAGCTTC AACCTCATGA AAATGGTGAA GGAAA CTCAG 3180 
AAACT TAA CT GCAGTTTTTA AGTTATGCTA CATCTTGACC CACTAGAATT AGCAACTTTA 3240 
TTATAGATTT AAACTTTCTT CATGAGGAGT A7UUUVTCCAA GGCTTTACTG CTGATAGTGC 3300 
25 TAATTGGCAT TAACCACAAA ATGAGAATTA TATTTGTCAA CCTTCTCCTT ATAAATAAGT 3360 
TCAGACATAC ATTTAATAAC ATAGGGTGAC TTGTGTTTTT AGGTATTTAA ATAATAAAAT 3420 
TTCAAGGGAT AGTTTTTATT CAAT6TATAT AAGACAGGTA GTGCCTGATT TACTACTTTA 3480 
TATAAAATAG TACCTCCTTC AGTTACTGTT TCTGATTTAA TGTACGGAAC TTTATTTGTT 3540 
GTTGTTGTTG TTGTTGTTGT TGTTGTTTTA AAGCAGTCCA AATTTGGACC TTAGCAATCA 3600 
30 TGTCTTTTGT ATAGGTACTT AATGTTAATA CATATTACAC TACAGTTTAC TTTTCAGAAT 3660 
ACTAAAGACT TTATAACTGC ATGAACTTGG ATTTTTTTAA TCACTCATAT GGTAGAATTT 3720 
TATAAACACA TACATGATAC CATCCAAATT CTTGCTTTTA ATAACAAAGG TACAATATTT 3780 
TGTTTTAGTA TGAAAATCTG GTAGATCCTA TTACACTTCT GTTTATATTA AATCCACAAT 3840 
ATTTTATTAC ATTTTTAACT TGTATAAATT TTAGGTCAAA TCCTTCAAGC CAACCTATAC 3900 
35 TAAAAATTAG TTCCATJATC ACAAATGGCT CTTTTGTGTA ATTGTTTAAT TTCACCTGAA 3960 
TATCATAATG CTTAAAGCCA TATGGAGTTG GAAATTATTT CCAAAGCATA TTTATTCCAT 4020 
TGTTTTAGTC TGGCTATTTA CAGTATAT^ AAAGCATTTT ATTAAAATAC TGTGTAGTTC 4080 
TTTGAGATAG TTGCTTATGC ATATAGTAAG TATTACATTC TTAGAGTAGA GCAGAGTTTT 4140 
TAGTTAGTAT TAATTTATTT TCCTCCATTC ATGTACTTTT CCTTATATTT CCAAAACTGT 4200 
40 TACTGAGAAT GGGTCAAGAT CAGTGAGAAA TCTTTACAGT TGACAGGAAC CTGGACCCCT 4260 
TACCCCAACT TTATGAGTAA TGCTTGGAAT AAAAAACTCT TAAGGCAACT CACTGATTTA 4320 
CTTCTAGCAA TAGCATGATG TTACAGGAAT ATTACCTCTG TTTAAGCAAG GTT^TGTGTA 4380 
AAATCAGTCT CGGCTGTCAG AATAACTTCT AAAAGGTATT TTTAT AAGCA GTTCAAGTTA 4440 
CTGAAAACCT TTTAAACCTT TCTGAAGTTC GTTAGTATAA ATTACTTTTC TAGGATTATT 4500 
45 AATAAAAGCC ACATAGGTGG CAAGTTGTAG TTTTATATGG CTCTGTAGAG TGGTGAACCT 4560 
TCTAGAGGAA TATATGATTT ATTCACAGTT CCTCAAGGCC T6G6GATGAT GATCAGTTAT 4620 
ACCTATTTTT GTGCAATTAC ATCATGTTGT ACATTAGAAA T6GAGAGTTT AAT AGCT CTT 4680 
TAACTGCTGT CCTCATTAGG TAATGATAAA TATTTCCCTT AAATAATTGA CTATTTTGCT 4740 
GTGTTTTAAA AATGATTGAA ATTTATCTTG CCATATCTCA TAATTTCATG CACAA6TTGA 4800 
50 CTGAGCTAAT CTTGAGAATA TATTCGTAAA ATAGGAGCAC ATTTA6TTGA GGTATACAA6 4860 
GTAGGACTCT AGACAAAACC TTCTATTTTA GCTTTAGT6A ATTTCAAAAG TAATGGGTCT 4920 
TGGAGTATAG ATTTTTATTA GTAGCTTGAA AGAGCTTAAT CATATGCAGT AAGTATTTTT 4 980 
ATTACCAATA AATTTAAAAT TTTTTAAGAA AAATATTTTT ATCCTAGGGC CAAGTGTTGC 504 0 
CTGCCACCAA TCAGTAAGTT AGTCTATAAC AAATTTTACC CTAACAGTTT TACCACCTAG 5100 
55 CAACAGTCAT TTCTGAAAAT ATGTTGGATA GAAAGTCACT CTTTGGCAAA AGTGTTAGAA 5160 
TTTGCTTTTG TGCCATCTAT TCCTTTTATG GCATCTATCT TGAAAGTAAT CTTGTATTGG 5220 
AGATTGAAAG ATGCTGTAAT TTAGAAATTA ACATGATATC TTAAATTACC TTTATGAAAT 5280 
ATAGTTTTGT ATAATAGCAT AGATTTTCCT TCAAAAAATG AACATTTATA TATCTACAAA 534 0 
AATATGGAGA AGAGCAATTT GAAAGCCTAC TTTCTGAAGA AAATGGTGGG ATTTTTTTTT 5400 
60 ATCATGATTA AATATCAAAA AATTGCCCTA TGAAAACTTT AAATCTCTAA AACATTTGAA 5460 
ATACTACCAT ATTTGTGATT TATTGAGAAT AAAAATCCAT TTTGAAATGT AAAATTTTTA 5520 
TGATCTGATT CAGTTTTAAG AAAACATGAA TGAACTAGAA GATATTAAAA ACATTTGACA 5580 
TTGGTAAGAA ATATTGATAC TGATATTGAT TTTTATATAG GTATTTATTT CAGAATTGAT 5640 
ATTTTGAGAA AAATACATGT GAGTCATTTT TTCTGTTTCT CTTTTCTCTT AAC6ATTATC 5700 
65 ACTGTAATTC TGAATCT 

Seq ID NO: 189 Protein seauence; 
Protein Accession ft: NP_00220l.l 

70 1 11 21 31 41 51 

I I I I I I 

MAFPPRRRLR LGPRGLPLLL SGLLLPI#CRA FNLDVDSPAB YSGPEGSYFG FAVDFFVPSA 60 

SSRMFIiLVGA PKANTTQPGI VEG6QVLKCD WSSTRRCQPI EFDATGNRDY AKDDPIjEFKS 120 

HQWFGASVRS KQDKILACAP LYHWRTEMKQ EREPVGTCFL QDGTKTVEYA PCRSQDIDAD 180 

75 GQGFCQGGFS IDFTKADRVL LGGPGSFYWQ GQI-ISDQVAB IVSKYDPNVY SIKYNNQLAT 240 

RTAQAIFDDS YIX3YSVAVGD FNGDGIDDFV SGVPRAARTL GMVYIYDGKN NSSIjYNFTGE 300 
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10 



15 



QMAAYFGFSV 
LNGFEVFARF 
LEGQWAARSM 
PSIIjNQDNKT 
ALFLYSRSPS 
TTGLQPILNQ 
AQNQGEGAYE 
TQUAGLRFS 
DHIFIiPIPNW 
ILHYDIDGPM 
TLGCXSVAQCL 
PPYKNLPIED 
FKRVRPPQEE 



AATDINGDDY 
G5AIAPLGDL 
PPSFGYSMKG 
CSLPGTAliKV 
HSKNMTISRG 
FTPANISRQA 
AELIVSIPLQ 
VHQQSEMDTS 
EHKENPETEE 
NCTSDMEINP 
KIVCQVGRIiD 
ITNSTLVTTN 
QKREQLQPHE 



ADVFIGAPIiP 
DQDGFi3DIAI 
ATDIDKNGYP 
SCFNVRFCLK 
GLMQCEEIiIA 
HZLIiDCGEDN 
ADFIGWRNN 
VKFDLQIQSS 
DV6PWQHIY 
LRIKISSLQT 
RGKSAIIiYVK 
VTWGIQPAPM 
NGEGNSET 



MDRGSDGKLQ 
AAPYG6EDKK 
DIiIVGAFGVD 
ADGKGVIiPRK 
YIiRDESEFRD 
VCKPKLEVSV 
EALARLSCAF 
NLFDKVSPW 
ELRNNGPSSF 
TEKNDTVAGQ 
SLLWTETFMN 
PVPVWVIIIiA 



EVGQVSVSLQ 
GIVYIFNGRS 
RAILYRARPV 

KLTPITIFME 
DSDQKKIYIG 
KTENQTRQW 
SHKVDIiAVIiA 
SKAMIiHIiQWP 
GBRDHItlTKR 
KENQNHSYSL 
VLAGLIiIjLAV 



RASGDFQTTK 
TGIiNAVPSQI 
ITVNAGLEVY 
KLKQKGAIRR 
YRIiDYRTAAD 
DDNPIiTItlVK 
CDLGNPMKAG 
AVEIRGVSSP 
YKYNNNTLLY 
DLAIiSEGDZH 
KSSASFNVIE 
LVFVMYRMGP 



360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 



Seq ID NO: 190 DNA aecnience 
Nucleic Acid Accession #: NM_004B64 

Coding sequence: 26-952 (underlined sequences correspond to start and stop codons) 



20 



25 



30 



35 



40 



CGGAACGAGG 
TCAGATGCTC 
GGCCGAGGCG 
ATTCCGAGAG 
CTGGGAAGAT 
AGTGCGGCTG 
GGGGCTCCCC 
AAGGTCGT66 
GCCCGCX5CTG 
ATCTTCXfTCC 
CCGCAGAGCG 
TCTGCACACG 
ACGGGAG6TG 
CATGCAC6CG 
CTGCTGCGTG 
GTCGCTCCAG 
GGTCCTTCCA 
GGGCTCAAGG 
TTATTTATTA 
ACTGTGTATT 
AAAA 



11 

I 

GCAACCTGCA 
CTGGTGTTGC 
AGCCGCGCAA 
TTGCGGAAAC 
TCGAACACCG 
GGATCCGGCG 
GAGGCCTCCX: 
GACGT6ACAC 
CACCTGCGAC 
GCACG6CCCC 
CGT6CGC6CA 
6TCCGCGCGT 
CAAGTGACCA 
CAGATCAAGA 
CCCGCCAGCT 
ACCTATGATG 
CTGTGCACCT 
TTCCTGAGAC 
TTAATTTATT 
TATTTAAAAC 



21 
I 

CAGCC ATGC C 
TGGTGCTCTC 
GTTTCCCGGG 
GCTACGAGGA 
ACCTCGTCCC 
GCCACCTGCA 
GCCTTCACCG 
GACCGCTGCG 
TGTC6CCGCC 
AGCTGGAGTT 
ACGGG6ACGA 
C6CTGGAAGA 
TGTGCATCGG 
CGAGCCTGCA 
ACAATCCCAT 
ACTTGTTAGC 
GCGCGGGGGA 
ACCCGATTCC 
GGGGTGACCT 
TCTGGTGATA 



31 
I 

CGGGCAAGAA 
GTGGCTGCCG 
ACCCTCAGAG. 
CCTGCTAACC 
GGCCCCTGCA 
CCTGCGTATC 
GGCTCTGTTC 
GCGTCAGCTC 
GCCGTCGCAG 
GCACTTGCGG 
CTGTCCGCTC 
CCTG6GCTGG 
CGCGTGCCCG 
CCGCCTGAAG 
GGTGCTCATT 
CAAAGACTGC 
GGCGACCTCA 
TGCCCAAACA 
TCTTGGGGAC 
AAAATAAAGC 



41 
I 

CTCAGGACGG 
CATGGGGGCG 
TTGCACTCCG 
AGGCTGCGGG 
GTCCGGATAC 
TCTCGGGCCG 
CGGCTGTCCC 
AGCCTTGCAA 
TCGGACCAAC 
CCGCAA6CC6 
GGGCCC6GGC 
6CCGATTGGG 
AGCCAGTTCC 
CCCGACACGG 
CAAAAGACCG 
CACTGCATAT. 
GTTGTCCTGC 
GCTGTATTTA 
TCGGGGGCTG 
T6TCTGAACT 



51 
I 

TGAATGGCTC 
CCCTGTCTCT 
AAGACTCCAG 
CCAACCAGAG 
TCACGCCAGA 
CCCTTCCCGA 
CGACGGCGTC 
GACCCCAA6C 
TGCTGGCAGA 
CCAG66G6C6 
GTTGCTGCCG 
TGCT6TCGCC 
66GCX3GCAAA 
AGCCAGGGCC 
ACACCGGGGT 
GAGCAGTCCT 
CCTGTGGAAT 
TATAAGTCTG 
GTCTGATGGA 
GTTAAAAAAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 



45 



50 



55 



60 



65 



70 



Seq ID NO: 191 Protein secmence; 
Protein Accession #: NP 004855 



1 
1 

MPGQBLRTVN 
EDIiLTRLRAN 
HRAIiFRIiSPT 
EIiHLRPQAAR 
I6ACPSQFRA 
LAKDCHCI 



11 
I 

GSQMIiLVIiLV 
QSWEDSNTDIi 
ASRSWDVTRP 
GRRRARARNG 
ANMHAQIKTS 



21 
I 

LSWLPHG6AL 
VPAPAVRILT 
LRRQIiSLARP 
DOCPLGP6RC 
LHRIiKPDTEP 



31 
I 

SLAEASRASF 
PEVRIiGSGGH 
QAPALHIiRIiS 
CRIiHTVRASI. 
APCCVPASYN 



41 

I 

PGPSEZiHSED 
UILRISRAAIi 
PPPSQSDQLL 
SDIiGWADWVIi 
PMVLIQKTDT 



51 
I 

SRFREIiRKRY 
PSGLPSASRL 
AESSSARPQL 
SPRSVQVTMC 
GVSLQTYDDL 



60 
120 
180 
240 
300 



Seq ID NO: 192 DNA secaience 

Nucleic Acid Accession #: XM_061731.1 

Coding sequence: 1-567 (underlined sequences correspond to start and stop codons) 



1 
I 

ftTGA GAAAAG 
AGAAAGTTTC 
GTAAAAGAAC 
CTGTTGTTGA 
TCCCCTCCAA 
TGGGGTGGCA 
AGTCTTTCAC 
GAGGTCTTCC 
GCTGGATGGT 
ATGGTTTATT 



11 

I 

GAAATGAGGG 
TCAAAGAAGA 
CTTTCTCTCT 
TGTCCACAGA 
TGTGCACCAA 
AAGACACCAG 
TCACCAAACA 
AGCCACTTTC 
GGATTTATCA 
CTAAAGAAAC 



21 
I 

AGAGAACACA 
TGGCATTACA 
GATTGGACTT 
CACTGGCAAG 
ATCACGT/^AA 
GAGCAATACT 
TTCCCACAAG 
AGAGCCAGGT 
GAGCTGTCAG 
TGA GTGA 



31 
I 

GAAGAGGGCA 
TTGCACATCT 
GACACACAGA 
GACAGGTTTA 
AATGGGGATA 
GATCTTCCTA 
CCTGTCCCTG 
GTAGAAGCAG 
GTTCCTTCCT 



41 
I 

GGCTTGCTCA 
CTCTGTGTCT 
AG6ATCTCAG 
CCAACATACT 
ATGACTCCCC 
TCAGAGACCC 
AGCATCA6TG 
AGATG6AAGT 
CAACCCTT6C 



51 
I 

GCTTGCTCAA 
CTCTATTGCT 
TAAAGATTTG 
GCTGTCACAC 
TGCCTTCACA 
TGGGGGCAAG 
TGACCAGA6A 
GTTCGCTQAT 
AAGAAAGAA6 



60 
120 
180 
240 
300 
360 
420 
480 
540 



Seq ID NO: 193 Protein sequence; 
Protein Accession #: XP 061731.1 



75 



11 



21 



31 



41 



51 
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MRKGNEGENT EEXSRIiAQIAQ RKFIiKEDGIT LHZSLCIiSXA VKBPFSLIGL DTQKDLSKDL 60 

UbliMSTDTGK DRFTNILLSH SPPMCTKSRK N6DNDSPAFT WG6KDTRSNT DLPIRDPGGK X20 

SLSLTKHSHK PVPEHQCDQR EVFQPLSEPG VEAEMEVFAD AGWWIYQSCQ VPSSTLARKK IBO 
^ MVYSKETE 

Seq ID NO: 194 DNA aeouence 

Nucleic Acid Accession #: NM_005415.2 

Coding sequence: 371-2410 (underlined seq[uences correspond to start and stop codons) 

10 1 11 21 31 41 51 

I I I I I I 

GAGCTGTCCC CGGTGCCGCC GACCCGGGCC GTGCCGTGTG CCCGTGGCTC CAGCCGCTGC 60 

CGCCTCGATC TCCTCGTCTC CCGCTCCGCC CTCCCTTTTC CCTGGATGAA CTTGCGTCCT 120 

TTCTCTTCTC CGCCATGGAA TTCTGCTCCG TGCTTTTAGC CCTCCTGAGC CAAAGAAACC 180 

15 CCAGACAACA GATGCCCATA CGCAGCGTAT AGCAGTAACT CCCCAGCTCG GTTTCTGTGC 240 

CGTAGTTTAC AGTATTTAAT TTTATATAAT ATATATTATT TATTATAGCA TTTTTGATAC 300 

CTCATATTCT GTTTACACAT CTTGAAAGGC GCTCAGTAGT TCTCTTACTA AACAACCACT 360 

ACTCCAGAGA ATGGCAACGC TGATTACCAG TACTACAGCT GCTACCGCCG CTTCTGGTCC 420* 

TTTGGTGGAC TACCTATGGA TGCTCATCCT GGGCTTCATT ATTGCATTTG TCTTGGCATT 480 

20 CTCCGTGGGA GCCAATGATG TAGCATU^TTC TTTTGGTACA GCTGTGGGCT CAGGTGTAGT 540 

GACCCTGAA6 CAAGCCTGCA TCCTAGCTAG CATCTTTGAA ACAGTGG6CT CTGTCTTACT 600 

G6GGGCCAAA GTGAGCXiAAA CCATCCGGAA GGGCTTGATT 6ACGT6GAGA TGTACAACTC 660 

6ACTCAA6GG CTACTGATGG CCGGCTCAGT CAGTGCTATG TTTGGTTCTG CTGTGTGGCA 720 

ACTCGTGGCT TCGTTTTTGA A6CTCCCTAT TTCTGGAACC CATTGTATTG TTGGTGCAAC 780 

25 TATTGGTTTC TCCCTCGTGG CAAAGG66CA GGAG6GTGTC AA6TGGTCTG AACTGATAAA 840 

AATTGTGATG TCTTGGTTCG TGTCCCCACT GCTTTCTGGA ATTATGTCTG GAATTTTATT 900 

CTTCCT6GTT CGT6CATTCA TCCTCCATAA GGCAGATCCA GTTCCTAATG GTTTGCGAGC 960 

TTTQCCAGTT TTCTATGCCT GCACAGTTGG AATAAACCTC TTTTCCATCA TGTATACT6G 1020 

AGCACCGTTG CTGGGCTTTG ACAAACTTCC TCTGTGGGGT ACCATCCTCA TCTCGGTGGG 1080 

30 ATGTGCAGTT TTCTGTGCCC TTATCGTCTG GTTCTTTGTA TGTCCCAGGA TGAAGAGAAA 114 0 

AATTGAACGA GAAATAAAGT GTAGTCCTTC TGAAAGCCCC TTAATGGAAA AAAAGAATAG 1200 

CTTGAAAGAA GACCATGAAG AAACAAAGTT GTCTGTTGGT GATATTGAAA ACAAGCATCC 1260 

TGTTTCTGAG GTAGGGCCTG CCACTGTGCC CCTCCAGGCT GTGGTGGAGG AGAGAACAGT 1320 

CTCATTCAAA CTTGGAGATT TGGAGGAAGC TCCAGAGAGA GAGAGGCTTC CCAGCGTGGA 1380 

35 CTTGAAAGAG GAAACCAGCA TAGATAGCAC CGTGT^TGGT GCAGTGCAGT TGCCTAATGG 1440 

GAACCTTGTC CAGTTCAGTC AAGCCGTCAG C7ACC7UUVTA AACTCCAGTG GCCACTCCCA 1500 

GTATCACACC GTGCATAAGG ATTCCGGCCT GTACAAAGAG CTACTCCATA AATTACATCT 1560 

TGCCAAGGT6 GGAGATTGCA TGGGAGACTC CGGTGACAAA CCCTTAAGGC GCAATAATAG 1620 

CTATACTTCC TATACCATGG CAATATGTGG CATGCCTCTG GATTCATTCC GTGCCAAAGA 1680 

40 AGGTGAACAG AAGGGCGAAG AAATGGAGAA GCTGACATGG CCTAATGCAG ACTCCAAGAA 1740 

GCGAATTCGA ATGGACAGTT ACACCAGTTA CTGCAATGCT GTGTCTGACC TTCACTCAGC 1800 

ATCTGAGATA GACATGAGTG TCAAGGCA6C GATGGGTCTA GGTGACAGAA AAGGAAGTAA 1660 

TGGCTCTCTA GAAGAATGGT ATGACCAG6A TAAGCCTGAA GTCTCTCTCC TCTTCCAGTT 1920 

CCTGCAGATC CTTACAGCCT GCTTTG6GTC ATTCGCCCAT GGTGGCAATG AGGTAA6CAA 1980 

45 TGCCATTGGG CCTCTGGTTG CTTTATATTT GGTTTATGAC ACAGGAGATG TTTCTTCAAA 2040 

AGTGGCAACA CCAATATGGC TTCTACTCTA TGGT6GTGTT GGTATCTGTG TTG6TCTGT6 2100 

6GTTTGGGGA AGAAGAGTTA TCCAGACCAT GGGGAA6GAT CTGACACCGA TCACACCCTC 2160 

TAGTGGCTTC AGTATTGAAC TGGCATCTGC CCTCACTGTG GTGATTOCAT CAAATATTGG 2220 

CCTTCCCATC AGTACAACAC ATTGTAAAGT GG6CTCTGTT GTGTCTGTTG GCTG6CTCCG 2280 

50 GTCCAAGAAG GCTGTTGACT GGCGTCTCTT TCGTAACATT TTTATGGCCT GGTTTGTCAC 2340 

AGTCCCCATT TCTGGAGTTA TCAGTGCTGC CATCATGGCA ATCTTCAGAT ATGTCATCCT 2400 

CAGAATGTGA AGCTGTTTGA GATTAAAATT TGTGTCAATG TTTGGGACCA TCTTAGGTAT 2460 

TCCTGCTCCC CTGAAGAATG ATTACAGTGT TAACAGAAGA CTGACAAGAG TCTTTTTATT 2520 

TGGGAGCAGA GGAGGGAAGT GTTACTTGTG CTATAACTGC TTTTGTGCTA AATATGAATT 2580 

55 GTCTCAAAAT TAGCTGTGTA AAATAGCCCG GGTTCCACTG GCTCCTGCTG AGGTCCCCTT 2640 

TCCTTCTGGG CTGTGAATTC CTGTACATAT TTCTCTACTT TTTGTATCAG GCTTCAATTC 2700 

CATTATGTTT TAATGTTGTC TCTGAAGATG ACTTGTGATT TTTTTTTCTT TTTTTTAAAC 2760 

CATGAAGAGC CGTTTGACAG AGCATGCTCT GCGTTGTTGG TTTCACCAGC TTCTGCCCTC 2820 

ACATGCACAG GGATTTAACA ACAAAAATAT AACTACAACT TCCCTTGTAG TCTCTTATAT 2880 

60 AAGTAGAGTC CTTGGTACTC TGCCCTCCTG TCAGTAGTGG CAGGATCTAT TGGCATATTC 2940 

GGGAGCTTCT TAGAGG6ATG AGGTTCTTTG AACACAGTGA AAATTTAAAT TAGTAACTTT 3000 

TTTGCAAGCA GTTTATTGAC TGTTATTGCT AAGAAGAAGT AAGAAAGAAA AAGCCTGTTG 3060 

GCAATCTTGG TTATTTCTTT AAGATTTCTG 6CAGTGT6GG ATGGAT6AAT GAA6TGGAAT 3120 

GTGAACTTTG G6CAAGTTAA ATGGGACAGC CTTCCATGTT CATTTGTCTA CCTCTTAACT 3180 

65 GAATAAAAAA GCCTACAGTT TTTAGAA7UU\ ACCCXliAATTC 

Seq ID NO: 195 Protein aeouence: 
Protein Accession #: NP_005406.2 

70 1 11 21 31 41 51 

I I I I I I 

MATLITSTTA ATAASGPLVD YIiWMLlLGFI lAFVIiAFSVG ANDVANSFGT AVGSGWTIjK 60 

QACILASIFE TVGSVLLGAK VSETIRKGLI DVEMYNSTQG IiLMAGSVSAM FGSAVWQLVA 120 

SFLKLPISGT HCIVGATIGF SDVAKGQEGV KWSELIKIVM SWFVSPIiliSG IMSGILFPLV 180 

75 . RAFIIiHKADP VPNGLRAIiPV FYACTVGINL FSIMYTGAPIi LGFDiCLPIiWG TILISVGCAV 240 

FCAliIVWFFV CPRMKRKIER EIKCSPSBSP LMEKKNI^IiKE DHBETKLSVG DIBNKHPVSB 300 
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VGPATVPLQA 
QFSQAVSNQI 
YTMAICGMPL 
DMSVKAAMGL 
PLVALYLVYD 
SIELASAIiTV 
SGVISAAXMA 



WEERTVSFK 
NSSGHSQYHT 
DSFRAKEGEQ 
6DRKGSNGSL 
TGDVSSKVAT 
VIASNIGIjPI 
IPRYVILRM 



LGDLEEAPER 
VHKDSGLYKE 
KGEEMEKLTW 
EBWYDQDKPE 
PIWIiliLYGGV 
STTHCKVGSV 



ERLPSVDIjKE 
LLHKIiHLAKV 
PNADSKKRIR 
VSLLFQFLQI 
GICVGIiWVWG 
VSVGWIiRSKK 



ETSIDSTVNG 
GDCMGDSGDK 
MDSVTSYCNA 
LTACFGSFAH 
RRVIQTMGKD 
AVDWRLFRNI 



AVQLPNGNIjV 360 

PLRRNNSYTS 420 

VSDIiHSASEl 4 80 

GGNDVSNAI6 54 0 

LTPITPSSGF 600 

FMAWFVTVPI 660 



10 



15 



20 



25 



30 



35 



40 



45 



Seq ID NO: 196 DMA sequence 

Nucleic Acid Accession #: NM_000020.1 

Coding sequence: 283-1794 (underlined sequences correspond to start and stop codons) 



1 

I 

AGGAAACGGT 
AGAAACATTT 
GA6CGAGCCC 
CCAGCGCTGG 
AGGCTAGCGC 
AGGAAAGGCC 
TCTCGGGGCC 
CGGGGGGCCT 
CGGGGCTGCG 
CACTACTGCT 
CAACCTCCTT 
CTGGCCTTGC 
CAGGAGAAGC 
TCTGAGCAGG 
GGCTCAGGGC 
T6TGT6GGAA 
GCCGTCAAGA 
AACACAGTAT 
CGCAACTCGA 
GACTTTCTGC 
GCATGCGGCC 
GCCCACCGCG 
GCCGACCTGG 
AACCCGAGAG 
ACGGACTGCT 
GAGATTGCCC 
GATGTGGTGC 
CAGACCCCCA 
ATGATGCGGG 
T^GACACTAC 
AGCACCTGAT 
CTATCTGGGT 
TGCTCGGCCC 



11 
I 

TTATTAGGAG 
TTGCTCCAGC 
CTCCCCGGCT 
CGGTGCAACT 
CCCGCCACCC 
TTCTGATGCT 
CGCTGGTGAC 
GGTGCACAGT 
GGAACTTGCA 
GCGACAGCCA 
CGGAGCAGCC 
TGGCCCTGGT 
AGCGTGGCCT 
GCGACACGAT 
TCCCCTTCCT 
AAGGCCX3CTA 
TCTTCTCCTC 
TGCTCAGACA 
GCACGCAGCT 
AGAGACAGAC 
TGGCX3CACCT 
ACTTCAAGAG 
GCCTGGCTGT 
TGGGCACCAA 
TTGAGTCCTA 
GCCGGACCAT 
CC7UVTGACCC 
CCATCCCTAA 
AGTGCTGGTA 
AAAAAATTAG 
TCCTTTCTGC 
AGAGGTAGTG 
CCAGCCCACC 



21 
I 

GGAGTGGTGG 
CCCCATCCCA 
CCAGCCCGGT 
GCGGCCGCGC 
GCAGAGCGGG 
GCTGATGGCC 
CTGCACGTGT 
AGTGCTGGTG 
CAGGGAGCTC 
CCTCTGCAAC 
GGGAACAGAT 
GGCCCTGGGT 
GCACAGCGAG 
GTTGGGGGAC 
GGTGCAGAG6 
TGGCGAAGTG 
GAGGGATGAA 
CGACAACATC 
GTGGCTCATC 
GCTGGAGCCC 
GCACGTGGAG 
CCGCAATGTG 
GATGCACTCA 
GCGGTACATG 
CAAGTGGACT 
CGTGAATGGC 
CAGCTTTGAG 
CCGGCTGGCT 
CCCAAACCCC 
CAACAGTCCA 
CTGCAGGGGG 
TGAGTGTGGT 
CAGCCAAAAA 



31 

I 

AGCTGGGCCA 
GTCCCGGGAG 
CCGGGGCCGC 
GGTGGAGGGG 
GCCAGAGGGA 
TTGGTGACCC 
GAGAGCCCAC 
CGGGAGGAGG 
TGCAGGGGGC 
CACAACGTGT 
GGCCAGCTGG 
GTCCTGGGCC 
CTGGGAGAGT 
CTCCTGGACA 
ACAGTGGCAC 
TGGCG6G6CT 
CAGTCCTGGT 
CTAGGCTTCA 
ACGCACTACC 
CATCTGGCTC 
ATCTTCGGTA 
CTGGTCAAGA 
CAGGGCAGCG 
GCACCCGAGG 
GACATCTGGG 
ATCGTGGAGG 
GACATG7UVGA 
GCAGACCCGG 
TCTGCCCGAC 
GAGAAGCCTA 
CTGGGGGGGT 
GTGTGCTGGG 
TACAGCTGGG 



41 
I 

GGCAGGAAGA 
GCTGCCGCGC 
GCCGGACCCC 
AGGTGGCCCC 
C CATGA CCTT 
AGGGAGACCC 
ATTGCAAGGG 
GGAGGCACCC 
GCCCCACCGA 
CCCTGGTGCT 
CCCTGATCCT 
TGTGGCATGT 
CCAGTCTCAT 
GTGACTGCAC 
GGCAGGTTGC 
TGTGGCACGG 
TCCGGGAGAC 
TCGCCTCAGA 
ACGAGCAC6G 
TGAGGCTAGC 
CACA66GCAA 
GC7ACCTGCA 
ATTACCTGGA 
TGCTGGACGA 
CCTTTGGCCT 
ACTATAGACC 
AGGTGGTGTG 
TCCTCTCAGG 
TCACCGCGCT 
AAGTGATTCA 
GGGGGGCAGT 
GATGGGCAGC 
CTGAAACCTG 



51 
I 

CGCT6GAATA 
CAGCTGCGCC 
AGCCCGCCGT 
GGTCCGCCGA 
GGGCTCCCCC 
TGTGAAGCCG 
GCCTACCTGC 
CCAGGAACAT 
GTTCGTCAAC 
GGAGGCCACC 
GGGCCCCGTG 
CCGACGGAGG 
CCTGAAAGCA 
CACAGGGAGT 
CTTGGTGGAG 
TGAGAGTGTG 
TGAGATCTAT 
CATGACCTCC 
CTCCCTCTAC 
TGTGTCCGCG 
ACCAGCCATT 
GTGTTGCATC 
CATCGGCAAC 
GCAGATCCGC 
GGTGCTGTGG 
ACCCTTCTAT 
TGTGGATCAG 
CCTAGCTCAG 
GCGGATCAAG 
ATAGCCCAGG 
GGATGGTGCC 
TGCGCCTGCC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 



50 



Seq ID NO: 197 Protein sequence: 
Protein Accession #: NP 000011.1 



55. 



60 



65 



70 



75 



MTLGSPRKGL 
RHPQEKRGCG 
LlLGPVIiAIili 
DCTTGSGSGI, 
RETBIYNTVL 
RIiAVSAACGL 
YIiDIGNNPRV 
YRPPFYDWP 
TAIiRIKKTIiQ 



11 
I 

IiMIiLMALVTQ 
NIiHREIiCRGR 
AIiVAIiGVIiGL 
PFLVQRTVAR 
liRHDNIIiGFI 
AHIiHVEIFGT 
GTKRYMAPEV 
NDPSFEDMKK 
KISNSPEKPK 



21 
I 

GDPVKPSRGP 
PTEFVNHYCC 
WHVRRRQEKQ 
QVALVECVGK 
ASDMTSRNSS 
QGKPAIAHRD 
LDEQIRTDCF 
WCVDQQTPT 
VIQ 



31 
I 

LVTCTCESPH 
DSHLCNHNVS 
RGLHSEIiGES 
GRYGEVWRGL 
TQLWIiITHYH 
FKSRNVLVKS 
ESYKWTDIWA 
IPNRIiTUmPV 



41 



I 

CKGPTCRGAW 
LVIiEATQPPS 
SIjXLKASEQG 
WHGESVAVKI 
EHGSLYDFIiQ 
NLQCCIADLG 
FGLVLWEIAR 
LSGIiAQMMRE 



51 
I 

CTWLVREEG 
SQPGTDGQIiA 
DTMLGDIiIiDS 
FSSRDEQSWF 
RQTLEPHIiAIj 
IiAVMHSQGSD 
RTIVNGIVED 
CWYPNPSARIi 



60 
120 
180 
240 
300 
360 
420 
480 



Seq ID NO: 198 DNA sequence 

Nucleic Acid Accession #: NM_003199.1 

Coding sequence: 200-2203 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

CGGGGGGATC TTGGCT6TGT GTCTGCGGAT CT6TAGTGGC G6CGGCGGC6 GCGGCG6CGG 60 

GGAGGCA6CA GGCGCGGGAG C6GGC6CAGG AGCAGGCGGC G6CGGTGGC6 GCGGCGGTTA 120 

GACATGAACG CCGCCTCG6C GCCG6CGGTG CAC6GAGAGC CCCTTCTCGC GCGCGGGCGG 180 

TTTGT6TGAT TTTGCTAA AA TGC ATCACCA ACAGC6AATG GCTGCCTTAG GGACGGACAA 240 

AGAGCTGAGT GATTTACTGG ATTTCAGTGC GATGTTTTCA CCTCCTGTGA GCAGTGGGAA 300 

AAATGGACCA ACTTCTTTGG CAAGTGGACA TTTTACTGGC TCAAATGTAG AAGACAGAAG 360 

TAGCTCA6GG TCCTGG6GGA ATGGAGGACA TCCAAGCCCG TCCAGGAACT ATGGAGATGG 420 



272 



3NSDOCID. <WO 02079492A2_I_> 



wo 02/079492 



PCTAJS02/04915 



GACTCCCTAT GACCACATGA CCAGCAGGGA CCTTGGGTCA CATGACAATC TCTCTCCACC 480 

TTTTGTCAAT TCCAGAATAC AAAGTAAAAC AGAAAGGGGC TCATACTCAT CTTATGGGAG 540 

AGAATCAAAC TTACAGGGTT GCCACCAGCA GAGTCTCCTT GGAGGT6ACA TGQATATGG6 600 

CAACCCAG6A ACCCTTTCGC CCACCAAACC TGGTTCCCAG TACTATC7VGT ATTCTAGCAA 660 

5 TAATCCCCGA AGGAGGCCTC TTCACAGTAG TGCCATG6A6 GTACAGACM A6AAAGTTCG 720 

AAAAGTTCCT CCAGGTTTGC CATCTTCAGT CTATGCTCCA TCAGCAAGCA CTGCCGACTA 780 

CAATAGGGAC TCGCCAGGCT ATCCTTCCTC CAAACCAGCA ACCAGCACTT TCCCTAGCTC 840 

CTTCTTCATG CAAGATGGCC ATCACAGCAG TGACCCTTGG AGCTCCTCCA GTGGGATGAA 900 

TCAGCCTGGC TATGCAGGAA TGTTGGGCAA CTCTTCTCAT ATTCCACAGT CCAGCAGCTA 960 

10 CTGTAGCCTG CATCCACATG AACGTTTGAG CTATCCATCA CACTCCTCAG CAGACTVTCT^ 1020 

TTCCAGTCTT CCTCCGATGT CCACTTTCCA TCX3TAGTGGT ACAAACCATT ACAGCACCTC 1080 

TTCCTGTACG CCTCCTGCCA ACGGGACAGA CAGTATAATG GCAAATAGAG GAAGCGGGGC 1140 

AGCCGGCAGC TCCCAGACTG GAGATGCTCT GG6GAAAGCA CTTGCTTCGA TCTATTCTCC 1200 

AGATCACACT AACAACAGCT TTTCATCAAA CCCTTCAACT CCTGTTGGCT CTCCTCCATC 1260 

15 TCTCTCAGCA GGCACAGCTG TTTGGTCTAG AAATGGAGGA CAGGCCTCAT CGTCTCCTAA 1320 

TTATGAAGGA CCCTTACACT CTTTGCAAAG CCX3AATT6AA GATCGTTTAG AAAGACTGGA 1380 

TGATGCTATT CATGTTCTCC GGAACCATGC AGTGG6CCCA TCCACAGCTA TGCCTGGTGG 1440 

TCATGGGGAC ATGCATGGAA TCATTGGACC TTCTCATAAT GGAGCCAT6G GTGGTCTGGG 1500 

CTCAGGGTAT GGAACCGGCC TTCTTTCAGC CAACAGACAT TCACTCATGG TGGGGACCCA 1560 

20 TCGTGAAGAT GGCGTGGCCC TGAGAGGCAG CCATTCTCTT CTGCCAAACC AGGTTCCGGT 1620 

TCCACAGCTT CCTGTCCAGT CTGCGACTTC CCCTGACCTG AACCCACCCC AGGACCCTTA 1680 

CAGA6GCAT6 CCACCAGGAC TACAGGGGCA 6A6TGTCTCC TCTGGCAGCT CTGAGATCAA 1740 

ATCCGATGAC GAGG6TGATG AGAACCTGCA AGACACX3AAA TCTTCGGAG6 ACAAGAAATT 1800 

AGATGACGAC AAGAA66ATA TCAAATCAAT TACTAGCAAT AATGACGATG AG6ACCTGAC 1860 

25 ACCAGAGCAG AAGGCAGAGC 6TGAGAAGGA GCGGAGGATG GCCAACAAT6 CCCGAGAGCX3 1920 

TCTGCGGGTC COTGACATCA ACGAGGCTTT CAAAGAGCTC GGCCGCATGG TGCAGCTCCA 1980 

CCTCAAGA6T GACAAGCCCC AGACCAAGCT CCTGATCCTC CACCAGGCGG TGGCCGTCAT 2040 

CCTCAGTCTG 6AGCAGCAAG TCCGAGAAAG GAATCTGAAT CC6AAAGCTG CGTGTCTGAA 2100 

AAGAAGGGAG GAAGAGAAGG TGTCCTCGGA GCCTCCCCCT CTCTCCTTGG CCGGCCCACA 2160 

30 CCCTGGAATG G6AGACGCAT CGAATCACAT GGGACAGATG l^AAAGGGTC CAAGTTGCCA 2220 

CATTGCTTCA TTAAAACAAG AGACCACTTC CTTAACAGCT GTATTATCTT AAACCCACAT 2280 

AAACACTTCT CCTTAACCCC CATTTTTGTA ATATAAGACA AGTCTGAGTA GTTATGAATC 2340 

GCAGACGCAA GAGGTTTCAG CATTCCCAAT TATCAAAAAA CAGAAAAACA AT^AAAAAGAA 2400 

AGAAAAAAGT GCAACTTGAG GGACGACTTT CTTTAACATA TCATTCAGAA TGTGCAAAGC 2460 

35 AGTATGTACA GGCTGAGACA CAGCCCAGAG ACTGAACGGC 

Seq ID NO: 199 Protein sequence: 
Protein Accession #: NP^003190.1 

40 1 11 21 31 41 5X 

.1 I I I I i 

MHHQQRMAAIa 6TDKELSDLL DFSAMFSPPV SS6KNGPTSL ASGHFT6SNV EDRSSS6SWG 60 

N6GKPSPSRN YGDGTPYDHM TSRDLGSHDN LSPPFVNSRI QSKTBRGSYS SYQRBSmjQG 120 

CHQQSIiLGGD MDMGNPGTLS PTKPGSQYYQ YSSNNPKRRP IiHSSAMEVQT KKVRiCVPPGIi 180 

45 PSSVYAPSAS TADYNRDSPG YPSSKPATST FPSSFFMQDQ HHSSDPWSSS SGMNQPGYAG 240 

ML6NSSHIPQ SSSYCSXiHPH ERIiSYPSHSS ADINSSZiPPM STFHRSGTNH YSTSSCTPPA 300 

N6TDSINANR GS6AAGSSQT GDALGKALAS lYSPDHTNNS FSSNPSTPVG SPPSLSAGTA 360 

VWSRK6GQAS SSPNYEGPLH SIjQSRIEDRL ERLDDAIHVI. RNHAVGPSTA MPGGHGDMHG 420 

IZGPSHNGAM GGIjGSGYGTG UjSANRHSLM VGTHREDGVA IiRGSHSIjXjPN QVPVPQIjPVQ 480 

50 SATSPDliNPP QDPYRGMPPG LQGQSVSSGS SEIKSDDEGD ENLQDTKSSE DKKLDDDKKD 540 

IKSIT5NKDD EDLTPEQKAE RCKERRMANN ARERLRVROI NBAFKELGRM VQLHLKSDKP 600 

QTiCLLIZiHQA VAVILSLEQQ VRBRNUTPKA ACLKRRBBEK VSSEPPPIiSIt AGPHPGMGDA 660 
SNHMGQM 

55 Seq ID NO: 200 DNA sequence 

Nucleic Acid Accession #: BC005987 (1-1286), BE888744 (1287-1756) 

Coding sequence: 124-525 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

60 I I I I 1 1 

GGCAGAAGAG GAAGATTTCT GAAGAGTGCA GCTGCCTGAA CCGAGCCCTG CCGAACAGCT 60 

GAGAATTGCA CTGCAACCAT_GAGTGAGAAC AATAAGAATT CCTTGGAGAG CAGCCTACGG 120 

CAACTAAAAT GCCATTTCAC CTGGAACTTG ATGGAGGGAG AAAACTCCTT GGAT<3ATTTT 180 

GAAGACAAAG TATTTTACCG GACTGAGTTT CAGAATCX5TG AATTCAAAGC CACAAT6TGC 240 

65 AACCTACTGG CCTATCTAAA GCACCTCAAA G6GCAAAAC6 AGGCA6CCCT GGAATGCTTA 300 

CGTAAAGCTG AA6AGTTAAT CCAGCAA6A6 CATGCT6ACC AGGCA(3AAAT CAGAAGTCTG 360 

GTCACCTGGG GAAACTAT6C CTG(3GTCTAC TATCACAKSG GCC6ACTCTC AGACGTTCAG 420 

ATTTATGTAG ACAA<3GT6AA ACATGTCTGT 6AGAAGTTTT CCIAGTCCCTA TAGAATTGA6 460 

AGTCCAGAGC TT6ACTGTGA 6GAAG6GTGG ACACGGTTAA AGT6T(3GARG AAACCAAAAT 540 

70 GAAAGAGCX3A AGGT6TGCTT T6AGAAGGCT CTGGAAAAGA AGCCAAA6AA CCCA(3AATTC 600 

ACCrrCTGGAC TGGCAATAGC AAGCTTACCGT CTTGGACAACT 6GCCACCATC TCAGAACGCC 660 

ATTGACCCTC TGAG6CAAGC CATTCG6CTG AATCCTGACA ACCaiGTACCT TAAAGTCCTTC 720 

CTGGCTCTGA AGCTTCy^TAA GATGCGTGAA GAAGGTGAAG AGGAAGGTGA AGGAGAGAAG 780 

TTAGTTGAAG AAGCCTTGGA GAAAGCCCCA GGTGTAACAG ATCTACTTTCG CAGTGCAGCC 840 

75 AAGTTTTATC GAAGAAAAGA T(3AGCCAGAC AAAGCGATTG AACTGCTTAA AAAGGCTTTA 900 

GAATACATAC CAAACAATGC CTACCTGCAT TGCCAAATTO GGTGCTGCTA TAGGGCAAAA 960 



273 



BWSOOCID: <WO 02079492A2_L> 



wo 02/079492 



PCT/US02/04915 



10 



15 



GTCTTCCAAG 
CTAATAGGAC 
CGTGTCTGTT 
TATTACTTCC 
CTGCGGTATG 
TTTATAGAGG 
CTGCAAAAAA 
GTCTTGGCAT 
AGGGGTTTGG 
ATAGAGATGT 
GGTATTCAAA 
CACTGGACAG 
GGGAGAGGGA 
CTCTTTGCGG 



TAATGAATCT 
ACGCTGTGGC 
CCATTCTTGC 
AAAAGGAATT 
GCAACTTTCA 
GT6TAA71AAT 
TTGCCAAAAT 
TCCTTCAGGA 
AGTCTGGAAG 
GGTGCCCACT 
ATATGTAATG 
GGTTATGTTA 
CAGATTGGGG 
AACTTC - 



AA6A6AGAAT 
TCATCTGAA6 
CAGCCTCCAT 
CAGTAAAGAG 
GCT6TACCAA 
AAACCAGT^ 
GCGACTTTCT 
GCTGAATGiVA 
CCTCATCCCT 
AGGCTACTGC 
ACTGGTATGG 
AACCTGAATT 
GGTCGTCCAG 



GGAATGTATG 
AAAGCTGATG 
GCTCTAGCA6 
CTTACTCCTG 
AT6AAGT6TG 
TCAA6GGAGA 
AAAAATGGAG 
AAAATGCAAC 
TCAGCATCAA 
TGAAAG66AG 
CAAAAGATT6 
GCTGGGTCTT 
GGCTGCGCTA 



6GAAAAGAAA 
AGGCCAATGA 
ATCAGTATGA 
TAGCGAAACA 
AAGACAAGGC 
AAGAAAAGAT 
CAGATTCTGA 
AAGCAGAT6A 
GCTGGAATGG 
CTGAAATTCC 
6ACTAA6ACA 
A7UVAGAGCCC 
AATTATTCTC 



GTTACTGGAA 
TAATCTCTTC 
AGAAGCAGAG 
ACTGCTCCAT 
CATCCACCAC 
GAAAGACATkA 
GGCTTTGCAT 
AGACTCTGAG 
GGA ATGAA GA 
TCCAC7AGTT 
CTGGCCATAC 
AAGGAGTTCT 
AATGATTTGT 



1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 



Seq ID NO: 201 Protein sequence: 
Protein Accession AAA59191 



20 



25 



30 



35 



40 



45 



50 



55 



1 
I 

MSENNKNSLE 
KHLKGQNEAA 
KHVCEKFSSP 
ASYRLDNWPP 
EKAPGVTDVl* 
IiRENGMYGKR 
FSKELTPVAK 
MRLSKNGADS 



11 
I 

SSLRQLKCHF 
XiECLRKAEEIi 
YRIESPEIiDC 
SQHAIDPLRQ 
RSAAKPYRRK 
KLLBIiIGHAV 
QZiLHLRYGNF 
EALHVLAFLQ 



21 
I 

TWNLMEGENS 
IQQEHADQAE 
EEGWTRIiKCG 
AIRLNPDHQY 
DEPDKAIELL 
AHLKKADEAN 
QLYQMKCEDK 
ELNEKMQQAD 



31 
I 

LDDFEDKVFY 
IRSLVTWGNY 
GNQNBRAKVC 
IiKVLLALKia 
KKALBYIPNN 
DNLFRVCSIL 
AIHHFIEGVK 
EDSERGLESG 



41 

I 

RTEFQNREFK 
AWVYYHMGRL 
PEKAIiEKKPK 
KMREEGEEEG 
AYIiHCQIGCC 
ASIjHAIjADQY 
INQKSREKEK 
SIiZPSASSWN 



51 
I 

ATMCNIiLAYL 
SDVQIYVDKV 
NPEFTSGIjAI 
EGEKLVEEAL 
YRAKVFQVMN 
EDAEYYFQKE 
MKDKLQKZAK 
GE 



60 
120 
180 
240 
300 
360 
420 



Seq ID NO: 202 DNA sequence 
Nucleic Acid Accession NM_003090 

Coding sequence: 57-824 (underlined sequences correspond to start and stop codons) 



1 
I 

GAATTCCGCG 
TCAAGCTGAC 
GGGAGCTGGA 
ACCAGTTTGA 
TGTTGAGAAG 
GACTTGATCA 
AACTGGGTGA 
GAAATCCGGT 
TCAGAGTACT 
TCAAGGGCAA 
ATCCAGGTGC 
AAGCAATCAA 
GGTT6CTGCA 
GTGAAGAAGA 
ATAATAATAG 
TGTGTTAGCA 
TTTGTAATAT 
CACTCTCTAT 



11 
I 

GGAGGCCACG 
GGCGGAGCTG 
CCTCCGGGGG 
TGCTATTGAT 
ACTGAA7UVCA 
GGCTCTGCCC 
TCTGGACCCT 
AACCAATAAG 
GGATTTCCAG 
ACGGGGTGCA 
TGGTTTGCCA 
GAATGCCATA 



GATGGAA6AA 
GCCCTCTTGG 
AAGTGGAATC 
AAGTTTTGAA 
GCTAAAAAAA 



21 
I 

GGCTTTCCAC 
ATCGAGCAGG 
TATAAAATTC 
TTTTCTGACA 
TTGTTAGTGA 
TGTCTGACAG 
CTGGCATCTC 
AAGCATTACA 
AAAGTGAAAC 
CAGCTTGCAA 
ACTGACAAAA 
GCAAATGCTT 
ATCCCTGGCA 
GACACAGTCA 
AACAAGTCTT 
TATCAGCATT 
ATCTAAATGT 
AAAAAAAGGA 



31 
I 

AGCGCGGGGG 
CGGCGCAGTA 
CCGTCATTGA 
ATGAGATCAG 
ACAACAACAG 
AACTCATTCT 
TCAAATCGCT 
GATTGTATGT 
TAAAAGAGCG 
AGGATATTGC 
AGAGAGGTGG 
CAACTCTGGC 
QAGAAOSCAG 
CAAACG6GTC 
GCTTTTCGAA 
GTTGAAATGC 
CAATTTTCTA 
ATTC 



41 
I 

AACGGGAGGC 
CACCAACGCG 
AAATCTAGGT 
GAAACTGGAT 
AATATGCCGT 
CACCAATAAT 
GACTTACCTA 
GATTTATAAA 
TCAGGAAGCA 
CAGGAGAAGC 
GCCATCTCCA 
TGAAGT6GAG 
ATCAGGGCCC 
CTGAG CAGTG 
CATGGTATAA 
TTAAGACT6C 
CTUy^TTATAA 



51 
I 

TGCAGG ATG G 
GTGCGCGACC 
GCTACGTTAG 
GGTTTTCCTT 
ATAGGTGAGG 
AGTCTCGTGG 
AGTATCCTAA 
GTTCCGCAAG 
GAGAAAATGT 
AAAACTTTTA 
GGGGATGTA6 
AGGCTGAAGG 
ACTGATGATG 
A6GCAGATGT 
TAGCCTTGTT 
T6CTGATAAT 
AAATAAACTC 



Seq ID NO: 203 Protein sequence: 
Protein Accession #: NP 003081.1 



60 
120 
180 
240 
300 
360 
420 
480 
54 0 
600 
660 
720 
780 
840 
900 
960 
1020 



1 11 21 31 41 51 

60. I I I I I I 

MVKLTAELIE QAAQYTNAVR DRELDLRGYK IPVIENLGAT IiDQFDAIDFS DNEIRKLDGP 60 

PLLRRLKTLl, VNNNRICRIG EGLDQALPCL TELILTNNSL VEIiGDLDPLA SLKSLTYLSI 120 

URNPVTNKKH YRLYVIYICVP QVRVLDFQKV KLKERQEAEK MFKGKRGAQL AKDIARRSKT 180 

FNPGA6LPTD KKRGGPSPGD VBAIKNAIAN ASTIAEVERL KGLLQSGQIP GRERRSGPTD 240 
65 DGEEBMEBDT VTNGS 



70 



Seq ID NO: 204 DNA sequence 

Nucleic Acid Accession #: NM_017643.1 

Coding sequence: 169-1401 (underlined sequences correspond to start and stop codons) 



11 



21 



31 



41 



51 



AATAGCAATA GCTTTATAGC AGCTCCGGTT ACCTGTTTTA AACATGGAAG GAGAGTCGCT 60 

CCCAGATAGC CCTCACGAGT GGCCCTGGAG CAGGGAGTGG TC3GAGCAGAT CTTCCTTGTT 120 

75 TG{3GAGGAGC CT<3AGGTGGA CCTCGCGTCC TCAGTCTCGA AGGCACC TAT GG GGAPrrnr 180 

TGC;G6TGATA TCTCAGAAAA TGTGA6A6TA GAAGTTCCCA ATACAGACTG CAGCCTACCT 240 



274 



3NSDOCID: <WO 02079492A2J_> 



wo 02/079492 



PCT/US02/04915 



ACCAAAGTCT TCTGGATTGC TGGAATTGTA AAATTAGCAG GTTACAATGC CCTTTTAAGA 300 

TATGAAGGAT TTGAAAATGA CTCTGGTCTG GACTTCTGGT GCMTATATG TGGTTCTGAT 360 

ATCCATCCAG TTGGTTGGTG TGCAGCCAGC GGAAAACCTC TTGTTCCTCC TAGAACTATT 420 

CAGCATAAAT ATACAAACTG 6AAAGCTTTT CTAGTGAAAC GACTTACTGG TGCCAAAACA 480 

S CTGCCTCCTG ATTTCTCCCA AAAGGTTTCA GAGA6TAT6C AGTATCCTTT CAAACCTT6C 540 

ATGA6AGTAG AAGT66TT6A CAAGAGGCAT TTGTGTG6AA CACGAGTAGC AGTG6TGGAA 600 

AGTGTAATTG GA6GAAGATT AA6ACTAGTG TATGAA6AAA GCGAAGATAG AACAGATGAC 660 

TTCTGGTGCC ATATGCACAG CCCATTAATA CATCATATTG GTTGGTCTCG AAGCATAGGT 720 

CATC6ATTCA AAAGATCTGA TATTACAAAG AAACAG6ATG GACATTTTGA TACACCACCA 780 

10 CATTTATTTG CTAAGGTAAA AGAAGTAGAC CA6AGT66GG AAT6GTTCAA GGAAGGAATG 640 

AAATTGGAAG CTATAGACCC ATTAAATCTT TCTACAATAT GTGTCGCAAC CATTAGAAAG 900 

GTGCTAGCTG ACGGATTCCT GAT6ATTGGG ATCGATGGCT CAGAAGCAGC AGACGGATCT 960 

GACTGGTTCT GTTACCATGC AACCTCTCCT TCTATTTTCC CTGTCGGTTT CTGTGAAATT 1020 

AACATGATTG AACTTACTCC ACCCAGA6GT TACACAAAAC TTCCTTTTAA ATGGTTTGAC 1080 

15 TACCTCAGGG AAACTGGCTC CATTGCAGCA CCAGTAAAAC TATTTAATAA GGATGTTCCA 1140 

AATCACGGAT TTCGTGTAGG AATGA7UVTTA GAAGCAGTAG ATCTCATGGA GCCACGTTTA 1200 

ATATGTGTAG CCACAGTAAC TCGAATTATT CATCGTCTCT TGAGGATACA TTTTGATGGA 1260 

TGGGAAGAAG AGTATGATCA GTGGGTAGAC TGTGAGTCAC CTGACCTCTA TCCTGTAGGG 1320 

TGGTGTCAGT TAACTGGATA TCAACTACAG CCTCCAGCAT CACAGTGTAA GTTGGTATAC 1380 

20 AGAAAAGGTG TCCTTTTG TA AA AATCAGCA ATTCTCCAGA GGACTATCTC ACATTUi^GTCA 1440 

TCTTATGAGC TCACAGGACA AGAATATACC TATGTCTGAT TGGTTGCCAG GTAAGACATT 1500 

AAGACTCAAC AACAATATCA CAGAATCAGA CCATGTGTCC CATGGCAATG TGAATCCAAT 1560 

AGTCAATTAC ATAATGACTA TAGAAACACA ACAGTCACCA AATTAAACTA GACTTACTAT 1620 

TTTAGTGAGT TAAAAATTAC ATACTAAAAG TTTATTGGTA GGTAATAAAT GCTTTTGAGT 1680 

25 AAATA6TGGA AAATGTCTCA TGTTGAGGCT ATGGTTTTGT A60AACAAGT ACCCTTATTT 1740 

TCAGAGCATC ATGTACTTAA GTATAATGGT CTTGGTAAA6 ATAGTTCATA TAAGTTGTAT 1800 

CTAGACAACT 6TATC6TCTA AATTGTAAAC AATTATCTAG TACCAATTTT CCCTTTTTAT 1860 

TTTTCAGCAT CAAGAGAAAA CCAATCAGCT TCATCAAAAC A6AAGAAAAA GGCTAA6TCC 1920 

CAGCAATACA AAGGACATAA 6AAAAGTGGG TCACCACGTG GTGTTCACAT ACATTTTCTA 1980 

30 ATTGTTAACT AATTGGAGTC ACAGTATTCT TGGACAGAAA ATGATATATC TT6TGAGAAC 2040 

TGATGATTGT GCATTATGTA TTATGCTTAA AGGTGCAGTA TGCCATAAAA GGCAAACCCT 2100 

TGCAATAATG AGAAACACTG ATATTTTACT AACAGGAGAA ATGATTACCA CAGTATTTAA 2160 

AGTATACXSTO GTAAA6AATA GAOTCTGTGA AT6ATTCTTG AAATAATATG TAAAACCTAC 2220 

T6AAA6TTAA TCCTTTTTAA AAACTTTATT TAAAAAGAAA AATTAGCAGC CAGGTGCAGT 2280 

35 GGCTCACGCC TGTAATCCCA GCACTTTAGG AGGCCGAGGC T6GCAGATCA CAAGGTCAGG 2340 

AGATCGAGAC CATCCTGGCT AACACGGTGA AACCCTGTCT CCACCAAAAA TACAAAAAAT 2400 

CTGCCGGGCG TGGTGGCACA CGCCTGAAGT CCCAGCTACT CAGGAGGCTG AGGCAAGAGA 2460 

ATCACTTGAA CCCAGGAGGC AGAGGTTGCA GTGGGCCAAG ATCACGCCAC TACATTCCAG 2520 
CTGGGCAACA CAGCAAGACT CTGTCTCAAA AAAAAAAAAA AAAA 
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Seq ID NO: 205 Protein sequence t 
Protein Accession #: NP 060113.1 



1 11 '21 31 41 51 

45 I I I I I I 

M6TCWGDISE NVRVBVPNTD CSIiPTICVFWI AGZVKIiAGYN ALLRYEGFEN DSGIiDFWCNI 60 

CGSDXHPVGW CAASGKPIiVP PRTIQHKYTN WKAFLVKRXiT GAKTIjPPDFS QKVSESMQYP 120 

PKPCMRVEW DKRHIiCRTRV AWESVIGGR LRLVYEESED RTDDFWCHMH SPIiIHHIGWS 180 

RSXGHRFKRS DZTKKQDGHF DTPPHItFAKV KEVDQSGEWF KBGMKZjEAID PI1NI.STXCVA 240 

50 TIRKVLAD6F LMIGIDGSEA ADGSDWFCYH ATSPSIFPV6 FCEINMIE1*T PPRGYTKXiPF 300 

KWFDYI.RETG SIAAPVKLFN KDVPNH6FRV GMKZiBAVDIiM EPRLICVATV TRZXHRXiLRI 360 
HFXJGWBEEYD QWVDCBSPDIj YPVGWCQLT6 YQLQPPASQC KLVYRKGVIA 



55 

Seq ID NO: 206 PNA aecmence 
Nucleic Acid Accession #: NM_012334 

Coding sequence: 223-6399 (underlined sequences correspond to start and stop codons) 

60 1 11 21 31 41 51 

I I I I I I 

GAGACAAAGG CTGCCGTCGG GACG6GCGAG TTAGGGACTT GGGTTTGGGC GAACAAT^GG 60 

TGAGAAGGAC AAGAAGGQAC CGGGCGATGG CAGCAGGGGA GCCCCGCGGG CGCGCGTCCT 120 

CGGGAGTGGC GCCGTGACAC GCATGGTTTC CCCGGACCCG CGGCGGCGCT GACTTCCGCG 180 

65 AGTCGGAGC6 GCACTCGGCG AGTCCGGGAC TGCGCTGGAA CAATCGATAA CTTCTTCACC 240 

GAGGGAACAC GG6TCTGGCT GAGAG/U^AAT GGCCAGCATT TTCCAAGTAC T6TAAATTCC 300 

TGTGCAGAAG GCATCGTCGT CTTCCGGACA GACTATGGTC A6GTATTCAC TTACAAGCA6 360 

AGCACAATTA CCCACCAGAA GGTGACTGCT ATGCACCCCA 06AACGAGGA GGGCGTG6AT 420 

6ACATGGCGT CCTTGACAGA GCTCCAT6GC GGCTCCATCA TGTATAACTT ATTCCAGGGG 480 

70 TATAA6AGAA ATCAAATATA TACCTACATC GGCTCCATCC TGGCCTCCGT GAACCCCTAC 540 

CAGCCCATC6 CCGGGCTGTA CGAGCCT6CC ACCATGGAGC AGTACAGCCG 6C6CCACCTG 600 

6GCGAGCTGC CCCCGCACAT CTTCGCCATC 6CCAACGAGT GCTACCGCTG CCTGTGGAAG 660 

CGCTACGACA ACCA6TGCAT CCTCATCAGT 6GTGA7UVGT6 GGGCAGGTAA AACCGAAA6C 720 

ACTAAATTGA TCCTCAAGTT TCTGTCAGTC ATCAGTCAAC AGTCTTTGGA ATTGTCCTTA 780 

75 AAGGA6AAGA CATCCTGTGT TGAACGAGCT ATTCTTGAAA GCAGCCCCAT CATG GAAGCT 840 

TTCGQCAATG CGAA6ACCGT GTACAACAAC AACTCTAGTC GCTTTGGGAA GTTTGTTCAG 900 
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CTGAACATCT GTCAGAAAGG AAATATTCAG 
3UWAACCGAG TAGTAAGGCA AAATCCCGGG 
CTGGCAGGGC TGGAACATGA AGAAAGAGAA 
CACTACTTGA ATCAGTCTGG ATGTGTAGAA 
5 AGGGAAGTTA TTACGGCAAT GGACGTGATG 
TCGAGGCTGC TTGCTGGTAT ACTGCATCTT 
GCACAGGTTT CCTTCAAAAC AGCTTTGGGC 
ACACAGCTCA CA6ATGCTTT GACCCAGAGA 
ACGCCTCTCA ATGTTCAACA GGCAGTAGAC 

10 GCGTGCTGCT TTGAGTGGGT AATCAAGAAG 
TTCAAGTCTA TTGGCATCCT CGACATCTTT 
GAACAGTTCA ATATAAACTA TGCAAACOAG 
TTTTCTTTAG AACAACTAGA ATATAGCCGG 
ATAGACAATG GAGAATGCCT GGACTTGATT 

15 AATGAAGAAA GCCATTTTCC TCAAGCCACA 
CAGCATGCGA ATAACCACTT TTATGTGAAG 
AAGCACTATG CTGGAGAGGT GCAATATGAT 
ACATTTCGAG ATGACCTTCT CAATTTGCTA 
CTTTTTGAAC ATGTTTCAAG CCGCAACAAC 

20 CGGCGGCCTA CAGTCAGCTC ACAGTTCAAG 
AGCTCCTCTA ATCCTTTCTT TGTTCGCTGT 
CAGTTTGACC AGGCGGTTGT GCTGAACCAG 
AGAATCCGCA AAGCTGGGTA TGCGGTCCGA 
AAAGTGCTGA TGAGGAATCT GGCTCTGCCT 

25 CTGCAGCTCT ATGATGCCTC CAACAGCGAG 
CGAGAATCCT TGGAACAGAA ACTGGAGAAG 
ATGGTGATTC 6GGCCCATGT . CTTGGGCTTC 
TATTGTGTG6 TGATAATACA GAAGAATTAC 
CACCT6AAAA AGGCAGCCAT AGTTTTCCAG 

30 GTTTACAGAC AATTGCTGGC AGA6AAAAG6 
6AAGAAAAGA AGAAAC6GGA GGAAGAA6AA 
GAGCTCCGCG CCCAGCAGGA AGAAGAAACG 
AAGAGCCAGA AGGAAGCTGA ACTGACCCGT 
GTGGAAGAGA TCCTCC6TCT GGAGAAAGAA 

35 CAGGAGCTGT CGCTGACCGA GGCTTCCCTG 
CTCCGCAGGC TGGAGGAGGA AGCGTGCAGG 
TTCGACGAGA TCGACGAGTG TGTCCGGAAT 
TTTTCCAGCG AGCTGGCTGA GAGCGCATGC 
CCCTACCCAG AGGAGGAGGT CGATGAGGGC 

40 TCCCCCAACC CCAGCGAGCA CGGCCACTCA 
GATGACTCTT CAGAGGAGGA CCCATACATG 
GCGGACAGCA CGGTGCTGCT CGCCCCATCA 
TCCAGCGGCG AGTCCACCTA CTGCATGCCC 
GGCGACTACG ACTACGACCA GGATGACTAT 

45 GTGACCTTCT CCAACTCCTA CGGCAGCCAG 
ACXTTACAACA GCTCGGGTGC CTACCGGTTC 
GATAGTQAAG AGGACTTTGA TTCCAGGTTT 
GACTCTGTGT ACAGCTGTGT CACTCTGCCG 
6GCCTGATGA ACTCTTGGAA ACGCC6CTG6 

50 TTCCX3CTCCA AGCAGGAGGC CCTCAA6CAA 
TCCACGCT6T CCAGGAGAAA TTGGAAGAAG 
ATGTACTTT6 AAAACGACAG . CGAGGAGAAG 
AAA6AGATCA TAGATAACAC CACCT^GAG 
ACTTTCCACC TGATTGCAGA GTCCCCAGAA 

55 CAGGTCCACG CGTCCACGGA CCAGGAGATC 
CAGAATGCTG TGGGCACCTT GGATGTGGGG 
CCTGATAGAC CCAACTCGTT TGT6ATCATC 
GACACGCCGG AGGAGATGCA CCACTGGATA 
AGAGTGGAGG GCCAGGAATT CATCGTGAGA 

60 CCGAAGATGT CTTCACTGAA ACTGAAGAAA 
GATTACTACA AGAGTTCAGA GAAGAACGCG 
CTCTGCTCTG TCGTCCCCCC AGATGAGAAG 
ACCGTGTACG GGCGCAAGCA CTGTTACCGG 
. CGGTGGTCCA GTGCCATTCA AAACGTGACT 

65 CAGCAGCTGA TTCAAGATAT CAAGGAGAAC 
TACAAGCGGA ACCCGATCCT TCGATACACC 
CTTCCGTATG GGGACATATU^ TCTCAACTTG 
6ATGAGGCCA TCAAGATATT CAATTCCCTG 
CCAATAATCC AGGGCATCCT ACAGACA6GG 

70 TACTGCCAGC TTATCAAACA GACCAACAAA 
TACAGCTGGC AGATCCTGAC ATGCCTGAGC 
AAGTATCTCA AGTTCCATCT 6AAAA6GATA 
AAATACGCTC TCTTCACTTA CGAATCTCTT 
TCCCGAGAT6 AAATAGAAGC TCTGATCCAC 

75 CATG6CGGC6 GCTCCT6CAA GATCACCATC 
GAGAA6CTGA TCCGAG6CCT G6CCATGGAG 



GGCGGGAGAA TTGTAGATTA TTTATTAGAA 960 

GAAAGGAATT ATCACATATT TTATGCACTG 1020 

GAATTTTATT TATCTACGCC AGAA3\ACTAC X080 

GACAAGACAA TCAGTGACCA GGAATCCTTT 1140 

CAGTTCAGCA AGGAGGAA6T TCGGGAAGTG 1200 

GGGAACATAG AATTTATCAC TGCTGGTGGG 1260 

AGATCTGCGG AGTTACTTGG GCTGGACCCA 1320 

TCAATGTTCC TCAGGG6AGA AGAGATCCTC 1380 

AGCAGG6ACT CCCTGGCCAT GGCTCTGTAT 1440 

ATCAACAGCA GGATCAAAGG CAATGAGGAC 1500 

GGATTTGAAA ACTTTGAGGT TAATCACTTT 1560 

AAACTTCA6G AGTACTTCAA CAA6CATATT 1620 

GAAGGATTAG TGTGGGAAGA TATT6ACTGG 1680 

GAGAAGAAAC TTGGCCTCCT AGCCCTTATC 1740 

GACAGCACCT TATTGGAGAA GCTACACAGT 1800 

CCCAGAGTTG CAGTTAACAA TTTTGGAGTG 1860 

GTCCGAGGTA TCTTGGAGAA GAACAGAGAT 1920 

AGAGAAAGCC GATTTGACTT TATCTACGAT 1980 

CAGGATACCT TGAAATGTGG AAGCAAACAT 204 0 

GACTCACTGC ATTCCTTAAT GGCAACGCTA 2100 

ATCAAGCCAA ACATGCAGAA GATGCCAGAC 2160 

CTGCGGTACT CAGGGATGCT GGAGACTGTG 2220 

AGACCCTTTC AGGACTTTTA CAAAAGGTAT 2280 

GAGGACGTCC GAGGGAAGTG CACGAGCCTG 234 0 

TGGCAGCTGG GGAAGACCAA GGTCTTTCTT 24 00 

CGGAGGGAAG AGGAAGTGAG CCACGCGGCC 2460 

TTAGCACGAA AACAATACAG AAAGGTCCTT 2520 

AGAGCATTCC TTCTGAGGAG GAGATTTTTG 2580 

AAGCAACTCA GAGGTCAGAT T6CTCGGAGA 2640 

6AGCAAGAAG AAAAGAAGAA ACA6GAA6A6 2700 

AGAGAAAGAG AGAGAGAGCX3 AAGAGAA6CC 2760 

AGGAAGCAGC AAGAACTC6A AGCCTT6CAG 2820 

GAACTGGA6A AACAGAA6GA AAATAAGCAG 2880 

ATCGAGGACC TGCAGCGCAT 6AAGGAGCAG 2940 

CAGAAGCTGC AGGA6CGGCG GGACCAGGAG 3000 

GCGGCCCAGG AGTTCCTCGA GTCCCTCAAT 3060 

ATCGAGCGGT CCCTGTCGGT GGGAAGCGAA 3120 

GAGGAGAAGC CCAACTTCAA CTTCAGCCAG 3180 

TTCGAAGCCG ACGACGACGC CTTCAAGGAC 324 0 

GACCAGCGAA CAAGTGGCAT CCGGACCAGC 3300 

AACGACACGG TGGTGCCCAC CAGCCCCAGT 3360 

GTGCAGGACT CCGGGAGCCT ACACAACTCC 3420 

CAGAACGCTG GGGACTTGCC CTCCCCAGAC 34 80 

GAGGACGGTG CCATCACTTC CGGCAGCAGC 354 0 

TGGTCCCCCG ACTACCGCTG CTCTGTGGGG 3600 

AGCTCTGAGG GGGCGCAGTC CTCGTTTGAA 3660 

GATACAGATG ATGAGCTTTC ATACCGGCGT 3720 

TATTTCCACA GCTTTCTGTA CATGAAAGGT 3780 

TGCGTCCTCA AG6AT6AAAC CTTCTTGTGG 3840 

GGCTGGCTCC ACAAAAAA66 GGGGGGCTCC 3900 

CGCTGGTTTG TCCTCCGCCA GTCCAAGCTG 3960 

CTCAAGGGCA CCGTrAGAAGT GCGAACGGCA 4020 

AATGGGATCG ACATCATTAT GGCC6ATAG6 4080 

GATGCCAGCC AGTGGTTCAG CGTGCTGAGT 4140 

CAGGAGAT6C ATGATGAGCA GGCAAACCCA 4200 

CTGATTGATT CTGTGTGTGC CTCTGACAGC 4260 

ACGGCCAACC GGGTGCTGCA CTGCAACGCC 4320 

ACCCTGCTGC AGAGGTCCAA AGGGGACACC 4380 

GGATGGTTGC ACAAAGAGGT GAAGAACAGT 4440 

CGGTGGTTT6 TACTCACCCA CAATTCCCTG 4500 

CTCAAACTGG GGACCCTGGT CCTCAACAGC 4560 

ATATTCAAAG AGACAGGCTA CTGGAACGTC 4 620 

CTCTACACCA AGCTGCTCAA CGAGGCCACC 4680 

GACACCAAGG CCCCGATCGA CACCCCCACC 4740 

TGCCTGAACT CGGATGTGGT GGAACAGATT 4800 

CATCACCCCT TGCACTCCCC GCTCCTGCCC 4860 

CTCAAA6ACA AAGGCTATAC CACCCTTCAG 4920 

CA6CAACTGG AGTCCAT6TC TGACCC7VATT 4980 

CATGACCT6C 6ACCTCTGCG GGACGAGCTG 5040 

GT6CCCCACC CC6GCAGTGT GGGCAACCTG 5100 

TGCACCTTCC TGCC6AOTCG AGGGATTCTC 5160 

CGGGAACAGT TTCCAGGAAC CGAGATGGAA 5220 

AAGAAAACCA AATGCCGAGA GTTT6TGCCT 5280 

AGGCAGGAAA TGACATCCAC GGTCTATTGC 5340 

AACTCCCACA CCACTGCT6G GGAGGTG6TG 5400 

GACAGCAGGA ACAT6TTTGC TTT6TTT6AA 5460 
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TACAACGGCC ACGTCGACAA AGCCATTGAA AGTCGAACCG TCGTAGCTGA TGTCTTAGCC 5520 

AAGTTT6AAA AGCTGGCTGC CACATCCX3A6 GTTGGGGACC T6CCATGGAA ATTCTACTTC 5580 

AAACTTTACT GCTTCCTGGA CACAGACAAC GTGCCAAAAG ACAGTGTGGA GTTTGCATTT 5640 

ATGTTTGAAC AGGCCCACGA AGCGGTTATC CATGGCCACC ATCCAGCCCC GGTU^GTVAAAC 5700 

5 CTCCAG6TTC TTGCTGCCCT GCGACTCCAG TATCTGCAGG G66ATTATAC TCTGCACGCT 5760 

GCCATCCCAC CTCTCGAAGA GGTTTATTCC CTGCAGAGAC TCAAGGCCCG CATCAGCCAG 5820 

TCAACCAAAA CCTTCACCCC TTGTGAACGG CTGGAGAAGA GGCGGACGAG CTTCCTAGAG 5880 

GG6ACCCTGA G6CGGAGCTT CCX3GACAGGA TCCGTGGTCC GGCAGAAGGT CGAGGAGGAG 5940 

CAGATGCTGG ACATGT6GAT TAAGGAAGAA GTCTCCTCTG CTCGAGCCAG TATCATTGAC 6000 

10 AAGTGGAGGA AATTTCAGGG AATGAACCAG GAACAGGCCA TGGCCAAGTA CATGGCCTTG 6060 

ATCAAGGAGT GGCCTGGCTA TGGCTCGACG CTGTTTGATG TGGAGTGCAA GGAAGGTGGC 6120 

TTCCCTCAGG AACTCTGGTT GGGTGTCAGC GCGGACGCCG TCTCCGTCTA CAAGCGTGGA 6180 

GAGGGAAGAC CACTGGAAGT CTTCCAGTAT GAACACATCC TCTCTTTTGG GGCACCCCTG 6240 

GCGAATACGT ATAAGATCGT GGTCGATGAG AGGGAGCTGC TCTTTGAAAC CAGTGAGGTG 6300 

15 GTGGATGTGG CCAAGCTCAT GAAAGCCTAC ATCAGCATGA TCGTGAAGAA GCGCTACAGC 6360 

ACGACACGCT CCGCCAGCAG CCAGGGCAGC TCCAG GTGAA GGCGGGACAG AGCCCACCTG 6420 

TCTTTGCTAC CTGAACGCAC CACCCTCTGG CCTAGGCTGG CTCCAGTGTG CCATGCCCAG 6480 

CCAAAACAAA CACAGAGCTG CCCAGGCTTT CTGGAAGCTT CTGGTCTGAG GGAGGTGTCT 6540 

CCGAGGATCC TTTTGCCTGC CGCCTTCATT GATCCTGTAT TAAGCTGTCA ACTTTAACAG 6600 

20 TCTGCACAGT TTCCAAAGCT TTACTACTCT TAGAGGACAC ATGCCTTAAA AAAGGAGGGG 6660 

AGGAACCACG CTGCCACCAA AGCAGCCGGA AGTGCCTTAA CTTGTGGAAC CAACACTAAT 6720 

CGACCGTAAC TGTGCTACTG AAGGGAACTG CCTTTCCCCC TTCTGGGGGA GACTTAACAG 6780 

AGCGTGGAAG GGGGGCATTC TCTGTCAATG ATGCACTAAC CTCCCAACCT GATTTCCCC6 6840 

AATCTGAGGG AAGGT6AGGG A6TG6GAA6G G6GAT66AGA GCTCGAGGGG ACAGT6TGTT 6900 

25 TGA6CTGGAG TGCTGCGGGC AGCCTTTCTC ATGGAATGAC ATGAATCAAC TTTTTTCTTT 6960 

GTTTCATCTT TTAAGTGTAC GTGCTT6CCT GTTCGTGCAT GTGTTCATAA ACTCAACACT 7020 

TTAATCATGG TTTCATGA6C ATTAAAAAGC AAAGGGAAAA A6GATGTGTA ATGGT6TACA 7080 

CAGTCTGTAT ATTTTAATAA TGCAGAGCTA TAGTCTCAAT TGTTACTTTA TAAGGTG6TT 7140 

TTATTAACAA ACCCAAATCC TGGATTTTCC TGTCTTTGCT GTATTTTGAA AAACACGTGT 7200 

30 TGACTCCATT OTTTTACATG TAGCAAAGTC TGCCATCTGT GTCTGCTGTA TTATAAACAG 7260 

ATAAGCAGCC TACAAGATAA CTGTATTTAT AAACCACTCT TCAACAGCTG GCTCCAGTGC 7320 

TGGTTTTAGA ACAAG7VATGA AGTCATTTTG GAGTCTTTCA TGTCTAAAAG ATTTAAGTTA 7380 

AAAACAAAGT GTTACTTGGA AGGTTAGCTT CTATCATTCT GGATAGATTA CAGATATAAT 744 0 

AACCATGTTG ACTATGGGGG AGAGACGCTG CATTCCAGAA ACGTCTTAAC ACTTGAGTGA 7500 

35 ATCTTCAAAG GACCCTGACA TTAAATGCTG AGGCTTTAAT ACACACATAT TTTATCCCAA 7560 

GTTTATAATG GTGGTCTGAA CAAGGCACCT GTAAATAAAT CAGCATTTAT GACCAGAAGA 7620 

AAAATAATCT GGTCTTGGAC TTTTTATTTT TATATGGAAA AGTTTTAAGG ACTTGGGCCA 7680 

ACTAAGTCTA CCCACACGAA AAAAGAAATT TGCCTTGTCC CTTTGTGTAC AACCATGCAA 7740 
AACTGTTTGT TGGCTCACAG AAGTTCTGAC AATAAAAGAT ACTAGCT 
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Seq ID NO: 207 Protein sequence; 
Protein Accession #: HP 036466 



1 11 21 31 41 51 

45 I I I I I I 

MDNFPTEGTR VWLRENGQHP PSTVNSCAEG IWFRTDYGQ VPTYKQSTIT HQKVTAMHPT 60 

NBE6VDDMAS LTBLHGGSZN YNIiFQRYKRN QIYTYI6SZL ASVNPYQPIA GIiYEPATMEQ 120 

ySRRHLGEI>P PHXFAIANEC YRCIiWKRYDN QCIIiISGESG AGKTESTKIiI LKFLSVISQQ 180 

SIiEIiSLKEKT SCVBRAIZjES SPZMEAFGHA KTVYNNNSSR FGKFVQIiNZC QKGNIQ6GRI 240 

50 VDYLLEKNRV VRQNPGERNY HIFYALLAGL EHEEREEFYL STPENYHYLN QSGCVEDKTI 300 

SDQBSFREVI TAMZ>VMQFSK EEVRBVSRLL AGIliHIiGMIE FITAGGAQVS FKI'AIjGRSAE 360 

IjLGLDPTQ1»T DALTQRSMFL RGEEIIiTPliN VQQAVDSRDS lAMALYACCF EWVIKKINSR 420 

IKGNEDFKSI GILDIFGFEN FEVNHFEQFN INYANEKLQE YFNKHIFSLE QLEYSREGLV 480 

WEDIDWIDNG ECliDLIEKKL GLIiAIiINEES HFPQATDSTI. LEKIiHSQHAN NHFYVKPRVA 540 

55 .VNNFGVKHYA GEVQYDVRGI LEKNRDTFRD DLIoNLLRESR FDPIYDIiFEH VSSRNNQDTIi 600 

KCGSKHRRPT VSSQPKDSLH SLMATLSSSN PFFVRCIKPN MQKMPDQFDQ AWliNQIiRYS 660 

GMLETVRIRK AGYAVRRPFQ DPYKRYKVLM RNliALPEDVR GKCTSLLQLY DASNSEWQLG 720 

KTKVFIiRESL EQKLEKRREE BVSHAAMVIR AHVLGFLARK QYRKVIiYCW IIQKNYRAFli 780 

IiRRRFIiHI*KK AAIVFQKQLR GQIARRVYRQ LLAEKREQEE KKKQEEEEKK KREEBERERE 840 

60 RERREAELRA QQEEETRKQQ ELEALQKSQK EAELTRELEK QKENKQVEEI IiRLEKEIEDIj 900 

QRMKEQQELS LTEASLOKLQ ERRDQELRRL EEEACRAAQE FLESLNFDEI DECVRNIERS 960 

IiSVGSEFSSE LAESACEEKP NFNFSQPYPE EEVDEGFEAD DDAFKDSPNP SEHGHSDQRT 1020 

SGIRTSDDSS EEDPYMNDTV VPTSPSADST VIiIiAPSVQDS GSMNSSSGE STYCMPQNAG 1080 

DLPSPDGDYD YDQDDYEDGA ITSGSSVTFS NSYGSQWSPD YRCSVGTYNS SGAYRFSSEG 1140 

65 AQSSFEDSBB DFDSRFDTDD BLSYRRDSVY SCVTIiPYFHS FIiYMKGGLMN SWKRRWCVLK 1200 

DETFLWFRSK QEALKQGWXiH KKGGGSSTLS RRNWKKRWFV LRQSKLMYFE NDSEBKIjKGT 1260 

VEVRTAKBII DNTTKENGZD IIMADRTFHL ZABSPEDASQ WFSVI>SQVHA STDQBZQBMH 1320 

DBQANPQNAV GTIjDVGLZDS VCASDSPDRP NSFVZZTANR VIiHCNADTPB EMHHWITIiIjQ 1380 

RSKGDTRVEG QBFZVRGWLH KBVKHSPKMS SliKLKKRHFV liTHHSIiDYYK SSBKNAIiKIiG 1440 

70 TLVLNSLCSV VPPDEKZFKE TGYWNVTVYG RKHCYRLYTK IiUIBATRWSS AZQNVTDTKA 1500 

PZDTPTQQLZ QDXKENCLNS DWEQZYKRN PZIiRYTHHPL HSPUtPLPYG DINLNliIiKDK 1560 

GYTTLQDEAZ KZFNSLQQLE SMSDPZPZZQ GZIiQTGHDLR PLRDBliYCQL IKQTNKVPHP 1620 

GSVGNIiYSWQ ZLTCLSCTFL PSRGZLKYIiK FHLKRZREQF PGTEMEKYAI, FTYESLKKTK 1680 

CREFVPSRDB lEALZHRQEM TSTVYCHGGG SCKZTXNSHT TAGEWEKIiI RGIiAMEDSRN 1740 

75 MFALFBYNGH VDKAIESRTV VADVIjAKFBK IiAATSEVGDL PWKFYFKLYC PIjDTDNVPKD 1800 

SVEFAFMFEQ AHEAVZHGHH PAPEENLQVI* AAIiRLQYIiQG DYTLHAAZPP LEEVYSLQRL 1860 



277 



BNSOOCID: <WO 02079492A2_L> 



wo 02/079492 



PCT/US02/04915 



KARISQSTKT FTPCERLEKR RTSFLEGTLR RSFRTGSWR QKVEEEQMLD MWIKEEVSSA 1920 

RASIIDKWRK FQGMNQEQAM AKYMALIKEW PGYGSTLFDV ECKEGGFPQE LWLGVSADAV 1980 

SVYKRGEGRP LEVFQYEHIIi SFGAPIiANTY KIWDERELL FETSEWDVA KMKAYISMI 204 0 
VKKRYSTTRS ASSQGSSR 



Seg ID NO: 208 DNA aeouence 

Nucleic Acid Accession #: XM;_059761.1 

Coding sequence: 124-525 (underlined sequences correspond to start and stop codons) 



10 



15 



20 



25 



CGAAGATCTA 
6ATGATCTCA 
GGCATCGCTC 
TCTCTGA6CG 
GATTCTGTTIA 
TCAAATACCC 
GTGAGAAGTT 
GGCTGCCGTC 
ATTTTCTGTT 
CTCTGTGTAA 
TACTGCTTCT 
ACAACAGGTC 
TCAGAATGAA 
TGGA6GTGTT 



11 
I 

TCCAAAATCA 
ATCATGTGGA 
TTATGGAAGT 
AGACA6TGAA 
ATGAAACCCA 
AAGATGCTTC 
ACAACTCTGA 
CTTGTGAGGA 
TCAAGCTTCT 
CACTAACATT 
ATTTTGAAAA 
CTAGCATGTA 
TGCAGTTGTG 
TGTTTTCTCC 



21 
1 

AGAAGCCTTT 
■ TTTGAATGTG 
TAACCTATTA 
GAAAGTGGAA 
GTTTTGTGTT 
AGTGTCCATA 
AGTGAAGCTG 
TGGAGCTTCA 
GTACTTTATG 
TCCAGTAGTC 
AAGAGTTTTT 
TAGCTGCATA 
TGTCTATATT 
AGAATAAAGG 



31 

1 ' 

GATTTAGATG 
TGTACAAGCT 
AGTGGCTTTA 
TATGATCATG 
AATATTCCTG 
GTGGATTACT 
TCCTCCTGTG 
GGCTCCCATC 
GAACTTTGGC 
ACATGTGATT 
TTTCTTTCTA 
GATTTCTTCA 
TTCCCCTCTC 
TATTACTTTA 



41 
I 

TTGCTGTAAA 
TTTCGGGCCC 
TGGT6CCTTC 
GAAAACTCAA 
CTGTGAGAAA 
ATGAGCCAAG 
ACCTTTGCAG 
ATCACTCTTC 
TG TGA TTTAT 
GTTTTGTTTT 
TGGGGTTGCA 
CCTGATCTTT 
AAAATCTTTT 
G 



51 
I 

AGj\AAATAAA 
GGGTAG6AGT 
AGAAGCAATT 
CCTCTATTTA 
CTTTAAAGTT 
GAGACAGGCG 
TGATGTCCAG 
AGTCATTTTT 
TTTTAAAGGA 
CGTAGAAGAA 
GGGATGGTGT 
GTGTGGAAGA 
AGAATTTTTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
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35 



Seq ID NO: 209 Protein sequence: 
Protein Accession #: XP 059761.1 



11 



21 



31 



41 



51 
I 



HALMEVNUiS 6FMVPSBAIS IjSETVKKVEY DHGKIiNIjYIiD SVMETQPCVII IPAVRNFKVS 60 
IIT<»}ASVSIV DYYEPRRQAV RSYNSBVKI.S SCDXiCSDVQG CRPCEDQASG SHHHSSVIFI 120 
FCPKLLYFME LWL 
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45 



50 



55 



60 



65 



70 



Seq ID NO; 210 DNA sequence 
Nucleic Acid Accession #: NM_015472 

Coding sequence: 258-1460 (underlined sequences correspond to start and stop codons) 



1 
I 

GACACACTCC 
AGCTCACATC 
AGAAGACATG 
QACTCTTATC 
TGGTGATTTT 
AGCAAGT6AT 
TCATGAATCC 
AGCCTGATTC 
GGCCTCGACT 
AGCTGGGCAC 
AGCAGTCCTA 
CGGCCACTGG 
CTAGGAAGGC 
CACCAGTGCC 
ACCAGCAGCA 
CCGCAGGGCT 
TGCGGCTTCA 
TGAGGCAGGA 
TTCAGGCTGC 
GCTCAGATCC 
GCCTGGGGTT 
ATGAGATGGA 
CCCGTTTCCC 
AATCTGAAGA 
CCTTTCTAAC 
TTCCTGGGCC 
GCCTCCATGA 
ATTGA6AGTA 



11 
I 

TCTACAACAC 
CTGGCGACTC 
TACATTCAGT 
TTGGCATCTG 
CCAGAA GATG 
CCACGTCACG 
GAA6CCTAGC 
GGGCTCGCAC 
G6CTGGGG6T 
CGGCGCGGGT 
CGACGTGACC 
CCAGAGGTAC 
GATGAATCAG 
TCAGAGGTCC 
GATGGCCCCC 
CATGAGTATG 
GAGAATCCAG 
AGCTGCCCTC 
TGTCAACCCA 
TTTCCTCAAT 
AGGGTGCTAC 
TACAGGAGAA 
TGATTTCCTt 
CCTGATCCCC 
CTGGCTGTAA 
TCTTGGAAAA 
CTCGTGCTCC 
ACTTAA6TTA 



21 
I 

CAGAGACTCC 
TCAAGAGAGA 
ATCTATTTTG 
CTTCCTGGTG 
AATCCGGCCT 
CAG6ACCTAG 
TCGTGGCGGA 
TCGCGCCAGT 
GCCCAGCATG 
GCTGCGGGTA 
GACGAGCTGC 
TTCCTCAATC 
CCTCTGTU^TC 
ATGGCAGTAT 
AGTACCCTGA 
CCCAATGCGC 
ATGGAGAGAG 
TGTCGACAGC 
CCCACGATGA 
GGAGGGCCAT 
AGTGTCCCCA 
AACGCAGGAC 
GACTGTCTTC 
CTCTTCAATG 
TCACTACCAT 
AGTOATGGAG 
CTCCTTTTTA 
AACATAAATA 



31 
I 

CAAACACAAG 
AACCTCAGAG 
GCATTTTCCC 
GATCTGAACT 
CGGCGCCCCC 
ACACAGACCT 
AGAAGATCCT 
CCAGCACGGA 
TCCGCTCGCA 
GCCCCGCGCA 
CACTGCCCCC 
ACATAGAAAA 
ATATGAACCT 
CCCAGCCAAA 
GCCAGCAGAA 
TGACCACTCA 
AAAGGATTCG 
TCCCCATGGA 
CCCCAGACAT 
ATCATTCGAG 
CAACTCCGGA 
AAACACCCAT 
CAGGAACAAA 
ATGTAGAGTC 
TGTAACTTGG 
CAGAGCAAGT 
TGTTGCCAGT 
AATATTCTAT 



41 
I 

GCCTTATATT 
TGACTAAAAT 
CAATACATCT 
GACCCATAAG 
TCCGCTCCCG 
CGAAGCCCTC 
GCCGGAGTCT 
CTCGTCGGGC 
CTCGTCGCCC 
GCA6CAC6CG 
GGGCTG6GAG 
AATCACCACA 
CCACCCTGCC 
TCTCGTGATG' 
CCACCCCACT 
GCAGCAGCAG 
AATGCGCCAA 
AGCTGAGACT 
GAGATCCATC 
GGAGCAGAGC 
GGACTTCCTC 
GAACATCAAT 
CGTTGACTTA 
TGCTCTGAAC 
ATGTAGCCAT 
CTGCAGGTGC 
TTAATCATTG 
TTTCATTTTC 



51 

1 

GACTCATTTC 
CTCCATAATG 
CTGCTCATCT 
CCACGCTTAC 
CCGCCTGGGC 
TTCAACTCTG 
TTCTTTAAGG 
GGCCT^CCGG 
GCGTCCCTGC 
CACCTCCGCC 
ATGACCTTCA 
TGGCAA6ACC 
GTCAGTTCCA 
AATCACCAAC 
CAGAACCCAC 
CAGCAGAAAC 
GAGGAGCTCA 
CTTGCCCCAG 
ACTAATAATA 
ACTGACAGTG 
AGCAATGTGG 
CCCCAACAGA 
GGAACTTTGG 
AAAAGTGAGC 
GACCTTACAT 
ACCACTTCCC 
CCTGGTTTTG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 



75 



Seq ID NO: 211 Protein seouenee; 
Protein Accession #: NP 056287.1 
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MNPASAPPPL PPPGQQVZHV TQDLDTDIiEA IiFNSVMMPKP 5SWRKKILPE SFFKEPDSGS 60 

HSRQSSTDSS GGHP6PRLAG 6AQHVRSHSS PASLQL6TGA GAAGSPAQQH AHIiRQQSYDV 120 

TDBIiPLPPGW EMTFTAT6QR .YFLNHZEKXT THQDPRKAMN QPLNHMNLHP AVSSTPV£^R IBO 

SMAVSQPNLV MNHQHQQQMA PSTLSQQNHP TQNPPAGLMS MPNALTTQQQ QQQKI.RLQRZ 240 

S QMERERIRMR QEELMRQEAA LCRQLPMEAE TLAPVQAAVN PPTMTPDMRS ZTNNSSDPFIi 300 

NGGPYHSRBQ STDSGLGLGC YSVPTTPEOF LSNVDEMDTG ENA6QTPMNZ NPQQTRFPDF 360 
USCIiPGTNVD liGTIiESEDIiZ PLFNDVESAL NKSEPFIiTVIL 

Seq ID NO: 212 DNA aecaience 
10 Nucleic Ac id Accession NM_0 18174 

Coding sequence: 176-2194 (underlined sequences correspond to start and stop codons) 



15 



20 



25 



30 



35 



40 



45 



50 



CATCTCCCCC 
GCGCGGCGAG 
TCTGCCACTC 
CGTGGGCCGG 
GGCCTCTGTG 
CGTGCTGTTC 
GCACTTGAGG 
A6CCGAGAGC 
CACCCACCCT 
TGA6GCCCCA 
CAAACCGAGT 
CAACCTCAAG 
CCACTCTG6C 
T^AAGCCAGC 
CAGCCTGGAG 
CGCCAGCTCA 
6GGCAGCGAG 
ACCCACAGTG 
GCACTCGACC 
CGCCCCCACC 
GGCTTCCCCA 
GGCGGTGCCA 
GTCACAGGAA 
GTCCCTGCCC 
AGACGAAGAC 
GGTCCCCCCA 
CCCCAAGACA 
dAACTCACGC 
TGCTGGTGGG 
AGGCCGGGCA 
GGGGTCAGCC 
GGACCTGGCC 
GCGCGTGCX3C 
GC666CC6TC 
GACCCTGATC 
CCGGCACCAG 
TGAOGCCTTC 
AGCCCAGCCC 



AACCTGGGGG 
GATGAGGCGG 
AGCCX3CGGCC 
CTGGACATGT 
TGCGCCCTGC 
CCCGGTTGCA 
TTCCTGCGAG 
AAAGAGAGCG 
AGACCTGGCC 
CGCMGACTG 
6TCTCCCGGA 
AAGACGAATG 
TTCCCGCCGG 
CCCCCCAGTG 
CTGGG6CCGA 
ATCCCAA66C 
CGGCTGTCGC 
ACCACACCCA 
GAGGTGGACX3 
AGTGAGGCTG 
CACGATGTGG 
ATGGCACCGG 
CGGGCAGGTG 
ACCCTGTCTG 
ACAGAGGGCT 
CCACTGCCTG 
GCACGGCAAA 
GCTGCCGCCC 
GACCGTGCCA 
CCCCTGTCCA 
AGCAGCCGGC 
TACCTGCCCA 
GCGCTCTGCT 
CT6GACGCX3C 
CCCACTTTCG 
GCGCTGGGCA 
CCGGCCTGCA 
GCCTGTCCCT 



TCGTGTTCTT 
AGCTGGCGCT 
CCGTGCCAGC 
ATGTGCTGCA 
TGGTGT6GCA 
CCCCGCCCGC 
AGCCCGTGGT 
TGGGCTCCCG 
AGGAGCGCCC 
AGAAAGAAGC 
CCCAGCC6C6 
CCCAGGCGGC 
TGGCAAATGG 
CAGCCT6CGG 
TCCCAGCCGG 
CACGCACACC 
TGAGCCCACT 
CGGTGACCAC 
AGTCCCTGTC 
GGCTGAGCCT 
ACCTGTGCCT 
CACCTGCGTC 
GGCTGGGGGC 
ACTCGGATCC 
TTGGAGTCCC 
ACCCATCCAG 
CGGAGAACGT 
CCAAAGCCAC 
GCCGACCACT 
GAAAGTCCTC 
CCGGGGTGTC 
6CGGGA6CA6 
AC6TCATCAG 
TACTGGCCAG 
ACTCGGT6GC 
TCACGGTGTT 
AGGTGGAGTT 
AGATTCA6CC 



CAACGCCTGC 
GAGCCTCCTG 
CAAACCCACC 
CCCGCCCTCC 
CCCCGCCGGC 
CTGCCTCCTG 
GAC6CCCCAG 
GGACAGCTCG 
TGGG6TGGCC 
CAAGACCCCC 
6GAGGTGCGC 
ACCCAAGCCC 
ACCCXX3CAGC 
CTCTCCGGCC 
G6AGGAGAAQ 
CTCCCCTGAG 
GCGGGGC6GG 
GCCCTCACTA 
GGTGTCCTTT 
CCCGCTGCGT 
GGTGTCACCC 
CCCCGGCAGC 
CGAGGAGACG 
CGTGCCCCTG 
TCGCCACGAC 
CATCTGCATG 
CAGCCGCACC 
TCCAGTGGCT 
CAGTGCCCGG 
AACCCCCAA6 
AGCCACCCCA 
CGCCCACCTG 
TGGCCAGGAC 
CAA6CAGCAT 
CATGCATACG 
GG6CAQCAAC 
CTAGCCX:CAT 
ACATCAGAAA 



GAGGCCGCGT 
GCGCAGCTGG 
GTGCTCTTCG 
GCCGGCGCCG 
CCCGGCGAGA 
GACGGCCTGG 
GACCT6GAGG 
AAGAGAGAGG 
CGCAAGGAGC 
CG6GAGTTGA 
GG6GCA6CCT 
CX3CAAAGCGC 
CC6CCCAGCC 
TCCCAGCTGG 
GCACTGGA6C 
TCCCACCX3GA 
GAGGCCGGGC 
CCCGCAGAGG 
GAGCAGGTGC 
GGCCCCCGGG 
TGTGAATTTG 
TCGAATGACA 
CCACCCACAT 
GCCCCCGGTG 
CCTTTGCCTG 
GTGGACCCCG 
CGGAAGCCCC 
GCTGCCAATJi 
AGTGAGCCCA 
ACTGCCACTC 
CCCAAGTCCC 
GTGGAT6AGG 
CAGCGCAAGG 
TGGGACCGTG 
TGGTACGCA6 
6GCATG6TGT 
CX3CCGACACG 
TAAACTGTGA 



CGCGGCTGGC 
GCATCACGCC 
AGAAG ATGG G 
AGCGCACGCT 
AGGTGGTGCG 
TCCGCCTGCA 
GGCCGGGGCG 
GCCTCCTGGC 
CAGCACGGGC 
AGAAAGACCC 
CTTCTGTGCC 
CCAGCACGTC 
TCCGATGTGG 
TGGCCACGCC 
T6CCTTTGGC 
GCCCCGCAGA 
CAGACGCCTC 
TGGGCTCCCC 
TGCCGCCATC 
CGCGGCGCTC 
AGCATCGCAA 
GCAGTGCCCG 
CGGTCAGCGA 
CGGCAGACTC 
ACCCCCTCAA 
AGATGCTGCC 
TGGCCCGCCC 
CCAAGGGGCT 
GTGA6AAG6G 
GAGGCCC6TC 
CGGTCTACCT 
AGTTCTTCCA 
AGGAAGGCAT 
ACCTGCAGGT 
AGACGCAGGC 
CCAT6CAG6A 
CCCCCCACTC 
CTACACTTG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 



55 



60 



65 



Seq ID NO: 213 Protein sequence; 
Protein Accession #: NP 060644.1 



MGVGRIiDMYV 
LQHIiRFLREP 
RAEAPRKTEK 
TSHSGFPPVA 
lAASSIPRPR 
SPHSTEVDES 
RKAVPMAPAP 
DSDEDTEGFG 
RPNSRAAAPK 
PSGSASSRPG 
GMRAVLDAIiL 
QDDAFPACKV 



liHPPSAGAER 
WTPQDLEGP 
EAKTPRELKK 
NGPRSPPSLR 
TPSPESHRSP 
LSVSFEQVLP 
ASPGSSNDSS 
VPRHDPLPDP 
ATPVAAAKTK 
VSATPPKSPV 
ASKQHWDRDIi 
EF 



TUVSVCAIjIiV 
GRAESKESVG 
DPKPSVSRTQ 
CGEASPPSAA 
AEGSERIiSIiS 
PSAPTSEAGL 
ARSQERAGGL 
LKVPPPLPDP 
GliAGGDRASR' 
YLDIiAYIiPSG 
QVTIjIPTFDS 



WHPAGPGEKV 
SRDSSKREGL 
PREVRRAASS 
CGSPASQLVA 
PIiRGGBAGPD 
SI^PLRGPRAR 
GAEETPPTSV 
SSICMVDPEM 
PIiSARSEPSE 
SSAHIiVDEBF 
VAMHTWYAET 



VRVLFPGCTP 
liATHPRPGQE 
VPNLKKTNAQ 
TPSLELGPIP 
ASPTVTTPTV 
RSASPHDVDL 
SESIiPTLSDS 
IiPPKTARQTE 
K60RAPI.SRK 
FQRVRALCYV 
HARHQALGZT 



PACIjIjDGIiVR 
RPGVARKEPA 
AAPKPRKAPS 
AGEEKALBLP 
TTPSIjPAEVG 
CliVSPCEFEH 
DPVPIiAPGAA 
NVSRTRKPIiA 
SSTPKTATR6 
ISGQDQRKEE 
VLGSNGMVSM 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



Seq ID NO: 214 DNA sequence 
70 Nucleic Acid Accession #: NM_002019.1 

Coding sequence: 250-4266 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

_^ I I I I I I 

75 GCGGACACTC CTCTCGGCTC CTCCCCGGCA GCGGCGGCGG CTCGGAGCGG GCTCCGGGGC 
TCGGGTGCAG CGGCCAGC6G GCCTGGCG6C GAGGATTACC CGG6GAAGTG GTTGTCTCCT 



60 
120 



279 
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GGCTGGAGCC GCGAGACGGG CGCTCAGGGC 
GACTCTGGCG GCCGGGTCGT TGGCCGGGGG 
GCGCTCACCA^IGGTCAGCTA CTGGGACACC 
CTGCTTCTCA CAGGATCTAG TTCAGGTTCA 
5 GGCACCCAGC ACATCATGCA AGCAGGCCAG 
GCCCATAAAT GGTCTTTGCC TGAAATGGTG 
AAATCTGCCT GTGGT^GAAA TGGCAAACAA 
CAAGCAAACC ACACTGGCTT CTACAGCTGC 
AAGGAAACAG AATCTGCAAT CTATATATTT 

10 ATGTACAGTG AAATCCCCGA AATTATACAC 
TGCCGGGTTA CGTCACCTAA CATCACTGTT 
ATCCCTGATG GAAAACGCAT AATCTGGGAC 
ACGTACAAAG AAATAGGGCT TCTGACCTGT 
ACAAACTATC TCACACATCG ACAAACCAAT 

15 CGCCCAGTCA AATTACTTAG AGGCCATACT 
TTGAACACGA GAGTTCAAAT GACCTGGAGT 
GTAAGGCGAC GAATTGACCA AAGCAATTCC 
ATTGACAAAA T6CA6AACAA AGACAAAGGA 
TCATTCAAAT CT6TTAACAC CTCAGTGCAT 

20 CATCGAAAAC AGCAGGTGCT TGAAACCGTA 
AAAGTGAAGG CATTTCCCTC GCCGGAAGTT 
GAGAAATCTG CTCGCTATTT GACTCGTGGC 
GAGGATGCAG GGAATTATAC AATCTTGCTG 
CTCACTGCCA CTCTAATTGT CAATGTGAAA 

25 TTTCCAGACC CGGCTCTCTA CCCACTGGGC 
GGTATCCCTC AACCTACAAT CAAGTGGTTC 
GCAAGGTGTG ACTTTTGTTC CTVATAATGAA 
ATGGGAAACA GAATTGAGAG CATCACTCAG 
ATGGCTAGCA CCTTGGTTGT GGCTGACTCT 

30 TCCAATAAAG TTGGGACTGT GGGAAGAAAC 
GGGTTTCATG TTAACTTGGA T^AAAATGCCG 
ACAGTT7UVCA AGTTCTTATA CAGAGACGTT 
AGAACAAT6C ACTACAGTAT TAGCAAGCAA 
ACTCTTAATC TTACCATCAT GAATGTTTCC 

35 GCCAGGAATG TATACACAGG GGAAGAAATC 
CAGGAA6CAC CATACCTCCT GCGAAACCTC 
ACCACTTTAG ACEGTCATGC TAATGGTGTC 
AACCACAAAA TACAACAAGA GCCTGGAATT 
ATTGAAAGAG TCACA6AA6A 6GATGAAGGT 

40 GGCTCTGTGG AAAGTTCAGC ATACCTCACT 
GAGCTGATCA CTCTAACATG CACCTGTGTG 
CTCCTTATCC GAAAAATGAA AAGGTCTTCT 
ATAATGGACC CAGATGAAGT TCCTTTGGAT 
AGCAAGTGGG AGTTTGCCCG GGAGAGACTT 

45 TTTGGAAAAG TGGTTCAAGC ATCAGCATTT 

GTGGCTGTGA AAATGCTGAA AGAGGGGGCC 
GAGCTAAAAA TCTTGACCCA CATTGGCCAC 
TGCACCAAGC AAGGAGGGCC TCTGATGGTG 
TCCAACTACC TCAAGAGCAA ACGTGACTTA 

50 ATGGAGCCTA AGAAAGAAAA AATGGAGCCA 
GATAGCGTCA CCAGCAGCGA AAGCTTTGCG 
AGTGAT6TTG AGGAAGAGGA GGATTCTGAC 
GATCTGATTT CTTACAGTTT TCAAGTGGCC 
TGCATTCATC GGGACCTGGC AGCGA6AAAC 

55 ATTT6TGATT TTGGCCTTGC CCGGJGATATT 
GATACTCXSAC TTCCTCTGAA ATGGATGGCT 
ACCAAGAGCG AC6TGT6GTC TTACGGAGTA 
TCTCCATACC CAGGAGTACA AATGGATGAG 
AGGATGAGAG CTCCTGAGTA CTCTACTCCT 

60 CACAGAGACC CAAAAGAAAG GCCAAGATTT 
CTTCAAGCAA ATGTACAACA GGATGGTAAA 
GGAAATAGTG GGTTTACATA CTCAACTCCT 
ATTTCAGCTC CGAAGTTTAA TTCAGGAAGC 
AAGTTCATGA GCCTGGAAAG AATCAAAACC 

65 ATGTTTGATG ACTACCAGGG CGACAGCAGC 
TTCACCTGGA CTGACAGCAA ACCCAAGGCC 
AAAAGTAAGG AGTCGGGGCT GTCTGATGTC 
GGGCACGTCA GCGAAGGCAA GCGCAGGTTC 
ATCGCGTGCT GCTCCCCGCC CCCAGACTAC 

70 ATC TAGA GTT TGACACGAAG CCTTATTTCT 
AACTAGCTTT TGCCAGTATT ATGCATATAT 
CAGCTGCTTT TTGTGATTTT TTTAATAGTG 
TCCAGATA6A GAAATA6T6A CAAGTGAAGA 
TGTTAGAGAA ATCCTTCCTA AACCCAATGA 

75 GCACGCAGGA CCAGTTTGAT TGAGGAGCTG 
CTGGGCCAGC CCTGCAGCCC AAAACCCAGG 



GCGGGGCCGG CGGCGGCGAA CGAGAGGACG 180 

AGCGCGGGCA CCGGGCGAGC AGGCCGCGTC 24 0 

GGGGTCCTGC TGTGCGCGCT GCTCAGCTGT 300 

AAATTAAAAG ATCCTGAACT GAGTTTAAAA 360 

ACACTGCATC TCCAATGCAG GGGGGAAGCA 420 

AGTAAGGAAA GCGAAAGGCT GAGCATAACT 4 80 

TTCTGCAGTA CTTTAACCTT GAACACAGCT 540 

AAATATCTAG CTGTACCTAC TTCAAAGAAG 600 

ATTAGTGATA CAGGTAGACC TTTCGTAGAG 66 0 

ATGACTGAAG GAAGGGAGCT CGTCATTCCC 720 

ACTTTAAAAA AGTTTCCACT TGACACTTTG 780 

AGTAGAAAGG GCTTCATCAT ATCAAATGCA 840 

GAAGCAACAG TCAATGGGCA TTTGTATAAG 900 

ACAATCATAG ATGTCCAAAT AAGCACACCA 960 

CTTGTCCTCA ATTGTACTGC TACCACTCCC 1020 

TACCCTGATG AAAAAAATAA GAGA6CTTCC 1080 

CATGCCAACA TATTCTACA6 TGTTCTTACT 1140 

CTTTATACTT GTCGTGTAAG GAGTGGACCA 1200 

ATATATGATA AAGCATTCAT CACT6TGAAA/' 1260 

GCTGGCAAGC GGTCTTACCG GCTCTCTATG 1320 

GTATGGTTAA AAGATGG6TT ACCT6CGACT 1380 

TACTCGTTAA TTATCAAGGA CGTAACTGAA 1440 

AGCATAAAAC AGTCAAATGT GTTTAAAAAC 1500 

CCCCAGATTT ACGAAAAGGC CGTGTCATCG 1560 

AGCAGACAAA TCCTGACTTG TACCGCATAT 1620 

TGGCACCCCT GTAACCATAA TCATTCCGAA 1680 

GAGTCCTTTA TCCTGGATGC TGACAGCAAC 1740 

CGCATGGCAA TAATAGAAGG AAAGAATAAG 1800 

AGAATTTCTG GAATCTACAT TTGCATAGCT 1860 

ATAAGCTTTT ATATCACAGA TGTGCCAAAT 1920 

ACGGAAGGAG AGGACCTGAA ACTGTCTTGC 1980 

ACTTGGATTT TACTGCGGAC AGTTAATAAC 204 0 

AAAATGGCCA TCACTAAGGA GCACTCCATC 2100 

CTGCAAGATT CAG6CACCTA TGCCTGCAGA 2160 

CTCCAGAAGA AAGAAATTAC AATCAGAGAT 2220 

AGTGATCACA CAGTGGCCAT CAGCAGTTCC 2280 

CCCGAGCCTC AGATCACTTG GTTTAAAAAC 2340 

ATTTTAGGAC CAGGAAGCAG CACGCTGTTT 2400 

GTCTATCACT GCAAAGCCAC CAACCAGAAG 2460 

GTTC7AGGAA CCTC6GACAA GTCTAATCTG 2520 

GCTGCGACTC TCTTCTGGCT CCTATTAACC 2580 

TCTGAAATAA AGACTGACTA CCTATCAATT 2640 

GAGCAGTGTG AGCGGCTCCC TTATGATGCC 2700 

AAACTGGGCA AATCACTTGG AAGAGGGGCT 2760 

GGCATTAAGA AATCACCTAC GTGCCGGACT 2820 

ACGGCCAGCG AGTAGAAAGG TCTGATGACT 2880 

CATCTGAACG TGGTTAACCT GCTGGGAGCC 2940 

ATTGTTGAAT ACTGCAAATA TGGAAATCTC 3000 

TTTTTTCTCA ACAAGGATGC AGCACTACAC 3060 

GGCCTGGAAC AAGGCAAGAA ACCAAGACTA 3120 

AGCTCCGGCT TTCAGGAAGA TAAAAGTCTG 3180 

GGTTTCTACA AGGAGCCCAT CACTATGGAA 324 0 

AGAGGCATGG AGTTCCTGTC TTCCAGAAAG 3300 

ATTCTTTTAT CTGAGAACAA C6TGGTGAA6 3360 

TATAAGAACC CCGATTATGT GAGAAAA6GA 3420 

CCCGAATCTA TCTTTGACAA AATCTACAGC 3480 

TTGCTGTGGG AAATCTTCTC CTTAGGT6GG 3540 

GACTTTTGCA 6TCGCCTGAG GGAAIGGCATG 3600 

GAAATCTATC AGATCATGCT GGACTGCT6G 3660 

GCAGil^CTTG TGGAAAAACT AGGTGATTTG 3720 

GACTACATCC CAATCAATGC CATACT6ACA 3780 

GCCTTCTCTG AGGACTTCTT CAAGGAAAGT 3840 

TCTGATGATG TCAGATATGT AAATGCTTTC 3900 

TTTGAAGAAC TTTTACCGAA TGCCACCTCC 3960 

ACTCTGTTGG CCTCTCCCAT GCTGAAGCGC 4020 

TCGCTCAAGA TTGACTTGAG AGTAACCAGT 4 080 

AGCAGGCCCA GTTTCTGCCA TTCCAGCTGT 414 0 

ACCTACGACC ACGCTGAGCT GGAAAGGAAA 4200 

AACTCGGTGG TCCTGTACTC CACCCCACCC 4260 

AGAAGCACAT GTGTATTTAT ACCCCCAGGA 4320 

AAGTTTACAC CTTTATCTTT CCATGGGAGC 4380 

CTTTTTTTTT TTGACTAACA AGAATGTAAC 444 0 

ACACTACTGC TAAATCCTCA TGTTACTCAG 4500 

CTTCCCTGCT CCAACCCCCG CCACCTCAGG 4560 

CACTGATCAC CCAATGCATC ACGTACCCCA 4620 

GCAACAAGCC C6TTAGCCCC A6GGGATCAC 4680 
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TGGCTGGCCT GAGCAACATC TCGGGAGTCC TCTAGCAGGC CTAAGACATG TGAGGAGGAA 4740 

AAGGAAAAAA AGCAAAAA6C AAGGGAGAAA A6AGAAACCG GGAGAAGGCA TGAGAAAGAA 4800 

TTTGAGACGC ACCATGT6GG CACG6AG6GG 6ACGGGGCTC AGCAATGCCA TTTCAGTGGC 4860 

TTCCCAGCTC TGACCCTTCT ACATTT6AGG GCCCAGCCAG GAGCAGATGG ACAGCGATGA 4920 

5 GGG6ACATTT TCTGGATTCT GGGAGGCAA6 AAAA6GACAA ATATCTTTTT TGGAACTAAA 4980 

GCAAATTTTA GACCTTTACC TATGGAAGTG GTTCTATGTC CATTCTCATT CGTGGCATGT 5040 

TTTGATTTGT AGCACTGAGG GTGGCACTCA ACTCTGAGCC CATACTTTTG GCTCCTCTAG 5100 

TAAGATGCAC TGAAAACTTA GCCAGAGTTA 6GTTGTCTCC AGGCCATGAT GGCCTTACAC 5160 

TGAAAATGTC ACATTCTATT TTGGGTATTA ATATATJW3TC CAGACACTTA ACTC7ATTTC 5220 

10 TTGGTATTAT TCTGTTTTGC ACAGTTAGTT GTGAAAGAAA GCTGAGAAGA ATGAAAATGC 5280 

AGTCCTGAGG AGAGTTTTCT CCATATCAAA ACGAGGGCTG ATGGAGGAAA AAGGTCAATA 5340 

AGGTCAAGGG AAGACCCCGT CTCTATACCA ACCAAACCAA TTCACCAACA CAGTTGGGAC 54 00 

CCAAAACACA GGAAGTCAGT CACGTTTCCT TTTCATTTAA TG6GGATTCC ACTATCTCAC 5460 

ACTAATCTGA AAGGATGTGG AAGAGCATTA GCTGGCGCAT ATTAAGCACT TTAAGCTCCT 5520 

15 TGAGTAAAAA GGTGGTATGT AATTTATGCA AGGTATTTCT CCAGTTGGGA CTCAGGATAT 5580 

TAGTTAATGA GCCATCACTA GAAGAAAAGC CCATTTTCAA CTGCTTTGAA ACTTGCCTGG 564 0 

6GTCTGAGCA TGATGGGAAT AGGGAGACAG GGTAGGAAAG GGCGCCTACT CTTCAGGGTC 5700 

TAAAGATCAA GTGGGCCTTG GATCGCTAAG CTGGCTCTGT TTGATGCTAT TTATGCAAGT 5760 

TAGGGTCTAT GTATTTAGGA TGCGCCTACT CTTCAGGGTC TAAAGATCAA GTGGGCCTTG 5820 

20 GATCGCTAAG CTGGCTCTGT TTGATGCTAT TTATGCAAGT TAGGGTCTAT GTATTTAGGA 5880 

TGTCTGCACC TTCTGCAGCC AGTCAGAAGC TGGAGAGGCA ACAGTGGATT GCTGCTTCTT 594 0 

GGGGAGAAGA GTATGCTTCC TTTTATCCAT GTAATTTAAC TGTAGAACCT GAGCTCTAAG 6000 

TAACCGAAGA ATGTATGCCT CTGTTCTTAT GTGCCACATC CTTGTTTAAA GGCTCTCTGT 6060 

ATGAAGAGAT GGGACCGTCA TCAGCACATT CCCTAGTGAG CCTACTGGCT CCTGGCAGCG 6120 

25 GCTTTTGTGG AAGACTCACT AGCCAGAAGA GAGGAGTGGG ACAGTCCTCT CCACCAAGAT 6180 

CTAAATCCAA ACAAAAGCAG GCTAGAGCCA GAAGAGAGGA CAAATCTTTG TTGTTCCTCT 6240 

TCTTTACACA TACGCAAACC ACCTGTGACA GCTGGCAATT TTATAAATCA GGTAACTGGA 6300 

AGGAGGTTAA ACTCAGAAAA AAGAAGACCT CAGTCAATTC TCTACTTTTT TTTTTTTTTT 6360 

TCCAAATCAG ATAATAGCCC AGCAAATAGT GATAACT^T AAAACCTTAG CTGTTCATGT 6420 

30 CTTGATTTCA ATAATTAATT CTTAATCATT AAGAGACCAT AATAAATACT CCTTTTCAAG 6480 

AGAAAAGCAA AACCATTAGA ATTGTTACTC A6CTCCTTCA AACTCAGGTT TGTAGCATAC 6540 

ATGAGTCCAT CCATCA6TCA AAGAATGGTT CCATCT6GAG TCTTAATGTA GAAAGAAAAA 6600 

TGGAGACTTG T71ATAATGAG CTAGTTACAA AGTGCTTGTT CATTAAAATA GCACTGAAAA 6660 

TTGAAACATG AATTAACTGA TAATATTCCA ATCATTTGCC ATTTATGACA AAAATGGTTG 6720 

35 GCACTAACAA AGAACGAGCA CTTCCTTTCA GAGTTTCTGA GATAATGTAC GTGGAACAGT 6780 

CTGGGTGGAA TGGGGCTGAA ACCATGTGCA AGTCTGTGTC TTGTCAGTCC AAGAAGTGAC 684 0 

ACCGAGATGT TAATTTTAGG GACCCGTGCC TTGTTTCCTA GCCCACAAGA ATGCAAACAT 6900 

CAAACAGATA CTCGCTAGCC TCATTTAAAT TGATTAAAGG AGGAGTGCAT CTTTGGCCGA 6960 

CAGTGGTGTA ACTGTGTGTG TGTGTGTGTG TGTGTGTGTG TGTGTGTGTG TGTGGGTGTG 7020 

40 GGTGTATGTG TGTTTTGTGC ATAACTATTT AAGGAAACTG GAATTTTAAA GTTACTTTTA 7080 

TACAAACCAA GAATATATGC TACAGATATA AGACAGACAT GGTTTGGTCC TATATTTCTA 7140 

GTCATGATGA ATGTATTTTG TATACCATCT TCATATAATA TACTTAAAAA TATTTCTTAA 7200 

TTGGGATTTG TAATCGTACC AACTTAATTG ATAAACTTGG CAACTGCTTT TATGTTCTGT 7260 

CTCCTTCCAT AAATTTTTCA AAATACTAAT TCAACAAAGA AAAAGCTCTT TTTTTTCCTA 7320 

45 AAATAAACTC AAATTTATCC TTGTTTAGAG CAGAGAAAAA TTAAGAAAAA CTTTGAAATG 7380 

GTCTCAAAAA ATTGCTAAAT ATTTTCAATG GT^AACTAAA TGTTAGTTTA GCTGATT6TA 7440 

TGGGGTTTTC GAACCTTTCA CTTTTTGTTT GTTTTACCTA TTTCACAACT GT6TAAATTG 7500 

CCAATAATTC CTGTCCATGA AAATGCAAAT TATCCAGTGT AGATATATTT GACCATCACC 7560 

CTATGGATAT TGGCTAGTTT TGCCTTTATT AAGCAAATTC ATTTCAGCCT 6AATGTCTGC 7620 

50 CTATATATTC TCTGCTCTTT GTATTCTCCT TTGAACCCGT TAAAACATCC TGTGGCACTC 

Seq ID NO: 215 Protein sequence; 
Protein Accession #: NP_002010.1 

55 

1 11 21 31 41 51 

I I I I I I 

MVSYWDTGVL liCTVLLSCIiLL TGSSSGSKLK DPELSLKGTQ HIMQAGQTLH liQCRGEAAHK 60 

WSLPEMVSKE SERLSITKSA CGRNGKQFCS TLTIiNTAQAN HTGFYSCKYL AVPTSKKKET 120 

60 ESAIYIFISD TGRPFVEMYS EIPEIIHMTE GRELVIPCRV TSPNITVTIiK KFPLDTIjIPD 180 

GKRIIWDSRK GFIISNATYK EIGLLTCEAT VNGHIjYKTNY LTHRQTNTII DVQISTPRPV 24 0 

KLLRGHTLVL NCTATTPLNT RVQMTWSYPD EKNKRASVRR RIDQSNSHAN IFYSVIiTIDK 300 

MQNKDKGLYT CRVRSGPSPK SVNTSVHIYD KAFITVKHRK QQVLETVAGK RSYRIiSMKVK 360 

AFPSPEWWL KDGLPATEKS ARYLTRGYSL IIKDVTEEDA GNYTILIiSIK QSNVFKNLTA 420 

65 TLIVNVKPQI YEKAVSSFPD PALYPLGSRQ ILTCTAYGIP QPTIKWFWHP CNHNHSEARC 4 80 

DFCSNNEESF ILDADSNMGN RIESITQRMA IIEGKNKMAS TLWADSRIS GIYICIASNK 540 

VGTVGRNISF YITDVPNGFH VNLEKMPTEG EDIjKIjSCTVN KFliYRDVTWI IiliRTVNNRTM 600 

HYSISKQKMA ITKEHSITIiN LTIMNVSLQD SGTYACRARN VYTGEEILQK KEITIRDQEA 660 

PYLLRNIjSDH TVAISSSTTL DCHANGVPEP QITWFKNNHK IQQEP6IILG P6SSTLFIBR 720 

70 VTEEDEGVYH CKATNQKGSV ESSAYIiTVQG TSDKSNIiELX TIiTCTCVAAT IjFWIiLIiTLLI 780 

RKMKRSSSBZ KTDYIiSIIMD PDEVPU3EQC ERLPYDASKW EFARERLKLG KSL6RGAF6K 840 

WQASAFGIK KSPTCRTVAV KMLKE6ATAS EYKALMTEIiK ZIiTHIGHHIiN WNLIiGACTK 900 

QGGPLMVIVE YCKYGNLSNY IiKSKRDX.FF]j NKDAALHMEP KKEKMEPGLB QGKKPRLDSV 960 

TSSESFT^SG FQBDKSLSDV EBEEDSDGFY KKPXTMEDLX SYSFQVARGM EFIjSSRKCXH 1020 

75 RDLAARNIIJj SENNWKICD FGLARDXYKN PDYVRKGDTR LPUCWMAPES XFDKXYSTKS 1080 

DVWSYGVLLW EXFSLGGSPY P6VQMDEDFC SRLREGMRMR APEYSTPEIY QIMLDCWHRD 1140 



281 



BNSDOCID: <WO^_^02079492A2_I_> 



wo 02/079492 



PCT/US02/04915 



PKERPRFAEL VEKLGDLLQA NVQQDGKDYI PINAILTGNS GFTYSTPAFS EDFFKESISA 1200 

PKFNSGSSDD VRYVNAFKFM SLERIKTFEE LLPNATSMFD DYQGDSSTLL ASPMLKRFTW 1260 

TDSKPKASLK IDDRVTSKSK ESGLSDVSRP SFCHSSCGHV SEGKRRFTYD HAELERKIAC 1320 
CSPPPDYNSV VLYSTPPI 



Seq ID NO: 216 DMA sequence 

Nucleic Acid Accession #: NM__024689 

Ccxiing sequence: 76-624 (underlined sequences coxxespond to start and stop codons) 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



CTCTTTG6CC 
AGCCCAGAGC 
TGGCTGGCCC 
TCCCTTTCTT 
GGTCTTGATA 
GAAATAAGAT 
TCTTATCCTG 
CTGGTCAGCA 
CCAAAGACCT 
CAGGCCAAGC 
AAGTTCCTGT 
AGAACTTCCA 
AGTAATCATT 
AGAATTCACC 
GACAACTCCA 
TGAGCCCACC 
CTGGACAGGT 
TTACCAAAG6 
CCCTAGTGAT 
CTTTTACTTT 
TTTATGTTTC 
TTTTTTGAGC 
G6G6AAGCTA 
TAGATTTCGG 
GTGTATCAGG 
AGATTTATTT 
TGGTAGGCCT 
ATTCCTTGTT 
GCCCACCCAG 
TCACATCTAT 
GAATACAGCC 
TTTTATCGAC 
AGCATCACCC 
CAATAGAGAC 
AGGAT6AGGA 
TTTTTATTTT 
AAGTGATTTA 
AT6AAACCTT 
TAATTGTGGG 
TGTAA6TGGG 
GTCTCTTAAG 
TAGCAGTTAT 



11 
I 

AAGCCCTGCC 
CCAAGATGGA 

tgctgctgtg 
ctctggtgcc 
aatgcaatgc 
ctgacaactg 
caaattactc 
aatatca;^ 
gcagcattga 

GCCTCACGCC 
GTATGCTGAG 
G7VAAGTGTTA 
AAAGAGGCTT 
AACACACAGG 
AAGCCCCGGC 
CCAATCCAGA. 
CTTCCCTATG 
CCCACATAAC 
GGATGAACTC 
TTAAGTACCT 
CATTCTGGTA 
AAACACTCGG 
CTTCTCCTCT 
TCTTCATTGC 
CAGGGTTCTA 
CAAAGAATTG 
GCAATCTGTA 
CCTTAAAAAA 
ATTACCTAGA 
GAAATGCCTT 
TAGCCAAGTT 
CGTCTTCAGA 
TGAACCAAAG 
ATTGACTGGT 
AACCAGGCAC 
TCACTGGGAG 
GAAACTCCAA 
ATTTTATTGG 
TTTGCACATG 
ACCTTGGGGA 
CCTGTTCCTG 
CTATTGTTGT 



21 
I 

TCTGTACAGC 
GCCCCAGCTG 
GGTCTCAGCC 
CCAAGTCAGA 
CTGCATCGGG 
GCTGGCTTCC 
AGATGATTCC 
CGAGATCTCA 
GCGTGTCCTG 
GGACCTGGTG 
ATAA CACCAG 
GCCTTCTCCC 
CTGCATCAAA 
CCCACCAGCA 
TCTTTCCACC 
TGTGATCCCC 
AGATAGAACC 
TTCTAAATTT 
TCTTATCTCT 
CCATCAGAGT 
AGAACTCTTT 
G6GTATGGAT 
ATTCAGATTT 
TGTCCATTTT 
CCAGAGA7AC 
ATTTACATGA 
AACCTTTGGG 
ATCTGTTTTT 
TAATCTCTTT 
CACAGCAACA 
GACACATAAA 
CTGTTAAGGA 
GCCCCTATCA 
TGGCTGGCTT 
GGGAGAGGGA 
GTGGTAAGTT 
AGCAATTGGT 
AAATG6TT6G 
GCCA6CACAT 
OGAGCTGCCT 
CTACA6TTAT 
GTATTAAACC 



31 
I 

CTCGAGTGGA 
G6GCCTGAGG 
CTGAGCTGTT 
ACCAGCTACA 
ACATCTATTT 
CACCTTGGAC 
AAAATCTGGC 
GACAGGAAAA 
CGGAAAACAG 
CAGGACTGTC 
TGAAAAAGCC 
AACTGTGTTA 
CCTTCACATG 
ACAGGCTACC 
ACACTGTGGT 
CTGTGATCTA 
TGATAAGGAG 
TGGTCTGGTC 
GGCTTCTAGA 
CATGAAATCA 
AAAT6AGGAC 
GAAAGCCAAT 
CACTAAAATC 
TGTAATGAAC 
AGAACCAGTA 
TTGTGGGGAT 
CAGGAGCTGA 
GTTCTTAAGG 
TACTTAAAGT 
CCTAGATTAG 
ATTAACCATC 
TTGTGGTAGA 
AGTAACAATA 
CCCAAGGGAT 
GGGGCAACAG 
AGCCCTGTTG 
AATCCCCAAA 
TTTCCCAATT 
6CCAAACAGA 
CCATCATAAA 
A6AGGTTGCT 
ATTTCAACAC 



41 
I 

CAGCCA6AGG 
CTGCCGCCCT 
CTTTCTCCTT 
ATTTTGGAAG 
GCAAG7AGTT 
TGCCTCCCGA 
GCCCTGTGGA 
TCTGTGCCTC 
AGAGGTTCCA 
ACCAGGGCCA 
TGGCATGGAG 
TACCAACCAC 
CAGCTCCCAT 
TTTGCACAAT 
CCCCTAGATG 
CTTCTGGCAA 
CTAGGGCAAT 
TGAA66AAAA 
GGGAAAAAAA 
CCTGTCAAGA 
ACTGCTGATT 
CGCAGGTCAA 
TTCCAAGAT6 
GAGTGTTTTT 
GGAGATACAT 
TGGCAAGTCG 
TGCTGTAGTT 
GCTTTGAATG 
AAACTGATTG 
CATTCAATTG 
ACAGCAACAT 
GAACTGTGAC 
TAGCCAAGCA 
AGCACCAGAC 
AGGTCCAGGG 
CCCATGTATG 
ATGGGT6TAT 
CTGTTTGCCA 
AGTAGACAAA 
GGGAGGGGTT 
CAGAACCTTC 
AT 



51 
I 

CTGCAGCTGG 
CCGCCCTGGC 
GCCAGCTTCT 
GACTTTCCTC 
CTTTAAAGAA 
TTCCTTGCTT 
GATCTTTAGA 
TGCATCAGCC 
GAAATGGCTG 
GAGAGAACTA 
CCCAGCACTG 
ATTTTCAAAT 
GCCACCCTCC 
ATTCTCTGAT 
GGGCTGTTGC 
GATTCTCAGT 
TCTGACAACA 
CCTGTTCTCG 
AAGCATACCT 
CTATCTATCT 
GCT6GTGATG 
ATGACTCCTT 
AAAGCAAATC 
CCTTTAGCTA 
ATACATGTCC 
AAAATCCATA 
TGCAGATAGA 
ATTGGATCAG 
TAGGTGCTAA 
AATAACTGGG 
GCCTGCTAAA 
AGCCACTCTC 
AAATTCCAGT 
AAGAZVATGCA 
TTTGGTTATC 
CAGATGGGAG 
CTGGTTTGAA 
TTGGCCAATA 
GGTCTGACTC 
AGTAAAAATG 
TCAGCAAATA 



60 
120 
180 
240 

300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 



Seq ID NO: 217 Protein sequence; 
Protein Accession #: NP 078965.1 



60 



65 



70 



75 



1 11 21 31 41 51 

I I I I I I 

MEPQLGPEAA AIiRPGWLAIjL LWVSALSCSF SIiPASSIiSSL VPQVRTSYNF GRTFIiGLDKC 
NACIGTSICK KFFKEEIRSD NWIASHLGLP PDSLLSYPAN YSDDSKIWRP VEIFRLVSKY 
QNEISDRKIC ASASAPKTCS lERVLRKTER FQKWLQAKRL TPDLVQDCHQ GQRELKFLCM 



60 
120 
180 



Seq ID NO: 218 PNA sequence 

Nucleic Acid Accession #: AF075027.1 

Coding sequence: 3-269 (underlined sequences correspond to start and stop codons) 



GA TTAAT TAA 
TG6GATGGCT 
CTTTCTGTGC 
CTTTCTCCCT 
CCATAGATCT 
6ACAAAA6TG 



11 
I 

GTGCTTTAAA 
GTA6CTTGAG 
CAAG6T6CTG 
GGAGTAAGGC 
CAT6CATCAT 
GTGGGTTTTC 



21 



31 
1 



CGGTCTTGGT AAATATTCCG 
TTGAATTTTA ACTGTCCTCA 
TGTTAGGGGA 6AGAGTGACT 
CGAAGACTGG ATTACTACAC 
TAATGCCA TA TGA CATTGCC 
ATTGTCTTCA CTGATT6TCA 



41 
I 

CGGGAGCTGG 
TTCTGGGTTT 
GGAAAGTAAC 
GCCTA6AC6T 
ATTTTCTTTC 
ATGCATTAAT 



51 
I 

GGA66ACCGT 60 

T6TCGCTCTG 120 

AAAGCTGAAT ISO 

6 ACACTACAC 240 

TCAGTTCACG 300 

AAAQAAGAT6 360 



282 



3NSDOCID. <WO 02079492A2.L> 



wo 02/079492 



PCT/US02/04915 



TGTGGT 



Seq ID NO: 219 Protein sequence; 
Protein Accession #: AF075027 



10 



15 



20 



25 



30 



35 



40 



1 11 21 31 41. 51 

I I I I I I 

BRKWQCHMAIj mmhbiygcsv tsrrwiqss allqgerpsf vtfqslspvt qhlgtesrat 

KPRMRTVKIQ LKLQPSQRSS PAPAEYLPRP FKALN 



60 



Seq ID NO: 220 DNA sequence 

Nucleic Acid Accession #: AIj133411.8 

Coding sequence: 1-1395 (underlined sequences correspond bo start and stop codons) 



M!GGGCAAGG 
AAATGGGATC 
AACAGGCAAC 
ACATCCAGAA" 
AAAAAGGACA 
AACCAAACAC 
CAGG6AGGGG 
CTTGAACATG 
TGGGCCCAGG 
ATCTGGGTGG 
CTACCTTTAA 
TATGCTCCTG 
ACAACACAAA 
AATTTTGCTC 
GAAGAAGAAA 
GTGCCTGCAT 
GATAAAGATC 
AATCAGCCAG 
CTCCTCCTCT 
CTGAGTTATA 
TACTTTCATC 
ACATTTTTGG 
AAGATAATGA 
TCGGTTACCC 



11 
I 

ACTTCATGAC 
TAATCAAATT 
CTACAGAATG 
TCTATAAGGA 
TGGATGAAGC 
CACATGTTCT 
AACATCACAC 
CTGGGAGGAA 
AACAATCCAT 
CACAAACTCC 
ATGAAGATGT 
CTAAT6A6AA 
ATCCAAATGG 
TAAAAAAC6A 
CAACTACTAG 
TTTGGACAAT 
AATTATTTCA 
ATCTAGAGGA 
TTGTGGTCCT 
AAAGTTGTGA 
CATCAGAAGG 
GTACCACTTC 
CGGATATCAT 
GGTGA 



21 
I 

TAAAACACTA 
AAAGAGCTTC 
GGAGAAAAAT 
ACTTAAACAA 
TGGAAACCGT 
CACTCATAAG 
ACTGGGGCCT 
ATTACAATTC. 
AATACAAAAG 
TGGTGAATCT 
TAATAA6CA6 
AAATGGCAAT 
CACTGAGTCT 
TAAAACTGTC 
CGAACCCTCT 
GTTAGCTAAA 
CCCAATTCCA 
TCTGAAGATC 
CTTGGCATTC 
GAGTCAGTAC 
TGTTTCAGAT 
TTCAGATATG 
TTCCATAGGC 



31 
I 

AAAGCAATGG 
CGCACAGCAA 
TTTGCAATGT 
TTTTACAAGA 
CATTCTCAGA 
TGGGAGTTGA 
GTCAGAAGCC 
ATCCATGGGC 
AAATATGCAT 
ATCTCCA6TT 
GAAGAAAAGA 
TATTATAAA6 
GAAATATCTG 
AATGCAACTA 
CATAAAAATA 
GCTATAAAT6 
GAGTCTGATG 
AAAATAATGC 
TGTAGTGCTA 
TCTGTCAACC 
ACATCCTTTT 
AGAAGATCAG 
T.CAGATAATG 



41 

I 

CAACAAAAGC 
AAGAAACTAT 
ATCCATCTGA 
AAAAACCAAA 
AAACTAACAC 
ACAATGAGAA 
CCTCTGGCCT 
TGTTTACCCT 
TATGGATTGG 
CACCAGCATT 
ATGAAGATCA 
ATATAAAACA 
TGAGA6CCAC 
CATAT6AAAA 
TTCAAAGATC 
GAACAGCAGT 
T6AAT6CTAC 
TGGGAATCTC 
CACTGTACAA 
CAGAGCTGGC 
CCAAGAGTGC 
GCACAAGAAC 
AGATGCATGA 



51 

I 

CAAAATTGAC 
TATCAGAGTG 
CAAAGGGCTG 
CAACGCCATC 
AGGAACAGAA 
CACATGGACA 
CCTGGCTGGC 
TGAAAATGAA 
AACCAAGCAG 
GCCTAATGTG 
TACTCCCAAT 
ATATGTGTTC 
AACTGACCTG 
ATCCACCATT 
AACCCCAAAC 
GGTCAT6GAT 
ACAGGGAGAA 
GTTGATGACC 
ACTGAGGCAT 
CACGATGTCT 
AGAGAGCAGC 
ATCAGAATCT 
AAACGATGA6 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



45 



Seq ID NO: 221 Protein sequence; 
Protein Accession #: AItl33411.8 



1 11 21 31 41 51 

I I I I I I 

MGKDFMTKTIj KAMATKAKID KWDLIKIiKSF RTAKETIIRV NRQPTEWEKN FAMyPSDKGIi 60 

50 TSRIYKBIiKQ FYKKKPNNAI KKDMDEAGNR HSQKTNTGTB NQTPHVIiTHK WELNNENTWT 120 

QG0EHHTL6P VRSPS6LLA6 LEHAGRKIiQF IH6LFTLENE WAQEQSIIQK KYAItWIGTKQ 180 

JWVAQTPOES ISSSPALPNV LPIiNBDVNKQ BEKNEDHTPN YAPANEKMGN YYKDIKQYVF 240 

TTQNPN6TSS EISVRATTDIj NFALKNDKTV NATTYBKSTI EEBTTTSBFS HKNIQRSTPN 300 

VPAFWTMliAK AIN6TAWMD DKDQLFHPIP BSDVNATQGE NQPDLEDIiKI KIMLGISLMT 360 

55 LLIjFWLLAF CSATLYKIjRH LSYKSCESQY SVNPBZiATMS YFHPSBGVSD TSFSKSAESS 420 
TFLGTTSSDM RRSGTRTSES KIMTDIISXG SDKEMHENDE SVTR 

Seq ID NO: 222 DNA sequence 
Nucleic Acid Accession #: aIj050295.1 
60 Coding sequence: 237 >- 2 073 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

65 GAAGGGGACA GAAGGCAGTT CACCTCTGCT CCCGACAGCC TGGGAACCCG CAAGAGCCCC 60 

AGCATTTGAA GTCTGGTCTT GTGAAACCCC ACCCTCCTCT GGCTGTGTGA TTGAATGGGA 120 

TGCCCTCGAG GTACACCTCA CCTGAGAG6G TTTTGGGCAG ATCAGCAGTA AGGT6TTAAA 180 

TTTTAGAAGC CTGAAAACTC CAGAAGA6AA AGGCCAACCA ACTCAAACTT GAAGA CATGA 240 

AATCCCCAAG GAGAACCACT TTGTGCCTCA TGTTTATTGT GATTTATTCT TCCAAAGCTG 300 

70 CACTGAACTG GTUITTACGAG TCTACTATTC ATCCTTTGAG TCTTCATGAA CATGAACCAG 360 

CTGGTGAAGA GGCACTGAGG CAAAAACGAG CC6TTGCCAC AAAAAGTCCT ACGGCTGAAG 420 

AATACACTGT TAATATTGAG ATCAGTTTTG AAAATGCATC CTTCCTGGAT CCTATCAAAG 480 

CCTACTTGAA CAGCCTCAGT TTTCCAATTC ATGGGAATAA CACTGACCAA ATTACTGACA 540 

TTTTGA6CAT AAATGTGACA ACAGTCTGCA GACCTGCTGG AAATGAAATC TGGTGCTCCT 600 

75 GCGAGACA6G TTATGGGTGG CCTCGGGAAA GGTGTCTTCA CAATCTCATT TGTCAAGAGC 660 

GTGAGGTCTT CCTCCCAGGG CACCATTGCA GTTGCCTTAA AGAACTGCCT CCCAATGGAC 720 



283 



BNSDOCID: <WO ^02079492A2_I_> 



wo 02/079492 



PCT/US02/04915 



10 



15 



20 



25 



CTTTTTGCCT 
TTCAAGAAGA 
AAACAGCGTT 
GGTTCAAGTC 
AGTTAATACA 
ACTACAACTC 
TCATCTTTGA 
TGTCTTGGCG 
ACACCGCACT 
CAGGTGATGC 
AGAA6AAAAT 
GCGACAACAA 
TAGAATGGAA 
CTAGCTGCAG 
6AACAACA6T 
ACATAAAAGT 
TTTCTGAGGG 
TTTATTGGAA 
ATCTTGATGG 
CCTATCACTG 
TTCACCCGCT 
GCAGTGGTTC 
TCCATATGGG 



GCTTCAGGAA 
CCTCATGAAC 
CCGGAAGGGT 
TGGAAGTGTG 
TAAAGCCAAT 
CTTTCAAGCA 
AGGGGACACA 
CTATGAAGAA 
TTTCAACAAC 
A66TGAATAT 
A6ATGTTATG 
TCCTGTATCT 
GCAGGAAGGA 
CAGATACACC 
CATCTACACT 
GACATTCATC 
ACAAAACTTT 
CACTTCTGCT 
AGCAGAATCA 
CATATTTAGA 
GCCTCTAAAG 
CCATCACATC 
TTCCTCATCC 



GATGTTACCC 
ACTTCCTCCG 
TACGGAATTT 
GTTGTGACAT 
GAACAAGTTG 
GTTACTATCA 
GTCAGTCTGG 
CAGCAGTTG6 
ATGACTTCGG 
GTTTGCAAAC 
CCCATCCAAA 
TTGAACTGCT 
AAAATAAATA 
CTCAAGGCTG 
TGTGA6TTCA 
TCTGTG6CCA 
TCTATAAAAT 
GGAATTAAAA 
GTACTGACAG 
TATAAGAATT 
CTGAACATCA 
AAGTGCTGCA 
CTTCCTGCTG 



TGAACATGAG 
CCCTCTATAG 
TACCAGGCTT 
ATGAAGTCAA 
TACAGAGCCT 
ATGAAAGCAA 
TGTGTGAAAA 
AAATCCA6AA 
TGTCCAAGCT 
TGATATTAGA 
TTTTGGCAAA 
GCAGTCAGGG 
TTCCAG6AAC 
ATGGAACCCA 
TCAGTGCCTA 
ATCTAACAAT 
GCATCAGTGA 
TATACCAAAG 
TCAAGACCTC 
CATACAGTAT 
TGATTGATCC 
TAGAGGAGGA 
TAAAAA7U\AA 



AGTCAGACTA 
GTCCTACAAG 
CAAGGGCGTG 
GACTACACCA 
CAATCAGACC 
TTTCTTTGTC 
GGAAGTTTTG 
CAGCA6CAGA 
CACCATCCAC 
CATTTTTGAA 
T6AAGAAAT6 
TAATGTTAAT 
CCCTGAGACA 
GTGCCCZU^GC 
TGGAGCCAGA 
AACCCCX^GAC 
TGTGAGTAAC 
ATTTTATACC 
GACCAGGGAG 
TGCAACCAAA 
TTTGGAAGCT 
TGGAGACTAC 
AAAAAAAAAA 



AATGTAGGCT 
ACCGACTTGG 
ACTGTGACAG 
CCATCACTTG 
TACAAAATGG 
ACACCAGAAA 
TCCTCCAATG 
TTCTCGATTT 
AACATCACTC 
TATGAGTGCA 
AAG6TGATGT 
T6GA6CAAAG 
GACATAGATT 
G6GTCGTCTG 
G6CAGTGCAA 
CCAATTTCTG 
TATGATGA6G 
ACGAGGAGGT 
TGGAATGGAA 
GACGTCATTG 
ACTGTTTCAT 
AAAGTTACTT 
A 



780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 



Seq ID NO; 223 Protein sequence; 
Protein Accession #: CAB43394.1 



30 



35 



40 



45 



MKSPRRTTIiC 
EEYTVNIEIS 
SCETGYGWPR 
GFQEDLMNTS 
LELIHKANEQ 
NVSWRYEEQQ 
CKKKIDVMPI 
DSSCSRYTIjK 
SVSEGQNFSI 
GTYHCIFRYK 
TFHM6SSSLP 



11 
I 

liMFIVIYSSK 
FENASFIiDPI 
ERCUQ7LICQ 
SALYRSYKTD 
WQSIiNQTYK 
IiEIQNSSRFS 
QlIiANEENKV 
ADGTQCPSGS 
KCISDVSNYD 
NSYSIATKDV 
AVKKKKKK 



21 
I 

AALNWNYEST 
KAYLNSI.SFP 
BRDVFIiPGHH 
IiETAFRKGYG 
MDYNSFQAVT 
lYTAIiPNNMT 
MCDMNPVSIiN 
SGTTVIYTCE 
EVYWNTSAGI 
IVHPLPIiKIiN 



31 
I 

IHPIiSIiHBHB 
IH6NNTDQIT 
CSCLKELPPN 
IIiPGFKGVTV 
INESNFFVTP 
SVSKIiTIHHI 
CCSQGNVNWS 
FISAYGARGS 
KIYQRFYTTR 
IMIDPLEATV 



41 

I 

PAGEEALRQK 
DIIiSlNVTTV 
GPFCLIiQEDV 
TGFKSGSVW 
BIIFEGDTVS 
TPGDAGEYVC 
KVEWKQEGKI 
ANIKVTPISV 
RYIiDGAESVIi 
SCS6SHHIKC 



51 
I 

RAVATKSPTA 
CRPAGNEIWC 
TZiNMRVRIjNV 
TYEVKTTPPS 
LVCEKEVIiSS 
KLXLDIFEYB 
NZPGTPETDI 
ANLTITPDPI 
TVKTSTREWN 
CZEED6DYKV 



Seq ID NO: 224 DNA sequence 

Nucleic Acid Accession #: NM_007268 

Coding sequence: 46-1245 (underlined sequences correspond to 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



start cuid stop codons) 



11 



21 



31 



41 



51 



50 



55 



60 



65 



70 



75 



GGTAGCAGGA 
GGCCTGCTAC 
CCAGAGAGTG 
CTGCAAGGCT 
ACCATCTTTC 
CTGCATGTGA 
GATGACC6GA 
GTGAGA6ATA 
ACAACTGGCA 
CAGGCTCGGG 
GAACCCATCA 
TCAGGCTCCT 
GTGAAGTTTG 
ACCATGACAT 
GACATGGATG 
GCCATCATCC 
CTCTGTCGGA 
GAGGCCAACG 
GATGAGCCAA 
GAGTACCAGA 
CTGGATTATG 
CCAGGATCTG 
GCTACCTCTC 
TCACTGGCTT 
GCTCTGGGCC 
GGG/U^TGC 
GCATCTTGCC 
TATCCAGGAT 
ACAGGCQAGG 



GGCTGGAAGA 
TCCTGG6GCA 
TAACAGGACC 
ACACCCAAGT 
TACGTGACTC 
GCCACAAGGT 
GCCACTACAC 
AGATTACTGA 
GCGGTTATGG 
GTTCTCCTCC 
AAGTAGCAAC 
ATTTCTGCAC 
TGGTCAAAGA 
ACCCCTTGAA 
GCTACCTTGG 
TCATCATCTC 
AGACATCCCA 
ACTCTGGAGA 
CTTCCCAGAA 
TCATCGCCCA 
AGTTTCTGGC 
CTGACa^TAAT 
TTCCTGGATA 
TGCCCTGGAA 
CTTCTAGTAT 
CCATAGCACT 
ACCAGAAGAC 
CATTTCTCTT 
GTTCAGTTCT 



AAGGACAGAA 
CCTAACAGTG 
TTGGAAAGGG 
CTTGGTGAAG 
TTCTGGAGAC 
TCCAGGAGAT 
GTGTGAAGTC 
GCTCCGTGTC 
CTTCACGGTG 
CATCAGTTAT 
CCTAAGTACC 
TGCCAAGGGC 
CTCCTCAAAG 
AGCAACATCT 
AGAGACCAGT 
CTTGTGCTGT 
ACAAGAGCAT 
AACCATGAGG 
TCTGGGCAAC 
GATCAATGGC 
CACTGAGGGC 
TGCCTAGTCA 
GCCCAAAGTG 
TTTGCCAGAT 
CTCTGCCGGG 
AGGACTTGGT 
CCGAGGGAGG 
TCTTCAGGGC 
GCTCCTCCAC 



GTAGCTCTGG 
GACACTTATG 
GATGTGAATC 
TGGCTGGTAC 
CATATCCAGC 
GTATCCCTCC 
ACCTGGCAGA 
CAGAAAOTCT 
CCCCAGGGAA 
ATTTGGTATA 
TTACTCTTCA 
CAGGTTGGCT 
CTACTCAAGA 
ACAGTGAAGC 
GCTGGGCCAG 
AT6GTGGTTT 
GTCTACGAAG 
GTGGCCATCT 
AACTACTCTG 
AACTACGCCC 
AAAAGTGTCT 
GTCCTTGCCT 
TCCGCCTACC 
GCATCTCAAG 
GGCTTCTGGT 
CATCATGCCT 
CTCAGCTCTG 
CAGACAGCTT 
TATAAGTCTA 



CTGTGATGGG 
GCCGTCCCAT 
TTCCCTGCAC 
AACGTGGCTC 
A6GCAAAGTA 
AATTGAGCAC 
CTCCTGATGG 
CTGTCTCCAA 
TGAGGATTAG 
AGCAACAGAC 
AGCCTGCGGT 
CTGAGCAGCA 
CCAAGACTGA 
AGTCCTGGGA 
GAAAGAGCCT 
TTACCATGGC 
CAGCCAGGGC 
TCGCAAGTGG 
ATGAGCCCTG 
GCCTGCTGGA 
GTTAAAAATG 
TCTGCATGGC 
AACACTGGAG 
TAAGCCAGCT 
ACTCCTCTCT 
ACAGACACTA 
CCAGCTCAGA 
TTAATTGAAA 
ATGTTCTGAC 



GATCTTACTG 
CCTGGAAGTG 
CTATGACCCC 
AQACCCTGTC 
CCAGGGCCX3C 
CCTGGAGATG 
CAACCAAGTC 
GCCCACAGTG 
CCTTCAATGC 
TAATAACCA6 
GATAGCCGAC 
CAGCGACATT 
GGCACCTACA 
CTGGACCACT 
GCCTGTCTTT 
CTATATCATG 
ACATGCCAGA 
CTGCTCCAGT 
CATAGGACAG 
CACAGTTCCT 
CCCCATTAGG 
CTTCTTCCCT 
CCGCTGGGAG 
GCTGGATTTG 
AAATACCAGA 
TTCAACTTTG 
GGACCAGCTA 
TTGTTATTTC 
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TGCTCMTAA ATATCTAATC ATAACAGCAA AAAAAAAAAA AAAAAAA 

Seq ID NO: 225 Protein sequence; 
Protein Accession #: NP 009199.1 



10 



15 



1 
I 

MGILLGLIiLL 
GSDPVTIFLR 
DGNQWRDKI 
QTNNQEPIKV 
TEAPTTMTYP 
MAYIMLCRKT 
PCIGQEYQII 



11 
I 

GHLTVDTYGR 
DSSGDHIQQA 
TELRVQKLSV 
ATLSTLLFKP 
LKATSTVKQS 
SQQEHVYEAA 
AQXNGNYARXi 



21 
I 

PILEVPESVT 
KYQGRIiHVSH 
SKPTVTTGSG 
AVIADSGSYF 
WDWTTDMDGY 
RAHAREANDS 
IiDTVPLDYEP 



31 
I 

GPWKGDVNIiP 
KVPGDVSIiQIi 
YGFTVPQGMR 
CTAKGQVGSE 
LGETSAGPGK 
GETMRVAIFA 
liATBGKSVC 



41 . 
I 

CTYDPLQGYT 
STLEMDDRSH 
ISLQCQARGS 
QHSDIVKFW 
SLPVFAIIIiI 
S6CSSDBPTS 



51 
I 

QVLVKWLVQR 
YTCEVTWQTP 
PPISYIWYKQ 
KDSSKLLKTK 
ISLCCMWFT 
QNLGMNYSDE 
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Seq ID NO: 226 DNA sequence 

Nucleic Acid Accession #: XM_64321 

Coding sequence: 1-2079 (underlined sequences correspond to start and stop codons) 



1 
I 

ATGG TCGCCA 
CCAG6ACCAC 
6A6CCTG6TA 
G6GGTTGGGA 
ATCCGTCAGG 
CCATAC6AAG 
CTCACAAATG 
CATAATCAAT 
TTCCCCTCCA 
CCTGAAGTTA 
TCCCTTTGTG 
GAAAGCA6AA 
CGCAAGTCAC 
GTTCACACCA 
CAGCCAATAC 
GGTGATCTTG 
CGGAGGAGCC 
CTCAAGAGGT 
CCCACATTCA 
TCCCTGCTCC 
GGGGGGCGCC 
AGGCCAGGGA 
TCCTCTTTAG 
T6CTTAGAAG 
GGTCCAGACA 
CAGCACCAGA 
AATGAATATG 
TATTCCCCAA 
GTCTCCCTTT 
GTCGTAGCAG 
CAGAAGCAGC 
CGCAAGTTCC 
CCCCCGGCCG 
TGCTCCTGCT 
TTCATTCCAC 



11 
I 

GTTCCGATCA 
GCCTCCGCTC 
CTTACAGGGA 
TCACGGGAAA 
AGGATGCCTT 
CTACCTTGCA 
GCTACCTGCC 
ATCCTAATG6 
CAGCCACCAC 
AACTAAAAAT 
GAGACCTTTT 
AAGAAAAGAG 
ACAAAATCCC 
TATCAGAAAA 
TATCTTCTGT 
TGTGGTCCAA 
ATCACTGTTC 
CTTTCATGGT 
AGGGAACTGC 
TTGGTCATTG 
GCTGGGTGTG 
AGGAACCTCT 
CCACCTCTCA 
ACTATGCAG6 
6CAGGACTGG 
GATATCTCCT 
ATTGCAAGCT 
CCCACATCCT 
CCAAGTTCCT 
TACGTAGGAG 
CCTGTCCTGC 
ACGTGATGGC 
CGCCCCGCAA 
CCCAGGATGT 
ATCCCTGGGT 



21 

I 

AGACAGAGCC 
TGCCCA6AGG 
GGGTGGTGGA 
CACAGTTCAA 
TGATAACAAA 
GCAAA6CTTT 
ATCAATCAGC 
AAATTCTAAA 
TTCGGTACCA 
AACCAAAACT 
AAATGAA6TA 
GAAAAAACCC 
CAAATTAGAA 
ACCAAGGGAA 
TCCAACAACA 
GGTGACGGTC 
CAGCTGCCTG 
TTCTTCCTTG 
CCAGATGGGC 
GGAAGGAACA 
GCAGCATCAG 
GAGACTCAGT 
GGGCT6CTGG 
GCGCCGCCAT 
AAGCCTTAGA 
GCAAGGCCTC 
AGAGACGAGA 
TCAGTCTISAA 
CAAACGCAAA 
ATCCAATATG 
CAAGTACACG 
TCAGAAGAGG 
GGTGGCTGAC 
GTATCTGACT 
GCCCTTCGGC 



31 
I 

CCGTATCTTC 
CCAAAAGCAG 
GCCATCGTCC 
CAACCACCTC 
ATTGACATTG 
CAATACTCAC 
ATGTATGAAA 
CAGAAGACCA 
CAAACTGTGA 
ATCCAGAATG 
CAGGCAAGTG 
AAAAAGCATG 
CCAGAGGAAC 
GATCCAGTAC 
GAAGTGTCCA 
ACACCCTGTT 
GAGATCTTGG 
AAGTTCCTCA 
TGGTCACCTA 
GACCAGATGT 
AAGCCTCAGA 
TTCCTACGAT 
TGTTCGCCCC 
TTGACACTCA 
GCTGTCGGCA 
CTAGGTGGGT 
GAAGCG6CGT 
AGTGCCCCTA 
GCAAACAGCC 
CCTGGCACAA 
CCTGCCTGCC 
GGCCTGTCAG 
AGACGCCAGC 
GGAGTTTCTG 
TCCTC CTAG 



41 
I 

CAGGGACACT 
CCCAACAAGA 
TCACGTATGC 
AACTCACTGA 
CTGAAGATGG 
CTACAACAGA 
TTCAAACCAA 
CATTAAATTC 
TTCCAAAGAA 
GCAGGGAATT 
AGCACACGAA 
ACTCATCAAG 
AAAATAGACC 
TAAAAGAGGA 
CTGGTGTTAA 
GGGTGCCCCG 
TGCTGGTGCC 
CCTCCACGGG 
TGGCCTCCAC 
CATCCAGGGG 
TCCGCATCTC 
GTGAAGT6GA 
CAGACCACGT 
GAGCCCAGGA 
AGAGATACTG 
TCTTGGAAGA 
CCTCAACTCC 
ACCACTACTT 
ATTTCCTGCA 
GGGGGTGGGG 
ACGCACAATG 
GAAGATGTAG 
AGCTGCCGGG 
GATTA7AGGC 



51 
I 

A6ACAAGATG 
GCCCGGCATT 
GCTG6GGATC 
CTCCGCCAGC 
TG6CCAGACA 
TCTTCCTCCA 
ATACCAGTCG 
TAGAAAACCC 
GAGTGGCTCA 
GTTCAAGTCT 
GTCAAAGCAT 
ATCTGAAGAG 
AAATGAGAGG 
AGCCCCAGTT 
GTTTCAAGTT 
CCTGCGAGGA 
AGCCCTCAGC 
CAAACAGAAG 
GACCAATGTC 
CCCGGAATTT 
CATCTGCCAC 
GAGAA6AATC 
CTGTGAGAAA 
AGCCTTTCTT 
CAGGAACAGC 
AAG6AATGCC 
AAGAATCCCG 
TCCCTACCAC 
CCTGTGTGCA 
TGGCCACAAA 
GGAGACATTC 
GGGCCAGCAG 
GGCTCCGGGC 
CAGTCGTGGC 
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Seq ID NO: 227 Protein sequence; 
Protein Accession #: XP_064321.1 



MVASSDQDRA 
GVGITGNTVQ 
liTNGYLPSIS 
PEVKLKITKT 
RKSHKIPKLE 
GDLVWSKVTV 
PTFKGTAQMG 
RPGKEPLRLS 
GPDSRTGSIiR 
YSPTHIIiQSE 
QKQPCPAKYT 
CSCSQDVYLT 



11 
I 

PYLPGTLDKM 
QPPQLTDSAS 
MYEIQTKYQS 
IQNGRBLFKS 
PEEQNRPNER 
TPCWVPRLRG 
WSPMASTTNV 
FLRCEVERRI 
AVGKRYCRNS 
SAPNHYFPYH 
PACHAQWETF 
6VSGLKASRG 



21 
I 

PGPRLRSAQR 
IRQEDAFDNK 
HNQYPNGNSK 
SliCGDUiNEV 
VHTISEKPRB 
RRSHHCSSCL 
SLLLGHWEGT 
SSZiATSQGCW 
QHQRYLIjQGIi 
VSIiSKFLKRK 
RKFHVMAQKR 
PIPHPWVPFG 



31 

I 

PKAAQQEP6I 
IDIAEDGGQT 
QKTTUISRKP 
QASEHTKSKH 
DPVI.KBEAPV 
EIIiVLVPALS 
DQMSSRGPBF 
CSPPDHVCEK 
LGGFLEERNA 
ANSHFLHLCA 
GLSGRCRGQQ 
SS 



41 
I 

EP6TYREGGG 
PYEATIiQQSF 
FPSTATTSVP 



QPILSSVPTT 
LKRSFMVSSIj 
G6RRWVWQHQ 
CIiEDYAGRRH 
NEYDCKIiETR 
WAVRRRSNM 
PPAAPRKVAD 



51 
I 

AIVIjTYALGI 
QYSPTTDIjPP 
QTVIPKKSGS 
KXHDSSRSEE 
EVSTGVKPQV 
KFLTSTGKQK 
KPQIRISICH 
IjTLRAQEAFIi 
EAASSTPRIP 
PGTRGWGGHK 
RRQQLPGAPG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



285 



BNSDOCID: <WO 02079492A2_I_> 



wo 02/079492 



PCT/US02/04915 



Seq ID NO: 228 DNA gequence 

Nucleic Acid Accession #: NM_006033 

Coding sequence: 253-1752 (underlined sequences correspond to start and stop codons) 
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AGCAGC6A6T 
CCGTTGACAC 
CCTTCTCTGC 
GCTGGi\AACA 
TGGCG6GGCA 
TTTGCTGC6G 
AAACCCAAAG 
AAGGACCCAG 
TGCAGTTTCA 
ATCTTTGAAA 
AATGTAGTTG 
AATACCAGGG 
GATTTTTCTC 
TATGCAGGCA 
CCCATGTTTG 
GATGTCCTCC 
GGCCACATTG 
GTCTTGGGAT 
GCCGTCCACC 
TGCACTGACT 
AATA6CATTG 
AAAACCCGGG 
AGTTACAA6A 
GCAGATTCCC 
ACCTTCCTGG 
GAG6GGGCCT 
CCCCGCJ^CC 
CAGCGGAAAC 
GAGCTCTGGT 
GTGGAGCTTC 
TATCCAAGCC 
AGCCTTGACC 
GGAGGGGACT 
TCTGTCCACA 
GCCTCTCGTG 
6AGGAACGCT 
GAGCCTCAGT 
ATCTTGCTCG 
TCTTAGCCAT 
TATTTTTCTG 
ATGTGGTAAT 
TAGACCCTCT 
CT6GGG6A6G 
CTTCTTAGTT 
GTGTTTGATC 
GCCGGTTGCC 
AT6ACATATG 
TCTAAGATTT 
CTTTTCTTGT 
ACACAATATC 
GTAGGGGGAG 
TCAGCCACAT 
TTCCTGGAGG 
AATATGCCCC 
ATCCTAAGCC 
TTATGGCTGA 
TGTACTTAGG 
TTAAACTTCT 
GTGTGCTTCT 
TATCCCTTAT 
TTTGGAATGT 
AAAATTCATT 
TATTTTAAAT 
CATTTTTTGT 
TTTTTGGATT 
TCTGTTTTTA 



11 
I 

CCTTGCCTCC 
TCGCTCCCTG 
CTCCAGTCCC 
CCAAGAGGTG- 
GGATGAGCAA 
GGAGCCCCX3T 
CTACACAGAC 
AGCATGAAGG 
ACATGACAGC 
ACTGGCTGCA 
TGGTTGACTG 
TGGTGGGACA 
TCGGGAATGT 
ACTTCGTGAA 
AAGGGGCCGA 
ACACCTACAC 
ACATCTACCC 
CAATTGCATA 
TCTTTGTTGA 
CCAATCGCTT 
GCTACAATGC 
CAGGCATGCC 
ACATG6GAGA 
AGACTCTGCC 
TCTACACCGA 
CTCAGTCTTG 
CCGGACGGGA 
TGACATTTTG 
TTCGCAAGTG 
CCTGAGGGTG 
CATGGAGGAA 
CTGGAGCACT 
GCGCTGCTAT 
CCTCCAGAGC 
CACACTGGAT 
GGCTCCGAAG 
GAGAAGTCCT 
GGCCCTAGCT 
TCCGTCCTGC 
TTCATTTTTT 
G6ACATATTA 
TTCTGTTTGG 
CTATAGGATA 
ATTATGTGCC 
ACTAGCAGAA 
A6ATATAACT 
TAATACACAT 
AATACAGTGC 
TGAACTTCTT 
TGAGACACTT 
AAGGGCAACT 
GCCTAGACTT 
CAAAGTCTAT 
AGACGTGAGA 
CTTTTATTAA 
GATTCGGGAG 
TTTTCTAAGG 
GAAGACATAA 
CTTGTTTCTG 
CCAAAATGCT 
TTGCATATAC 
GATGTGTCAG 
AGTTGTGTAC 
CTTGTTGGTG 
AGGGTTGCTC 
CTAATGGAAG 



21 
I 

CX3GC6GCTCA 
CCACCGCCCG 
CCAGCCCCTG 

CTCCGTTCCT 
ACCTTTTGGT 
TGAGGTCAAA 
ATGCTACCTC 
TAAAACCTTT 
CAAACTCGTG 
GCTCCCCCTG 
CAGCATTGCC 
CCACTTGATC 
AGGAACGGTG 
CATCCACAAG 
GCGTTCCTTC 
CAATGGGGGT 
TGGAACAATC 
CTCTCTGGTG 
CAAAAAGGGG 
CAAGAJ\AATG 
TTTCAGAGTT 
AATTGAGCCC 
ACTGGAAATA 
6GAG6ACTTG 
GTACAACCTG 
GCTGAATATC 
TACAGAAGAC 
TCGGGATGGC 
CCCGGGCAAG 
AGTTACTGCT 
GGGAACAACT 
AGCTCTTGCT 
ACCAAGTCCA 
TGGTTTCTCA 
AGGCCCTGTG 
TCCGACAGGA 
GTTGGGGTTC 
TCCCCAGCTC 
AATTGAGCAA 
CTGAGCCTCT 
ATGGTGTAT6 
TAAGCATTA6 
ACCTTCTTAG 
TAGCAAGCAG 
GCTTTG6AGC 
CTGTGTACAC 
TTTTTTTCCT 
GGAAAA6CCA 
ACACTTTTCA 
ATTATTATCC 
ATATACTAGT 
CTCTGAAACT 
CAAACAAGGA 
TGTATAACCA 
GAAGTGTGAC 
ACATTGTTTT 
CCAGTTGAGT 
TGATTGCTTT 
TGGAACCAGA 
ATAATGAGAT 
TTACACCTTA 
ATGAAGCATG 
CTCAAAAAGT 
AACCCATATT 
CTTTGCA 



31 
I 

G6ACGAGGGC 
GGCTCCGTGC 
GCC6AGAGAA 
TTTAAAACTT 
CTGCTCTGTT 
CCAGAGGGAC 
CCATCTGTGA 
TCCGTCGGCC 
TTCATCATTC 
TCAGCCCTGC 
GCCCACCAGC 
AGGATGCTCG 
GGCTACAGCC 
GGCCGAATCA 
AGGCTCTCTC 
GGCTTGAGCA 
GACTTCCAGC 
ACAGAGGTGG 
AATCAGGACA 
ATCTGTCTGA 
AGGAACAAGA 
TACCATTATC 
ACCTTTTACG 
GTGGAGCGGA 
GGA6ACCTCT 
TGGAAGGA6T 
AGGCX3CATCC 
CCTGA6AACA 
TGGAGGATGA 
TCTTGCCAGC 
GAGGACCCAC 
GGTCTCCTGT 
GCCTCTCTTG 
GATTTGTGTG 
GTTGCTGGGC 
TAGAAGGCTG 
GCTGACTCAT 
TCATGGGTTG 
ACTCTCTGAA 
ATGTCTATTG 
CCATTTGGAA 
TGTATATGCA 
GGACCCTGAG 
TTATTATGTG 
AGTATCATTC 
AAATCTCTTC 
AGAAACCGGC 
CTTTGAAATA 
CCAATTCTAG 
AAAGATTTGT 
CTATTTTACA 
TAGTGGTGCA 
CCATGAAGAC 
CTTTTTTTTT 
GGAGAACATC 
ACCAAGCAGG 
AATCTGTATC 
CTTATTTCAA 
CTAGCCAAAG 
AGTGTTTCAA 
ATTTTGGGAA 
TCCACATAGC 
GTTTGTGGTA 
TTTGGATTTT 
ATTGGCTGTA 



41 
1 

AGATCTCGTT 
CGCCAAGTTT 
GGGTCTTACC 
CTGTTTCTTG 
TCTGGAGCCT 
GGCTGGAAGA 
G6TTTAACCT 
ACAGCCAGCC 
ACGGATGGAC 
ACACAAGAGA 
TTTACACGGA 
ACTGGCTGCA 
TCGGAGCGCA 
CAGGTTTGGA 
CGGACGATGC 
TTGGTATTCA 
CAGGCTGTGG 
TAAAATGTGA 
AGCCGAGTTT 
GCTGCCGCAA 
G6AACA6CAA 
AGATGAAAAT 
TCACCCTTTA 
TCGAGCAGAA 
TGAAGATCCA 
TTCGCAGCTA 
GGGTGAAGTC 
CCAGCATATC 
AAAACGAAAC 
AAGGCAGCAA 
CCAATGGAAG 
GATGGCTGGG 
AATAGCTCTA 
TAAGCAGCTG 
GAGCCTGTAC 
TCAGCTGCTC 
GTCAGGATGG 
CACTGACCAT 
GCACACATCA 
AACACTTAAA 
CCCAGTGGAG 
TGGGGAAAG6 
GCTTTAAGT6 
CCACCTCCCC 
ATGCTGGGGC 
TGTTTAGAGA 
ACCTGCCAGA 
TTTTACTTTA 
ATCTTGATTT 
GTATGCATTG 
AAACTGAGGC 
GCCAGGGAGA 
TTTTGCAGCC 
TATATAGAGC 
TGTGCCAACG 
AGAGGAAGAA 
GTGCCAAAGT 
GATATGTTCT 
CGAAGCTTGT 
ATTTTAGATT 
TAGGACCCGA 
CTGAGGGTAA 
ACTTATGTGA 
GGAGCATTTC 
CATCCTGGTC 



51 
I 

CTG6GGCAAG 
TCATTTTCCA 
GGCC6GGATT 
GGA6GGGGTG 
CT6CTATTGC 
TAAGCTCCAC 
CCGCACCTCC 
CTTAGAAGAC 
GATGAGCGGT 
GAAAGACGCC 
TGCGGTCAAT 
GGAGAAGGAC 
CGTGGCCGGG 
TCCTGCCGGG 
AGATTTTGTG 
GATGCCTGTG 
ACTCAACGAT 
GCATGAGCGA 
TGCCTTCCAG 
GAACCGTTGT 
AATGTACCTA 
CCATGTCTTC 
TGGCACTAAT 
TGCCACCAAC 
GCTCACCTGG 
CCTGTCTCAA 
TGGGGAAACC 
CCCAGGCCGG 
CAGTCCCACT 
GACTTCCTGC 
GATTCTTCTC 
ACTCCTCGCG 
ACTCCAAACC 
GGTGCCTGGG 
TCTGCCTGAC 
AGCCTGCTTT 
CAGGCCTGGT 
ACTGCTTACG 
TTGGCTTTCC 
ATTAATTAGA 
TTG6GATTTC 
CACCTGGGGC 
GTTTCTATTT 
TATGAGT6AC 
CAGAATGATG 
GATAGAAGTT 
CA6AGCTGGT 
ATACCAGTGC 
GJ^TTAATAC 
CCTAATTAGA 
TTAGTGAGGT 
GGACTCAGAT 
AGTTCCCACC 
CATCCATAAA 
GTTGGACTTT 
TGATTTTCTT 
TGTATCACTG 
CAAGCCAATT 
ACAGGTTGAG 
ATTTTCAGAT 
GCCTAAACAC 
TTTTATACGA 
GGGGTTTTCC 
GGATTTTGGA 
ACTTCTGACT 
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Protein Accession #r NP_006024.1 

1 IX 21 31 41 51 

.1 I I I I 1 

3 MSNSVPLLCF WSLCYCFAAG SPVPPGPEGR LBDKI/HKPKA TQTEVKPSVR FNI*RTSKDPE 60 

HEGCYXiSVGH SQPLEDCSFH MTAKTFPIIH GWTMSGIFEN WI,HKLVSALH TREKDANVW 120 

VDWLPIiAHQL YTDAVNNTRV VGHSIARMLD WLQEKDDFSL GNVHLIGYSL GAHVAGYAGN 180 

PVKGTVGRIT GLDPAGPMFE GADIHKRIiSP DDADFVDVLH TYTRSFGIiSI GIQMPVGHID 240 

lYPMGGDFQP GCGLNDVLGS lAYGTITBW KCEHERAVHIi FVDSLVNQDK PSFAFQCTDS 300 
10 NRFKK6ICLS CRKNRCNSXG YNAKKMRNKR NSKMYLKTRA GMPFRVYKYQ MKXKVFSYKt? 360 

MGEIEPTFYV TI/YGTNADSQ TLPliBIVBRI BQNATNTFLV YTEBDLGDLL KIQLTP»EGAS 420 

QSWYNtiWKEF RSYLSQPRHP GREI^IRRIR VXSGSTQRKIi TFCTEDPBNT SISPGRELWF 480 
RKCRDGWRMK NETSPTVELP 

15 
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1 LA method of detecting an angiogenesis-^associated transcript in a cell in 

2 a patient, the method comprising contacting a biological sample from the patient with a 

3 polynucleotide that selectively hybridized to a sequence at least 80% identical to a sequence 

4 as shown in Tables 1-8. 

1 2. The method of claim 1 , wherein the biological sample is a tissue 

2 sample. 

1 3. The method of claim 1, wherein the biological sample comprises 

2 isolated nucleic acids. 

1 4. The method of claim 3, wherein the nucleic acids are mRNA. 

1 5. The method of claim 3, further comprising the step of amplifying 

2 nucleic acids before the step of contacting the biological sample with the polynucleotide. 

1 6. The method of claim 1 » wherein the polynucleotide comprises a 

2 sequence as shown in Tables 1 -8 . 

1 7. The method of claim 1 , wherein the polynucleotide is labeled. 

1 8. The method of claim 7, wherein the label is a fluorescent label. 

1 9. The method of claim I , wherein the polynucleotide is inmiobilized on 

2 a solid surface. 

1 10. The method of claim 1 , wherein the patient is undergoing a therapeutic 

2 regimen to treat a disease associated with angiongenesis. 

1 II. The method of claim 1 , wherein the patient is suspected of having 

2 cancer. 

1 12. An isolated nucleic acid molecule consisting of a polynucleotide 

2 sequence as shown in Tables 1-8. 

1 13. The nucleic acid molecule of claim 12, which is labeled. 

1 14. The nucleic acid of claim 13, wherein the label is a fluorescent label 

289 

CNSOOCID: <WO ^02079492A2_I_> 



wo 02/079492 



PCT/IIS02/04915 



15. An expression vector comprising the nucleic acid of claim 12. 



1 1 6, A host cell comprising the expression vector of claim 15. 

1 1 7. An isolated polypeptide which is encoded by a nucleic acid molecule 

2 having polynucleotide sequence as shown in Tables 1-8 

1 1 8. An antibody that specifically binds a polypeptide of claim 1 7. 

-1 19. The antibody of claim 18, further conjugated or fused to an effector 

2 component. 

1 20. The antibody of claim 19, wherein the effector component is a 

2 fluorescent label. 

1 21 . The antibody of claim 19, wherein the effector component is a 

2 radioisotope. 

1 22. The antibody of claim 19, which is an antibody firagment. 

1 23. The antibody of claim 1 9, which is a humanized antibody 

1 24. A method of detecting a cell undergoing angiogenesis in a biological 

2 sample from a patient, the method comprismg contacting the biological sample with an 

3 antibody of claim 18. 

1 25. The method of claim 24, wherein the antibody is further conjugated or 

2 fused to an effector component. 

1 26. The method of claim 25, wherein the effector component is a 

2 fluorescent label. 

1 27. The method of detecting antibodies specific to angiogenesis in a 

2 patient, the method comprising contacting a biological sample from the patient with a 

3 polypeptide which is encoded by a nucleotide sequence of Tables 1-8. 
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